Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0939 |
Symbol | |
ID | 6145601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 955827 |
End bp | 957047 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641615826 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001743018 |
Protein GI | 170682065 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.456123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAAAAG CACTAAACAA ACTGAGCGAT TCGACGTTAA AAAAATTGGC GGCTGTCCAG GCAGAAAAAG AGCGTTTTTA CTCCGACGGG GGCGGGCTGG AGATTAAACA CTCAAAGGGC GGCAAATTAA CCTGGTATTT CCGGTACCGA ACGGGAGGCC GTGAGGTTGC CGCAGAGCGG TTAAAGCTGG GGGCTTACCC TGAATTGTCG CTGAAAGCCG CAAGGGAAAA ACGCACACTG TGCCGGGCAT GGCTGGCTGA GGGTAAAAAC CCCCGTTATG AGTTGTGCGC CACAGTACAG GAAGCACTAA AACCCGTTAC GGTGAAGGAA GCGATTAACT ACTGGCTGGA GGAATACGCG AAGGATAACC GCAAAGATTA CATAAAGCTG GTGCAGCGTA TGGATAAGCA CATCATTTGC CATATTGGGG CAATTCCCCT TGATAAGTGT GATACAAGGC AGTGGATCGC ATGTTTTGAC CGCGTACGAA AAAAAGCACC AGTAGCAGCG GGCCATGTCA TGCAGACATG CAAACAGGCG CTAAAGTTTT GCCGCAGGCG GCGCTACGCG TTTAGCAACG CCCTGGACGA TTTGATCGTT ACTGATGTGG GTAAGAGAGC AGAAATCCGC GAGAGAGTGC ACAGCAACAG CGAACTAAAA GAAATTCTAC GCGCTATTGA TGGTGATGTG TTCGCTCCCT ATTACAGTGC GTTAATGCGC TTGTTAATTG TGTTCGGGTG CAGAACGGCA GAGATCAGAC TTTCAGAGAT CAAAGAATGG GATCTGAAAG AAATGTTGTG GACAGTGCCA AAAGAGCACA GCAAAACGAA GGTAACAATA TTCCGACCTA TTCCTGATGG TATTTTGCCG TTCATTCAGA AGCTGGTGGA GCAAAACGCA CACACTGGGT TATTACTCGG CGAACTGAAA AAAGATACCA CGGTGGCGCA ATATGGACGA AATGCGCATA AGCGGCTTAA GCAGGAACAC TGGACGCTGC ATGATTTCCG ACACACGTTT ACAACTATGC TGAATGATTT AGGTGTCGAT CCGCATATCG TGGAGCACAT CACAGCGCAT CAGATGCCAG GTCAGCAAAA AACCTATAAC CATTCACGCT ATTTGCAGGC GAAACGGGAC GCACTGAATC TATGGGTTGA GCGTCTTGAT ATGATTGCAG GATATAATGA AAATATTGTG ATATTGAGAG GGATACAATG A
|
Protein sequence | MGKALNKLSD STLKKLAAVQ AEKERFYSDG GGLEIKHSKG GKLTWYFRYR TGGREVAAER LKLGAYPELS LKAAREKRTL CRAWLAEGKN PRYELCATVQ EALKPVTVKE AINYWLEEYA KDNRKDYIKL VQRMDKHIIC HIGAIPLDKC DTRQWIACFD RVRKKAPVAA GHVMQTCKQA LKFCRRRRYA FSNALDDLIV TDVGKRAEIR ERVHSNSELK EILRAIDGDV FAPYYSALMR LLIVFGCRTA EIRLSEIKEW DLKEMLWTVP KEHSKTKVTI FRPIPDGILP FIQKLVEQNA HTGLLLGELK KDTTVAQYGR NAHKRLKQEH WTLHDFRHTF TTMLNDLGVD PHIVEHITAH QMPGQQKTYN HSRYLQAKRD ALNLWVERLD MIAGYNENIV ILRGIQ
|
| |