Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1753 |
Symbol | trg |
ID | 6142692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1756817 |
End bp | 1758457 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616629 |
Product | methyl-accepting chemotaxis protein III |
Protein accession | YP_001743807 |
Protein GI | 170682410 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 2.79367e-17 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATACAA CTCCCTCACA GCGATTAGGT TTTTTGCATC ACATCAGGCT GGTTCCGTTA TTTGCCTGCA TTCTTGGCGG TATCTTAGTT CTATTCGCAT TAAGTTCTGC TCTGGCTGGC TATTTCCTCT GGCAGGCCGA TCGCGATCAG CGTGATGTTA CTGCGGAGAT TGAGATTCGG ACCGGGTTAG CGAACAGTTC AGATTTTTTG CGTTCAGCCC GGATCAATAT GATTCAGGCC GGGGCAGCGA GTCGTATTGC GGAAATGGAA GCAATGAAGC GAAATATTGC GCAAGCCGAA TCGGAGATTA AACAGTCGCA GCAAGGTTAT CGTGCTTATC AGAATCGACC GGTGAAAACA CCTGCTGATG AAGCCCTCGA CACTGAATTA AATCAACGCT TCCAGGCTTA TATCACGGGT ATGCAACCGA TGTTGAAATA TGCCAAAAAT GGCATGTTTG AAGCGATTAT CAATCATGAA AGTGAGCAGA TCCGACCGCT GGATAATGCT TATACCGATA TTTTGAACAA AGCCGTTAAG ATACGTAGCA CCAGAGCCAA CCAACTGGCG GAACTGGCCC ATCAGCGCAC CCGCCTGGGT GGGATGTTCA TGATTGGCGC GTTTGTGCTT GCCCTGGTGA TGACGCTGAT AACATTTATG GTGCTACGTC GGATCGTCAT TCGTCCACTG CAAAATGCCG CACAACGGAT TGAAAAAATC GCCAGTGGCG ATCTGACGAT GAATGATGAA CCGGCGGGTC GTAATGAAAT CGGTCGCTTA AGTCGTCATT TACAGCAAAT GCAGCATTCA CTGGGGATGA CTGTAGGGAC CGTTCGACAG GGCGCGGAAG AGATTTATCG TGGCACCAGC GAAATTTCAG CTGGCAATGC GGACCTGTCA TCTCGCACCG AAGAACAAGC GGCGGCTATC GAACAAACTG CCGCCAGCAT GGAGCAACTC ACTGCGACGG TGAAACAGAA TGCGGATAAC GCGCATCATG CCAGCAAACT GGCGCAGGAG GCTTCTATTA AAGCCAGCGA TGGCGGGCAG ATGGTTTCCG GTGTAGTAAA AACGATGGGC GCTATCTCCA CGAGTTCGAA GAAAATTTCT GAGATCACCG CCGTCATCAA CAGTATTGCT TTCCAGACGA ATATTCTGGC ACTGAATGCT GCCGTTGAAG CCGCGCGAGC GGGTGAGCAA GGACGTGGAT TTGCCGTTGT CGCCAGCGAA GTACGGACAC TCGCAAGCCG CAGCGCTCAG GCGGCGAAAG AGATTGAAGG CTTGATCAGT GAATCAGTCA GGTTAATTGA CCTGGGGTCG GATGAGGTGG CAACGGCAGG GAAAACCATG AGCACTATTG TTGATGCCGT CGCGAGTGTC ACACATATCA TGCAGGAAAT CGCCGCCGCC TCGGATGAAC AAAGTAGAGG CATAACGCAG GTTAGCCAGG CGATTTCTGA AATGGATAAG GTGACGCAAC AGAATGCTTC TCTGGTAGAA GAGGCCTCAG CGGCGGCGGT GTCCCTTGAA GAACAGGCGG CACGATTAAC TGAGGCGGTG GACGTATTCC GTCTGAACAA ACATTCTGTG TCGGCAGAAC CTCGCGGAGC GGGTGAACCA GTTAGTTTCG CTACGGTGTG A
|
Protein sequence | MNTTPSQRLG FLHHIRLVPL FACILGGILV LFALSSALAG YFLWQADRDQ RDVTAEIEIR TGLANSSDFL RSARINMIQA GAASRIAEME AMKRNIAQAE SEIKQSQQGY RAYQNRPVKT PADEALDTEL NQRFQAYITG MQPMLKYAKN GMFEAIINHE SEQIRPLDNA YTDILNKAVK IRSTRANQLA ELAHQRTRLG GMFMIGAFVL ALVMTLITFM VLRRIVIRPL QNAAQRIEKI ASGDLTMNDE PAGRNEIGRL SRHLQQMQHS LGMTVGTVRQ GAEEIYRGTS EISAGNADLS SRTEEQAAAI EQTAASMEQL TATVKQNADN AHHASKLAQE ASIKASDGGQ MVSGVVKTMG AISTSSKKIS EITAVINSIA FQTNILALNA AVEAARAGEQ GRGFAVVASE VRTLASRSAQ AAKEIEGLIS ESVRLIDLGS DEVATAGKTM STIVDAVASV THIMQEIAAA SDEQSRGITQ VSQAISEMDK VTQQNASLVE EASAAAVSLE EQAARLTEAV DVFRLNKHSV SAEPRGAGEP VSFATV
|
| |