Gene Information |
Locus tag | EcSMS35_1298 |
Symbol | tar |
ID | 6146189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1286049 |
End bp | 1287710 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616176 |
Product | methyl-accepting chemotaxis protein II |
Protein accession | YP_001743356 |
Protein GI | 170680418 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
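The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table.
start_bp = 1286049
end_bp = 1287710

# Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1662 553
```

Under these assumptions the computed values agree with the listed gene length (1662 bp) and protein length (553 aa).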
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.00038087 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTAACC GTATCCGCGT AGTCACGCTG TTGGTAATGG TGCTGGGGGT ATTTGCACTG TTACAGCTTA TTTCCGGCAG TCTGTTTTTT TCTTCCCTTC ACCATAGCCA GAAGAGCTTT GTGGTTTCCA ATCAATTACG GGAACAGCAG GGCGAGCTGA CGTCAACCTG GGATTTAATG CTGCAAACGC GCATTAACCT GAGTCGTTCA GCGGTGCGGA TGATGATGGA TTCATCCAAT CAGCAAAGTA ATGCCAAAGT TGAATTGCTC GATAGCGCCA GGAAAACATT GGCCCAGGCC GCGACGCATT ATAAAAAATT CAAAAGCATG GCACCGTTAC CTGAAATGGT CGCTACCAGT CGTAATATTG ATGAAAAATA TAAAAACTAT CACACAGCGT TAACTGAACT GATTGATTAT CTTGATTATG GCAATACTGG AGCTTATTTC GCTCAGCCAA CCCAGGGAAT GCAAAATGCA ATGGGCGAAG CGTTTGCTCA GTACGCCCTC AGCAGTGAAA AACTGTATCG CGATATCGTC ACTGACAACG CAGATGATTA CCGATTTGCC CAGTGGCAAC TGGCGGTTAT CGCGCTGGTG GTGATATTGA TTCTGCTGGT GGCGTGGTAC GGCATTCGCC GTATGTTGCT TACACCGCTG GCAAAAATTA TTGCTCACAT TCGCGAAATC GCCGGTGGTA ACCTGGCGAA TACCCTGACC ATTGACGGGC GCAGTGAAAT GGGCGACCTG GCGCAGAGCG TTTCACATAT GCAACGCTCT TTGACTGACA CCGTCACTCA TGTGCGTGAA GGTTCAGATG CCATCTATGC CGGTACCCGT GAAATTGCGG CGGGCAACAC CGATCTTTCC TCCCGTACGG AACAGCAGGC ATCCGCGCTG GAAGAAACTG CCGCCAGCAT GGAGCAGCTC ACCGCGACAG TGAAGCAAAA CGCCGATAAC GCCCGCCAGG CCTCGCAACT GGCGCAAAGT GCCTCCGACA CCGCCCAGCA CGGCGGCAAA GTGGTGGATG GCGTAGTGAA AACGATGCAT GAGATCGCCG ATAGTTCGAA GAAAATTGCC GACATTATCA GCGTTATCGA CGGTATTGCC TTCCAGACTA ATATACTCGC GCTGAATGCC GCTGTTGAAG CCGCGCGAGC GGGTGAACAG GGCCGTGGTT TTGCCGTGGT GGCGGGCGAA GTGCGTAATC TTGCCAGTCG CAGCGCCCAG GCGGCAAAAG AGATCAAAGC CCTCATTGAA GACTCCGTCT CGCGCGTTGA TACCGGTTCG GTGCTGGTCG AAAGCGCCGG GGAAACAATG AACAATATCG TCAATGCTGT CACTCGCGTG ACTGACATTA TGGGCGAGAT TGCATCGGCA TCGGATGAAC AGAGCCGTGG CATCGATCAA GTCGCATTGG CGGTTTCGGA AATGGATCGC GTCACGCAAC AGAACGCATC GCTGGTGCAG GAATCAGCTG CCGCCGCAGC TGCGCTGGAA GAACAGGCGA GTCGTTTAAC GCAAGCGGTT TCCGCGTTCC GTCTGGCAGC CAGCCCACTC ACCAATAAAC CGCAAACACC ATCCCGTCCT GCCAGTGAGC AACCACCGGC ACAGCCACGA CTGCGAATTA CTGAACAAGA TCCAAACTGG GAAACATTTT GA
|
Protein sequence | MINRIRVVTL LVMVLGVFAL LQLISGSLFF SSLHHSQKSF VVSNQLREQQ GELTSTWDLM LQTRINLSRS AVRMMMDSSN QQSNAKVELL DSARKTLAQA ATHYKKFKSM APLPEMVATS RNIDEKYKNY HTALTELIDY LDYGNTGAYF AQPTQGMQNA MGEAFAQYAL SSEKLYRDIV TDNADDYRFA QWQLAVIALV VILILLVAWY GIRRMLLTPL AKIIAHIREI AGGNLANTLT IDGRSEMGDL AQSVSHMQRS LTDTVTHVRE GSDAIYAGTR EIAAGNTDLS SRTEQQASAL EETAASMEQL TATVKQNADN ARQASQLAQS ASDTAQHGGK VVDGVVKTMH EIADSSKKIA DIISVIDGIA FQTNILALNA AVEAARAGEQ GRGFAVVAGE VRNLASRSAQ AAKEIKALIE DSVSRVDTGS VLVESAGETM NNIVNAVTRV TDIMGEIASA SDEQSRGIDQ VALAVSEMDR VTQQNASLVQ ESAAAAAALE EQASRLTQAV SAFRLAASPL TNKPQTPSRP ASEQPPAQPR LRITEQDPNW ETF
|
| |
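The sequences in this record are printed in space-separated groups of 10, so the spaces must be removed before any analysis. As a rough sketch, the code below joins the groups, computes the GC fraction, and translates codons using the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard table in its permitted start codons, not in what each codon encodes). Only the first 30 bases and the last 12 bases of the gene are used, to keep the example short; the function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalize case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
prefix = clean("ATGATTAACC GTATCCGCGT AGTCACGCTG")
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
suffix = clean("GAAACATTTT GA")

print(translate(prefix))    # prints: MINRIRVVTL
print(translate(suffix))    # prints: ETF
print(gc_fraction(prefix))  # prints: 0.5
```

The translated prefix and suffix match the first ten and last three residues of the listed protein sequence. Applied to the full gene sequence, the same helpers could be used to compare against the listed protein and the reported 53% GC content.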