Gene Information |
Locus tag | EcSMS35_1089 |
Symbol | |
ID | 6146743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1104532 |
End bp | 1105830 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641615974 |
Product | anaerobic C4-dicarboxylate transporter |
Protein accession | YP_001743166 |
Protein GI | 170683393 |
COG category | [R] General function prediction only |
COG ID | [COG2704] Anaerobic C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTTGGC TGGAAATTAT CGTTGTACTT GGCGCAATAT TTTTTGGTAT TCGTCAGGGA GGAATCGGCA TTGGTTTATG CGGCGGTCTT GGCCTTGCGA TCCTGACTCT GGGATTTGGT CTGCCTATGG GATCACCGCC AGTCGATGTG ATCCTGATTA TTATGACCGT AGTTGTGGCA GCTTCAGCCT TACAGGCGGC AGGTGGGATG GATTATCTGG TACGTTTAGC CAGTAATTTC ATGCGACGTA ACCCAAAATA TATCAACATA ATCGCACCGA TTATCACTTG GTTAATGACC ATTATGGCCG GTACCGGGTT TATTGTTTTC TCCACACTGC CGGTAATTGC TGAAGTAGCA AAAGAGTCCG GTATCCGGCC TTCCCGCACA CTCGCTGGCT CAGTTGTCGC CTCACAGGTT GCAATTTCTG GCTCACCAAT CAGTGCCGCA ATGGCAGCTA TGCTCACTAT TATGGAGGTT AATGGCGTCA GTTTTATTCA GGTGATGTCG GTATGCCTTC CCACCTCTTT TGTCGCCGCG ATGGTTGCTG CTTTTATCGC CTCCCGACAG GGGTGTGAAT TACAAGATGA CGAAGTATAT CTTGAACGTC TGCAAAAAGG TCTGGTGCAA AAATACGAGA ATAATAACAG CATCAAACCT GGTGCAACGC TTTCCGTCGG ATTATTCATG CTGGCGACGA TCGCCATTGT TATTCTTGCT GCATTCCCCC AATTGCGTCC GGGTTTTGAT ATCAGCAAAC CGATGGAAAC GCGCGATATT ATTATTATTT GCATGCTCTC TGCTGCCTGC CTGATGGTAA TACTGTGTAA GATGTCTACT GATGACATTA TTCTGACCTC CACATTTCGT GCTGGAATGA GTTCGCTGGC AGTGATTCTT GGTATTGTAA CCTTAGGTAC CACCTTTATT GATGCGCATT TAACTGAGAT CAAAGATATC GCTGGTGATA TTTTACAAAC TTATCCGATG CTTTTAGCAT TAGTTCTTTT TTTTACCTGC GCTTTACTTT ACTCACAGGG AGCGACTACG CCGTTAATTA TTCCTCTGGC TGTAGCACTC AACATACCAA CCTGGGCAAT TCTTGCATCT TATGTTGCCG TTACGGGAGT ATTCGTGCTT CCAACGTATC CAACATCACT CGCCGCGATG GAATTTGATA CTACCGGCAC AACCCGTGTG GGCAATTATT TATTAAATCA CCCATTTATG CTACCCGGGT TGGGTGGTGT TATTGCAGGA GTGACATTTG GGTTTGTGAT TGCACCAATG ATGGTCTGA
|
Protein sequence | MVWLEIIVVL GAIFFGIRQG GIGIGLCGGL GLAILTLGFG LPMGSPPVDV ILIIMTVVVA ASALQAAGGM DYLVRLASNF MRRNPKYINI IAPIITWLMT IMAGTGFIVF STLPVIAEVA KESGIRPSRT LAGSVVASQV AISGSPISAA MAAMLTIMEV NGVSFIQVMS VCLPTSFVAA MVAAFIASRQ GCELQDDEVY LERLQKGLVQ KYENNNSIKP GATLSVGLFM LATIAIVILA AFPQLRPGFD ISKPMETRDI IIICMLSAAC LMVILCKMST DDIILTSTFR AGMSSLAVIL GIVTLGTTFI DAHLTEIKDI AGDILQTYPM LLALVLFFTC ALLYSQGATT PLIIPLAVAL NIPTWAILAS YVAVTGVFVL PTYPTSLAAM EFDTTGTTRV GNYLLNHPFM LPGLGGVIAG VTFGFVIAPM MV
|
| |
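The coordinate and length fields in this record are related by simple arithmetic, and a short check can confirm they agree. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon, neither of which the record states explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above (Start bp, End bp).
start_bp, end_bp = 1104532, 1105830

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp, length_aa)  # prints "1299 432"
```

Both results match the Gene Length (1299 bp) and Protein Length (432 aa) fields. The strand does not affect this check because only the span of the feature is used.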