Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1121 |
Symbol | |
ID | 6143228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1139530 |
End bp | 1140846 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641616001 |
Product | anaerobic C4-dicarboxylate transporter |
Protein accession | YP_001743193 |
Protein GI | 170680566 |
COG category | [R] General function prediction only |
COG ID | [COG2704] Anaerobic C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.387894 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTTAT TCGTAATTCA GTTTGCAATA GTACTTACAT GCATTGGGAT AGGTGGGCGC TTTGGTGGAA TCGGCTTAGG GGCGGCGGGG GGATTAGGGC TCGCAATTCT GACTTTTGGT TTTGGAGTCC CGCCTGATTC CCCGCCTATC ACTGTCATTT CCATTATTCT TGCTGTTATC ACATGTATTG CTATCCTGCA GGCAGCGGGA GGTCTTGATC TTTTCGTGAC AATAGCTGAA AAAATACTGC AAAAGAGGCC TGGTGCAATC ACGTTTCTTG GGCCAGCAGT TGCTTATCTT TTTACGGCAA TATGTGGAAC TGGTTATGTC GCTTTTTCCA TTTATCCTGT TATTGCTGAA ATCGCAGCTG ATGCAAGAGT GAGGCCAGAA CGCGCCATGT CAATGTCAGT CATTGCCGCC AACTTTGGTC TTATAGCGAG CCCAGTAAGT GCGGTCGTTA CAGGAACGAT TGCGGTCTTT TCCGGATTAC ATGTTTCTGC TTTAGATATT TTATTGATTA CTGTACCGGG AACTATTTTG GGGTGTCTTG TTGGTTGTTT GTTTGTTTAT AAACGGGGCC ATGATCTTGA AACGGACCCA GAGTTTCAAC GCAGAGTTGC AGAAGGTGAA TTTGAGTCAG TAAAAACTGG GGAACGTACA ATCAGTATTA TATCTAAAAC AGCGAAGAAA GCCCTGATGA TTTTTATTTC TGGGATCATT TTGGTTGTTG TTTTAGGCTC TGTTCCAGAA TTACGTCCGG TATGGAATAC CAGTGCCGGT GTAGAACGGA TGAGTATTCC AACAGCATTG CAAATCATCA TGTTGACTAC AGCATGTATC ATTATGATGG TATGTCGGAT TTCTCCATCG AAACTTGACT CAGGATCCGT TTTTAAGGCC GGTCTGGTTG GAGTTGTTGC CATATTTGGT CTTTCATGGA TGATGAGTTC TTTCTTTGAA GCATGGCAAG ATTTATTTAA TAACACTTTC AATGATTTTC ATAACCCAGT TATATTCGGT GTGATTGTGT TTGTTCTTTC TGCTGTGATT TACAGTCCCG CAGCAACTGC TGTGGCATTA TTTCCTGCTG GCGTATTAAT GGGATATTCG ACTGAGACGC TCATAGCTTT GCTTCCGGTA ACGTGCGGAT CATTCATTAT TCCTGGTGGT GCACAAATTG CTTGCGTGGC GTTTGACAGA ACAGGAACAA CCAGAGTTGG CAAGTATGTA GTAAATCACA GTTATATGTT ACCCGGTCTG ATTACCGTTC TGGCTTCAAC TATCTTCTGC TTCCTTTTTT CCACTATATT AGTGTAA
|
Protein sequence | MMLFVIQFAI VLTCIGIGGR FGGIGLGAAG GLGLAILTFG FGVPPDSPPI TVISIILAVI TCIAILQAAG GLDLFVTIAE KILQKRPGAI TFLGPAVAYL FTAICGTGYV AFSIYPVIAE IAADARVRPE RAMSMSVIAA NFGLIASPVS AVVTGTIAVF SGLHVSALDI LLITVPGTIL GCLVGCLFVY KRGHDLETDP EFQRRVAEGE FESVKTGERT ISIISKTAKK ALMIFISGII LVVVLGSVPE LRPVWNTSAG VERMSIPTAL QIIMLTTACI IMMVCRISPS KLDSGSVFKA GLVGVVAIFG LSWMMSSFFE AWQDLFNNTF NDFHNPVIFG VIVFVLSAVI YSPAATAVAL FPAGVLMGYS TETLIALLPV TCGSFIIPGG AQIACVAFDR TGTTRVGKYV VNHSYMLPGL ITVLASTIFC FLFSTILV
|
| |