Gene EcSMS35_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1121 
Symbol 
ID6143228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1139530 
End bp1140846 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content43% 
IMG OID641616001 
Productanaerobic C4-dicarboxylate transporter 
Protein accessionYP_001743193 
Protein GI170680566 
COG category[R] General function prediction only 
COG ID[COG2704] Anaerobic C4-dicarboxylate transporter 
TIGRFAM ID[TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.387894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTAT TCGTAATTCA GTTTGCAATA GTACTTACAT GCATTGGGAT AGGTGGGCGC 
TTTGGTGGAA TCGGCTTAGG GGCGGCGGGG GGATTAGGGC TCGCAATTCT GACTTTTGGT
TTTGGAGTCC CGCCTGATTC CCCGCCTATC ACTGTCATTT CCATTATTCT TGCTGTTATC
ACATGTATTG CTATCCTGCA GGCAGCGGGA GGTCTTGATC TTTTCGTGAC AATAGCTGAA
AAAATACTGC AAAAGAGGCC TGGTGCAATC ACGTTTCTTG GGCCAGCAGT TGCTTATCTT
TTTACGGCAA TATGTGGAAC TGGTTATGTC GCTTTTTCCA TTTATCCTGT TATTGCTGAA
ATCGCAGCTG ATGCAAGAGT GAGGCCAGAA CGCGCCATGT CAATGTCAGT CATTGCCGCC
AACTTTGGTC TTATAGCGAG CCCAGTAAGT GCGGTCGTTA CAGGAACGAT TGCGGTCTTT
TCCGGATTAC ATGTTTCTGC TTTAGATATT TTATTGATTA CTGTACCGGG AACTATTTTG
GGGTGTCTTG TTGGTTGTTT GTTTGTTTAT AAACGGGGCC ATGATCTTGA AACGGACCCA
GAGTTTCAAC GCAGAGTTGC AGAAGGTGAA TTTGAGTCAG TAAAAACTGG GGAACGTACA
ATCAGTATTA TATCTAAAAC AGCGAAGAAA GCCCTGATGA TTTTTATTTC TGGGATCATT
TTGGTTGTTG TTTTAGGCTC TGTTCCAGAA TTACGTCCGG TATGGAATAC CAGTGCCGGT
GTAGAACGGA TGAGTATTCC AACAGCATTG CAAATCATCA TGTTGACTAC AGCATGTATC
ATTATGATGG TATGTCGGAT TTCTCCATCG AAACTTGACT CAGGATCCGT TTTTAAGGCC
GGTCTGGTTG GAGTTGTTGC CATATTTGGT CTTTCATGGA TGATGAGTTC TTTCTTTGAA
GCATGGCAAG ATTTATTTAA TAACACTTTC AATGATTTTC ATAACCCAGT TATATTCGGT
GTGATTGTGT TTGTTCTTTC TGCTGTGATT TACAGTCCCG CAGCAACTGC TGTGGCATTA
TTTCCTGCTG GCGTATTAAT GGGATATTCG ACTGAGACGC TCATAGCTTT GCTTCCGGTA
ACGTGCGGAT CATTCATTAT TCCTGGTGGT GCACAAATTG CTTGCGTGGC GTTTGACAGA
ACAGGAACAA CCAGAGTTGG CAAGTATGTA GTAAATCACA GTTATATGTT ACCCGGTCTG
ATTACCGTTC TGGCTTCAAC TATCTTCTGC TTCCTTTTTT CCACTATATT AGTGTAA
 
Protein sequence
MMLFVIQFAI VLTCIGIGGR FGGIGLGAAG GLGLAILTFG FGVPPDSPPI TVISIILAVI 
TCIAILQAAG GLDLFVTIAE KILQKRPGAI TFLGPAVAYL FTAICGTGYV AFSIYPVIAE
IAADARVRPE RAMSMSVIAA NFGLIASPVS AVVTGTIAVF SGLHVSALDI LLITVPGTIL
GCLVGCLFVY KRGHDLETDP EFQRRVAEGE FESVKTGERT ISIISKTAKK ALMIFISGII
LVVVLGSVPE LRPVWNTSAG VERMSIPTAL QIIMLTTACI IMMVCRISPS KLDSGSVFKA
GLVGVVAIFG LSWMMSSFFE AWQDLFNNTF NDFHNPVIFG VIVFVLSAVI YSPAATAVAL
FPAGVLMGYS TETLIALLPV TCGSFIIPGG AQIACVAFDR TGTTRVGKYV VNHSYMLPGL
ITVLASTIFC FLFSTILV