Gene EcSMS35_1122 details

Gene Information

Locus tag: EcSMS35_1122
Symbol:
ID: 6144736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1140962
End bp: 1142278
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 44%
IMG OID: 641616002
Product: anaerobic C4-dicarboxylate transporter
Protein accession: YP_001743194
Protein GI: 170681956
COG category: [R] General function prediction only
COG ID: [COG2704] Anaerobic C4-dicarboxylate transporter
TIGRFAM ID: [TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein
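
The length fields in this record follow from its coordinates. The sketch below reproduces that arithmetic in Python. It assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence in this record ends in TAA). The variable names are illustrative only and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1140962
end_bp = 1142278

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes the stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1317 bp
print(protein_length, "aa")   # 438 aa
```

Both results agree with the Gene Length and Protein Length fields listed above.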


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.253067
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGACGCTC TATTTTTTAT ACAACTATTT ATTGTTCTGG CCTGTATTGG AATCGGCGGA 
AAATACGGCG GAATGGGGCT TGGTGCTGGA GGTGGGTTAG GGGTTGCAAT ATTGGTTTTG
GGCTTTGGAC TGAAACCCTC CTCTCCCTCC ATTACTGCTA TGTTAATTAT TATTTGTGTT
ATTAGTGCTG TCAGTATATT ACAAGCTGTT GGAGGATTAG ATTATTTAGT CAGGGTCGCA
GAACGGATAC TCCGTAAGAA ACCACAGGCG ATAACCTTTG TAGCGCCAAT ATTAACTTCA
GTATTTACAC TATTTTGTGG GACGACTTAT GTCGCTTTCT CCTTATACCC TGTTATTGCT
GAAGTCGCCG CTGAAGCAAA AGTACGTCCA GAACGTGCAT TATCTGCAAC GGTCATTGCA
GCCAGTGTGG CAGTGGCTGC CAGCCCTATG AGTGCAGCAA CTGCCGGGAT GCTGGCTATA
CTTCATGAGT ATGCAGGGAT CACGTTAGGG CAAATATTAT CTATTGCCCT TCCTTCGTTT
TTTATGGCTG CAATTGTTAC CAGCTTTTCA GTCTACAAGC GTGGTAAAGA ACTGGAAGAT
GATCCTGAAT TTCAAAGGCG TGTTGCAGCA GGTGAGTATG AGTTCATGCA CACCGAACAG
AAAAAAGAAT ATGTCGCTGC ACCAGGTGCA AGAAAAGGGG TTGCTATTTT TGCCATAGGA
GTCGTGCTGG TCCTTATTCT CGGATCTTTT ACTGAGCTTC TTCCGTCGTG GGATGGAAAA
AGATTATCAA CCCCAATGGT CATCCAGATG ATTATGTTGA CTGCAGCTTT ATTGATCATG
ATTGTGGGGA AAGTTCCAAG CAATAAATTT AACAGTGGCT CTGTGTTCCG TGCTGGTTTG
ATGGGGGTTG TGGCAATACT TGGTGTTTCG TGGATGACTG CAACATTTTT TGATGCTTAT
CAGGCAGAGC TGATCAATGT TTTTGGCAAT CTTGTAAATG ATGCGCCTTT GCTGTTTGGC
TTTATCGTAT TCTTGTTTTC TCTGGTGATC ATGAGCCCGG CTGCGACAGT AGCTGCGATT
ATGCCATTAG GAGTCACCCT GGGTATTCCT GCACCTTATC TTATTGCAAT TTTTGCCTGT
ACTTGTGGTG ATTTTATTAT ACCCGGAGCG AATCAGATTG GTTGTGTGGC GTTTGACAGA
ACTGGTACTA CCAGAATTGG GCGCTTTGTT ATCAACCATA GCTATATACG TCCTGGGTTC
GTCATGGTCA TTTCCCAGGT TGTTTTTGCC TACTTAATTG CGCAGGTTAT TCTGTAA
 
Protein sequence
MDALFFIQLF IVLACIGIGG KYGGMGLGAG GGLGVAILVL GFGLKPSSPS ITAMLIIICV 
ISAVSILQAV GGLDYLVRVA ERILRKKPQA ITFVAPILTS VFTLFCGTTY VAFSLYPVIA
EVAAEAKVRP ERALSATVIA ASVAVAASPM SAATAGMLAI LHEYAGITLG QILSIALPSF
FMAAIVTSFS VYKRGKELED DPEFQRRVAA GEYEFMHTEQ KKEYVAAPGA RKGVAIFAIG
VVLVLILGSF TELLPSWDGK RLSTPMVIQM IMLTAALLIM IVGKVPSNKF NSGSVFRAGL
MGVVAILGVS WMTATFFDAY QAELINVFGN LVNDAPLLFG FIVFLFSLVI MSPAATVAAI
MPLGVTLGIP APYLIAIFAC TCGDFIIPGA NQIGCVAFDR TGTTRIGRFV INHSYIRPGF
VMVISQVVFA YLIAQVIL
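
The protein sequence can be checked against the gene sequence by translating it, and a GC fraction can be computed from the same string. The following is a minimal sketch in plain Python, not the method the database itself uses. It assumes the gene sequence is printed in the coding direction, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs only in the start codons it permits). The helper names `clean`, `translate`, and `gc_content` are hypothetical. For brevity only the first and last rows of the gene sequence are included; pasting in all rows works the same way.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGGACGCTC TATTTTTTAT ACAACTATTT ATTGTTCTGG CCTGTATTGG AATCGGCGGA"
last_row = "GTCATGGTCA TTTCCCAGGT TGTTTTTGCC TACTTAATTG CGCAGGTTAT TCTGTAA"

print(translate(first_row))  # MDALFFIQLFIVLACIGIGG
print(translate(last_row))   # VMVISQVVFAYLIAQVIL
```

The two printed strings match the start and the end of the protein sequence listed above. The last row can be translated on its own because the 21 rows before it hold 1260 bases, a multiple of three, so it begins on a codon boundary.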