Gene EcSMS35_1114 details

Gene Information

Locus tag: EcSMS35_1114
Symbol:
ID: 6145118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1128517
End bp: 1129878
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 48%
IMG OID: 641615994
Product: anaerobic C4-dicarboxylate transporter
Protein accession: YP_001743186
Protein GI: 170679599
COG category: [C] Energy production and conversion
COG ID: [COG3069] C4-dicarboxylate transporter
TIGRFAM ID: [TIGR00771] c4-dicarboxylate anaerobic carrier family protein
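
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the numbers agree with that reading) and that the CDS ends in a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1128517, 1129878)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Both values match the Gene Length (1362 bp) and Protein Length (453 aa) fields of this record.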


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAAG TTGTTATCGC GCTTCTTGTT ATCGTGATAG TTGCTCGCTT AATCCTGAAA 
GGATATCGTG CAGAACCCGT GCTTTTTATT GCCGGGCTTG CGCTGATGGT TTGTACCTGG
TTTACCGGTT GGGGCACAGT GTTACCGAAG GGCGTGTCCG GTACAGGAAT TAGTTTTCTT
GATCCTTTCG AAGTCATGCG CAATTTATTC AGTACCCGTG CGGCTGATCT CGGTCTGATG
ATTATGACCC TGATGGGATT TGCGCATTAC ATGGACCATA TCGGTGCGAA TGAAGCGGTT
GTCCGTGTGG TAACCCGGCC ATTACGGACA TTACGTTCCC CCTATGTTCT GCTTTTTTTC
TCTTATCTTT TCGCCAGTCT TCTTCAACTG GCTATCCCCT CAGCTACCGG ACTTGCTGTT
TTACTGATGG GAACCATGTT TCCTATTATG CTCGGACTCG GGCTGTCAGC AGCGTCTGCC
GCAGGGGTAA TTGCCACCTC TCTTGGTGTG GCGTACACAC CGACAGCCAT TGACGCCATA
CGAGGCTCTG AAGCGGTAAA TATGGATGTG GTGGAGTATG TTGTCTATCA TCAGGGGCCT
GCTGCTCTGG CGACTGTTCT GATTGTCGGT ATCAGCCACT TTTTCTGGCA GAAACACTGT
GATCGTAAAG CTGGAACATT ACCTCATGAA ATTGGTACGA CAACGGTCAT AAAAGCAGGC
AGTACGCCGG CTTATTATGC TCTGCTACCT ATGCTACCGA TCCTGATGGC CGTTGGCTCA
TCGGAGATTT TTGTCACCGG AATTAATCTC AATATCATTA CGATTGTTCT TATCTCAATG
GCAATTTGCA TGCTGATCGA ATGGGTCCGT AAATGTGATC TGAAAGCTGT ATGTGACGGG
TTTACCCATT TCCTGAAAGG AATGGGGACA GCATTTACCG GCGTGGTGGG CTTACTGGTG
GCCGCAGGTG TGTTTGCTCA TGGGATTAAA AGCATTGGCG CGATAGATCA ATTAATTTTG
ATGGCAGAGC ACGTTGGGTT ACCCCCTTTT GCGATGGGCA TCGTTTTTGC TCTGGTTACA
CTGGCTGCAG CTGTCATTAT GGGATCGGGC AATGCGCCAT TTCTGGCTTT TGTTGAACTG
ATACCACAAA TCGCTGCCAG CATGGGCGTG AATGCCATAT CGATGATCCT GCCTATGCAG
CAGGCTTCCC ATATGGGGCG TGCGATGTCT CCCGTATCTG GCGTGGTTAT AGCGGTTTCC
AGTGGAGCAA ATATAACTCC CTTCGAAGTG GTAAAACGCA CTGCTTTACC CCTGATAGTT
GGTTTTGTTT TTCACTCTGC GATTATCGGT ATTTTTTATT AA
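
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. As a minimal sketch, the hypothetical helpers below strip the whitespace and compute the GC fraction, the quantity reported in the GC content field. Only the first line of the sequence is used here for brevity; pasting the full sequence into the same function gives the figure for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C in a block-formatted sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGATTAAAG TTGTTATCGC GCTTCTTGTT ATCGTGATAG TTGCTCGCTT AATCCTGAAA"
print(len(clean_sequence(first_line)))   # 60
print(f"{gc_content(first_line):.0%}")   # GC fraction of this line only
```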
 
Protein sequence
MIKVVIALLV IVIVARLILK GYRAEPVLFI AGLALMVCTW FTGWGTVLPK GVSGTGISFL 
DPFEVMRNLF STRAADLGLM IMTLMGFAHY MDHIGANEAV VRVVTRPLRT LRSPYVLLFF
SYLFASLLQL AIPSATGLAV LLMGTMFPIM LGLGLSAASA AGVIATSLGV AYTPTAIDAI
RGSEAVNMDV VEYVVYHQGP AALATVLIVG ISHFFWQKHC DRKAGTLPHE IGTTTVIKAG
STPAYYALLP MLPILMAVGS SEIFVTGINL NIITIVLISM AICMLIEWVR KCDLKAVCDG
FTHFLKGMGT AFTGVVGLLV AAGVFAHGIK SIGAIDQLIL MAEHVGLPPF AMGIVFALVT
LAAAVIMGSG NAPFLAFVEL IPQIAASMGV NAISMILPMQ QASHMGRAMS PVSGVVIAVS
SGANITPFEV VKRTALPLIV GFVFHSAIIG IFY
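
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is an illustration, not the database's own pipeline: it builds the codon table from the compact NCBI layout (bases in the order T, C, A, G) and stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard assignments are used. The function name `translate` is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGATTAAAG TTGTTATCGC GCTTCTTGTT"))  # MIKVVIALLV
```

The output matches the first block of the protein sequence above; the final codons of the gene (`ATT TTT TAT TAA`) likewise translate to the closing `IFY` followed by a stop.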