Gene EcSMS35_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1117 
Symbol 
ID6142907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1133553 
End bp1134887 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content46% 
IMG OID641615997 
Productanaerobic C4-dicarboxylate transporter 
Protein accessionYP_001743189 
Protein GI170682600 
COG category[R] General function prediction only 
COG ID[COG2704] Anaerobic C4-dicarboxylate transporter 
TIGRFAM ID[TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTA TCACTGTTCT TCAGATAGTG GTTCTGCTGG GGGCAATCTT TTTGGGTGTC 
CGTATGGGCG GGATTGGTAT TGGTTATGCC GGTGGCATTG GGGTATTGAT TCTGGGGCTT
TGTCTGGATA TGAAGCCTGG TAATATCCCC TGGGATGTGA TTCTAATTAT TGCATCGGTA
ATTTCTGCTA TTTCGGCCAT GCAACTGGCG GGAGGACTGG ACTATCTCGT TCAGGTCGCT
GAACGAATTC TTCGTAAAAA TCCAAAATAT ATTAATTACC TGGCTCCGGT TGTCACTTAT
GTGCTGACAA TCTTTGCCGG TACGGGACAT ACTGCATTTT CAATGATCCC TGTAATTGTC
GAAGTTTCCA AAGAACAAAA CATTAAGCCC TCATGTCCGC TCTCAATTGC GGTTGTATCA
TCTCAAATTG CAATTACAGC ATCACCTGTA TCGGCTGCCG TTATCTATAT GTCTGGCGTA
CTGGAAGGCT TTGGGTGGAG TTATCCAGTA TTGCTTGGGA TATGGTTGTT TACCACATTC
GTTGGTTGCA TGCTTACCGC ATTTATAATC AGTCTGATTT CTGACATGAA ATTAGACAAT
GATCCGGTTT ACCGGGAGCG TCTTTCCAAA GGACTCGTAA GCGCGCCAGT AAAGAGTGTT
AACAAACAGC TCAAGCCTTA TGCCAGACGA TCTGTCGCAA TTTTCCTTAT TGGCGTCATC
CTGGTGGTCC TTTACGCTTC GGCCATTAGC CCGACGCTGG GTCTGATTGA TAACGTTGTT
GTTAGTCGTG ATGCGGCTAT TATGAGTCTC ATGCTACTGG TTGGCGGATT CATTACCCTT
TTCTGTAAAG CAGATATAAA CAAGATTGCG GACTCCTCTG TGTTCAAGTC AGGCATGGTT
GCCTGTATCT GTGTACTGGG TGTGGCATGG TTGGGGGACA CTTTTGTATC TGGTCATTCA
GGAGAAATTA AGGAGCTTGC CAGAACTACT GTATCCCAGT ATCCAGCTCT TCTGGCTGTG
GTATTTTTCC TGGCTGCGAT GCTTCTGTAT TCACAGGCTG CAACTGCCAA AGCTATCACT
CCGGCTATTG TGACTGCATT GGGTATTACT GCAGCGAATC CGGATGACAG TTACATGCTG
GTAGCTTCTT TTGCTGCTGT GTCTGCTTTA TTTGTGTTAC CAACTTACCC AACCCTTCTG
GGGGCGGTGC AAATGGATGA CACAGGAACG ACCCGTATTG GTAAGTATGT GTTCAACCAT
GCTTTCTTTA TTCCGGGTGT ACTGGCTATT GCTTTCTCAG TGCTTCTGGG ATTTCTTGTG
GTGAGTATGT TCTGA
 
Protein sequence
MDIITVLQIV VLLGAIFLGV RMGGIGIGYA GGIGVLILGL CLDMKPGNIP WDVILIIASV 
ISAISAMQLA GGLDYLVQVA ERILRKNPKY INYLAPVVTY VLTIFAGTGH TAFSMIPVIV
EVSKEQNIKP SCPLSIAVVS SQIAITASPV SAAVIYMSGV LEGFGWSYPV LLGIWLFTTF
VGCMLTAFII SLISDMKLDN DPVYRERLSK GLVSAPVKSV NKQLKPYARR SVAIFLIGVI
LVVLYASAIS PTLGLIDNVV VSRDAAIMSL MLLVGGFITL FCKADINKIA DSSVFKSGMV
ACICVLGVAW LGDTFVSGHS GEIKELARTT VSQYPALLAV VFFLAAMLLY SQAATAKAIT
PAIVTALGIT AANPDDSYML VASFAAVSAL FVLPTYPTLL GAVQMDDTGT TRIGKYVFNH
AFFIPGVLAI AFSVLLGFLV VSMF