Gene EcSMS35_0746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0746 
Symbol 
ID6144114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp753570 
End bp754850 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content49% 
IMG OID641615635 
Productdicarboxylate/amino acid:cation family protein 
Protein accessionYP_001742834 
Protein GI170680220 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.579753 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA TAAGTTTAAC CACGATGATT CTTTTGGCGC TGGTACTTGG AATGATTATC 
GGCGTAGTGC TCAATAACAC TGCTTCACCG GAAACCGCAA AACTCTATGC GCAAGAAATA
TCGATATTCA CGACGATTTT CTTACGACTG ATAAAAATGA TTATCGCTCC GTTAGTGGTC
TCTACCCTGG TGGTAGGTAT AGCTAAAATG GGAGATGCCA AAGCCCTTGG TCGTATTTTT
TCTAAAACAC TCTTTTTATT TATTTGCGCC TCATTGCTGT CAATCGCCTT AGGCTTGATA
ACAGTAAATT TCTTCATGCC AGGCACAGGA ATTAATTTTG TTGCACACGG AGCCGAAACC
ACCGGAGTGG TCGCGTCAGA ACCCTTTACG CTAAAAGTAT TTATTTCGCA TGCTTTCCCC
ACCAGCATTG TCGATGCCAT GGCGCACAAT GAAATTTTGC AAATCGTGGT GTTCTCAATT
TTCCTCGGCT GTAGCCTGAC GGCGATTGGT GAGAAAGGCA GCGCCATCGT TCACGCCTTA
GATTCGCTGG CACATGCCAT GTTAAAGCTC ACTGGCTACG TCATGCTCTT CGCTCCCCTG
ACCGTATTCG CCGCTATTTC AGCATTGATT GCTGAACGAG GACTGGCAGT TATGGTGAGC
GCCGGGATCT TTATGGGTGA ATTTTATTTC ACCATGTTGT TACTTTGGGT GCTGCTTATC
GGTCTGGCCA TCGTTTATGT CGGCCCCTGC ATCAGACGCC TGACCCGTGC CCTTTCGGAA
CCCGCCCTGC TGGCATTTAC CACATCCAGT TCTGAAGCGG CTTTTCCGGG AACGCTTGAA
AAACTGGAGC AATTTGGCGT TTCCCCCAAA ATTGCCAGCT TTGTCTTACC CATTGGCTAC
TCATTTAATC TCGTTGGATC AATGGCCTAC TGCTCCTTCG CCACAGTTTT CATCGCCCAG
GCCTGCAATA TCCATTTATC CATCGGTGAG CAAATCACCA TGCTGTTGAT CCTGATGTTG
ACCTCGAAAG GAATGGCTGG CGTACCACGC GCCTCAATGG TGGTTATCGC CGCCACGCTC
AACCAGTTCA ATATTCCGGA AGCGGGGCTG ATCTTGCTGA TGGGCGTTGA TCCGTTCCTT
GATATGGGGC GTTCCGCGAC AAACGTCATG AGCAACGCAA TGGGCGCTGC GATGGTGAGT
CGGTGGGAAG GCGAACATTT CGGCGAGGGC TGTCGGGGTA AAGCATTAAA ACCCAATGAA
TCGAACGTTG CTCTGCCCTG A
 
Protein sequence
MKKISLTTMI LLALVLGMII GVVLNNTASP ETAKLYAQEI SIFTTIFLRL IKMIIAPLVV 
STLVVGIAKM GDAKALGRIF SKTLFLFICA SLLSIALGLI TVNFFMPGTG INFVAHGAET
TGVVASEPFT LKVFISHAFP TSIVDAMAHN EILQIVVFSI FLGCSLTAIG EKGSAIVHAL
DSLAHAMLKL TGYVMLFAPL TVFAAISALI AERGLAVMVS AGIFMGEFYF TMLLLWVLLI
GLAIVYVGPC IRRLTRALSE PALLAFTTSS SEAAFPGTLE KLEQFGVSPK IASFVLPIGY
SFNLVGSMAY CSFATVFIAQ ACNIHLSIGE QITMLLILML TSKGMAGVPR ASMVVIAATL
NQFNIPEAGL ILLMGVDPFL DMGRSATNVM SNAMGAAMVS RWEGEHFGEG CRGKALKPNE
SNVALP