Gene ECH74115_0826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0826 
Symbol 
ID6972161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp848762 
End bp850042 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content49% 
IMG OID643384851 
Producttransporter, dicarboxylate/amino acid:cation family 
Protein accessionYP_002269357 
Protein GI209398852 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA TAAGTTTAAC CACGATGATT CTTTTGGCGC TGGTACTTGG AATGATTATC 
GGCGTAGTGC TCAATAACAC TGCTTCACCG GAAACCGCAA AACTCTATGC GCAAGAAATA
TCGATATTCA CGACGATTTT CTTACGACTG ATAAAAATGA TTATCGCTCC GTTAGTGGTC
TCTACCCTGG TGGTAGGTAT TGCTAAAATG GGCGATGCCA AAGCCCTTGG TCGTATTTTT
TCTAAAACAC TCTTTTTATT TATTTGCGCC TCATTGCTGT CAATTGCCTT AGGCTTGATA
ACGGTAAATT TCTTCATGCC AGGTACAGGA ATTAATTTTG TTGCACACGG TGCCGAAACC
ACCGGAGTGG TCGCGGCAGA ACCCTTTACG CTAAAAGTAT TTATTTCGCA TGCTTTCCCC
ACCAGCATTG TCGATGCCAT GGCGCACAAT GAAATTTTGC AAATCGTGGT GTTCTCAATT
TTCCTCGGCT GTAGCCTGAC GGCGATTGGC GAGAAAGGCA GCGCCATCGT TCACGCCTTA
GATTCGCTGG CACATGCCAT GTTAAAGCTC ACTGGCTACG TCATGCTCTT CGCTCCCCTG
ACCGTATTCG CTGCTATTTC AGCATTGATT GCTGAACGAG GACTGGCAGT TATGGTGAGC
GCCGGGATCT TTATGGGTGA ATTTTATTTC ACCATGTTAT TACTTTGGGT ACTGCTTATC
GGTCTGGCCA TCGTTTATGT CGGCCCCTGC ATCAGACGCC TGACCCGCGC CCTTTCGGAA
CCCGCCCTGC TGGCATTTAC CACATCCAGT TCTGAAGCGG CTTTTCCGGG AACGCTTGAA
AAACTGGAGC AATTTGGCGT TTCCCCCAAA ATTGCCAGCT TTGTCTTACC CATTGGCTAC
TCATTTAATC TCGTTGGATC AATGGCCTAC TGCTCCTTCG CCACAGTTTT CATCGCCCAG
GCCTGCAATA TCCATTTATC CATCGGTGAG CAAATCACCA TGCTGTTGAT CCTGATGTTG
ACCTCGAAAG GAATGGCTGG CGTACCACGC GCCTCAATGG TGGTTATCGC CGCCACGCTC
AACCAGTTCA ATATTCCGGA AGCGGGGCTG ATCTTGCTGA TGGGCGTTGA TCCGTTCCTT
GATATGGGGC GTTCCGCGAC AAACGTCATG AGCAACGCAA TGGGCGCTGC GATGGTGAGT
CGGTGGGAAG GCGAACATTT CGGCGAGGGC TGTCGGGGTA AAGCATTAAA ACCCAATGAA
TCGAACGTTG CTCTGCCCTG A
 
Protein sequence
MKKISLTTMI LLALVLGMII GVVLNNTASP ETAKLYAQEI SIFTTIFLRL IKMIIAPLVV 
STLVVGIAKM GDAKALGRIF SKTLFLFICA SLLSIALGLI TVNFFMPGTG INFVAHGAET
TGVVAAEPFT LKVFISHAFP TSIVDAMAHN EILQIVVFSI FLGCSLTAIG EKGSAIVHAL
DSLAHAMLKL TGYVMLFAPL TVFAAISALI AERGLAVMVS AGIFMGEFYF TMLLLWVLLI
GLAIVYVGPC IRRLTRALSE PALLAFTTSS SEAAFPGTLE KLEQFGVSPK IASFVLPIGY
SFNLVGSMAY CSFATVFIAQ ACNIHLSIGE QITMLLILML TSKGMAGVPR ASMVVIAATL
NQFNIPEAGL ILLMGVDPFL DMGRSATNVM SNAMGAAMVS RWEGEHFGEG CRGKALKPNE
SNVALP