Gene ECH74115_5827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5827 
SymbolgntP 
ID6971027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5478423 
End bp5479766 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content53% 
IMG OID643389454 
Productfructuronate transporter 
Protein accessionYP_002273846 
Protein GI209400656 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG2610] H+/gluconate symporter and related permeases 
TIGRFAM ID[TIGR00791] gluconate transporter 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.469468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.910454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTGC TTAACATTCT CTGGGTGGTA TTCGGCATTG GTCTGATGCT GGTACTGAAT 
TTGAAGTTCA AAATCAATTC AATGGTGGCT TTGTTGGTGG CGGCGCTGTC CGTCGGGATG
CTGGCGGGCA TGGATTTGAT GTCGCTGCTG CACACCATGA AAGCGGGCTT CGGCAACACG
CTGGGGGAAC TGGCTATCAT CGTGGTGTTC GGTGCGGTCA TCGGTAAATT GATGGTCGAC
TCCGGCGCGG CTCACCAGAT AGCGCATACG CTGCTGGCGC GTCTCGGTCT GCGCTATGTA
CAGCTGTCGG TGATTATCAT CGGCCTGATT TTTGGTCTGG CGATGTTCTA TGAAGTGGCC
TTTATCATGT TAGCGCCGCT GGTTATTGTC ATTGCCGCCG AAGCTAAAAT TCCGTTCCTG
AAACTGGCGA TCCCGGCAGT AGCAGCTGCC ACTACCGCAC ATTCACTGTT CCCACCGCAG
CCGGGTCCGG TGGCGCTGGT GAATGCTTAT GGCGCAGATA TGGGGATGGT TTATATCTAT
GGCGTACTGG TGACGATCCC AAGTGTAATC TGCGCAGGTC TGATCCTGCC GAAGTTCCTC
GGCAATCTTG AGCGCCCAAC GCCATCATTC CTGAAAGCAG ATCAACCGGT AGATATGAAC
AATCTGCCCT CTTTCGGCGT TTCGATTCTG GTGCCGCTGA TCCCGGCGAT CATTATGATC
TCTACCACCA TCGCCAATAT CTGGCTGGTA AAAGATACCC CTGCCTGGGA AGTGGTTAAC
TTTATCGGTT CCTCGCCGAT TGCGATGTTT ATTGCGATGG TGGTTGCATT CGTACTCTTT
GGCACCGCGC GTGGTCATGA CATGCAGTGG GTGATGAACG CTTTTGAAAG CGCGGTGAAG
AGTATTGCAA TGGTGATTCT GATCATCGGT GCGGGTGGCG TGCTGAAGCA AACTATCATC
GACACCGGCA TTGGCGACAC CATCGGCATG TTGATGTCCC ACGGCAATAT CTCGCCCTAC
ATTATGGCAT GGCTGATCAC TGTGCTAATT CGTCTGGCGA CGGGTCAGGG TGTCGTTTCG
GCGATGACCG CCGCCGGGAT TATCAGTGCT GCAATCCTTG ATCCAGCAAC TGGTCAGCTG
GTTGGCGTGA ATCCGGCGCT GCTGGTGCTG GCGACGGCTG CGGGTTCCAA CACCCTCACC
CACATTAACG ATGCATCTTT CTGGTTGTTC AAAGGTTACT TTGACCTGTC GGTAAAAGAC
ACGTTGAAAA CCTGGGGTCT GCTGGAGCTG GTCAACTCCG TGGTTGGGCT GATTATTGTG
CTGATTATTA GCATGGTAGC GTAA
 
Protein sequence
MHVLNILWVV FGIGLMLVLN LKFKINSMVA LLVAALSVGM LAGMDLMSLL HTMKAGFGNT 
LGELAIIVVF GAVIGKLMVD SGAAHQIAHT LLARLGLRYV QLSVIIIGLI FGLAMFYEVA
FIMLAPLVIV IAAEAKIPFL KLAIPAVAAA TTAHSLFPPQ PGPVALVNAY GADMGMVYIY
GVLVTIPSVI CAGLILPKFL GNLERPTPSF LKADQPVDMN NLPSFGVSIL VPLIPAIIMI
STTIANIWLV KDTPAWEVVN FIGSSPIAMF IAMVVAFVLF GTARGHDMQW VMNAFESAVK
SIAMVILIIG AGGVLKQTII DTGIGDTIGM LMSHGNISPY IMAWLITVLI RLATGQGVVS
AMTAAGIISA AILDPATGQL VGVNPALLVL ATAAGSNTLT HINDASFWLF KGYFDLSVKD
TLKTWGLLEL VNSVVGLIIV LIISMVA