Gene EcSMS35_4848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4848 
SymbolgntP 
ID6146146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4954576 
End bp4955919 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content53% 
IMG OID641619652 
Productfructuronate transporter 
Protein accessionYP_001746759 
Protein GI170682985 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG2610] H+/gluconate symporter and related permeases 
TIGRFAM ID[TIGR00791] gluconate transporter 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.774061 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTGC TTAACATTCT CTGGGTGGTA TTCGGCATTG GTCTGATGCT GGTACTGAAT 
TTGAAGTTCA AAATCAATTC AATGGTGGCT TTGTTGGTGG CGGCGCTGTC CGTCGGGATG
CTGGCGGGCA TGGATTTGAT GTCGCTGCTG CACACCATGA AAGCGGGCTT CGGCAACACG
CTGGGGGAAC TGGCTATCAT CGTGGTGTTC GGTGCGGTCA TCGGTAAATT GATGGTCGAC
TCCGGCGCGG CTCACCAGAT AGCACATACG CTGCTGGCGC GTCTCGGTCT GCGCTATGTA
CAGCTGTCGG TGATTATCAT CGGCCTGATT TTCGGTCTGG CGATGTTTTA TGAAGTGGCC
TTTATCATGT TAGCGCCGCT GGTTATTGTT ATTGCCGCCG AAGCTAAAAT TCCGTTCCTG
AAACTGGCGA TCCCGGCAGT AGCAGCTGCC ACTACAGCAC ATTCACTGTT CCCACCGCAG
CCGGGTCCGG TGGCGCTGGT GAATGCTTAT GGCGCGGATA TGGGGATGGT TTATATCTAT
GGCGTACTGG TGACGATCCC AAGTGTAATC TGCGCAGGTC TGATCCTGCC GAAGTTCCTC
GGCAATCTTG AGCGCCCAAC GCCATCATTC CTGAAAGCAG ATCAACCGGT AGATATGAAT
AATCTGCCCT CTTTCGGCGT TTCGATTCTG GTGCCGCTGA TCCCAGCGAT CATTATGATC
TCCACCACCA TCGCCAATAT CTGGCTGGTA AAAGATACCC CTGCCTGGGA AGTGGTTAAC
TTTATCGGTT CCTCGCCGAT TGCAATGTTT ATTGCGATGG TGGTTGCATT CGTACTCTTT
GGCACCGCGC GTGGTCATGA CATGCAGTGG GTGATGAACG CTTTTGAAAG CGCGGTGAAG
AGTATTGCAA TGGTGATTCT GATCATCGGT GCGGGTGGCG TGCTGAAGCA GACCATCATC
GACACCGGCA TTGGCGACAC CATCGGCATG TTGATGTCCC ACGGCAATAT CTCGCCCTAC
ATCATGGCAT GGCTGATCAC TGTGCTAATT CGTCTGGCGA CGGGTCAGGG TGTCGTTTCG
GCGATGACCG CCGCCGGGAT TATCAGTGCT GCAATCCTTG ATCCAGCAAC CGGTCAGCTG
GTTGGCGTGA ATCCGGCGCT GCTGGTACTG GCGACGGCTG CGGGTTCCAA CACCCTCACC
CACATTAATG ATGCCTCATT CTGGCTGTTC AAAGGTTACT TTGACCTGTC GGTAAAAGAC
ACGTTGAAAA CCTGGGGACT GCTGGAGCTG GTCAACTCCG TGGTTGGGCT GATTATTGTG
TTGATTATTA GCATGGTAGC GTAA
 
Protein sequence
MHVLNILWVV FGIGLMLVLN LKFKINSMVA LLVAALSVGM LAGMDLMSLL HTMKAGFGNT 
LGELAIIVVF GAVIGKLMVD SGAAHQIAHT LLARLGLRYV QLSVIIIGLI FGLAMFYEVA
FIMLAPLVIV IAAEAKIPFL KLAIPAVAAA TTAHSLFPPQ PGPVALVNAY GADMGMVYIY
GVLVTIPSVI CAGLILPKFL GNLERPTPSF LKADQPVDMN NLPSFGVSIL VPLIPAIIMI
STTIANIWLV KDTPAWEVVN FIGSSPIAMF IAMVVAFVLF GTARGHDMQW VMNAFESAVK
SIAMVILIIG AGGVLKQTII DTGIGDTIGM LMSHGNISPY IMAWLITVLI RLATGQGVVS
AMTAAGIISA AILDPATGQL VGVNPALLVL ATAAGSNTLT HINDASFWLF KGYFDLSVKD
TLKTWGLLEL VNSVVGLIIV LIISMVA