Gene EcHS_A4481 details


Gene Information

Locus tag: EcHS_A4481
Symbol:
ID: 5593899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4485157
End bp: 4486113
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 50%
IMG OID: 640923579
Product: putative sugar ABC transporter, periplasmic sugar-binding protein
Protein accession: YP_001461020
Protein GI: 157163702
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
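
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4485157
end_bp = 4486113

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 957 bp
print(protein_length, "aa")  # 318 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.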


Plasmid Coverage information

Num covering plasmid clones: 66
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGAAAC GCTTACTTGT AGTCTCTGCA GTCTCGGCAG CCATGTCGTC TATGGCGTTG 
GCCGCTCCAT TAACCGTAGG ATTTTCGCAG GTCGGATCGG AATCCGGCTG GCGCGCTGCA
GAAACCAATG TGGCGAAAAG TGAGGCCGAA AAACGCGGAA TTACGTTGAA AATTGCCGAT
GGTCAGCAAA AGCAGGAAAA CCAGATTAAA GCGGTACGTT CCTTCGTTGC ACAAGGGGTG
GATGCGATCT TTATCGCTCC AGTGGTTGCG ACGGGTTGGG AACCGGTATT AAAAGAGGCG
AAAGATGCCG AAATCCCGGT CTTCTTGCTT GACCGTTCCA TCGATGTGAA AGACAAATCT
CTCTATATGA CCACCGTCAC TGCCGACAAC ATCCTCGAAG GCAAGTTGAT TGGTGACTGG
CTGGTAAAAG AAGTGAATGG CAAACCATGC AACGTGGTGG AACTGCAGGG CACCGTTGGG
GCCAGCGTCG CCATTGACCG TAAGAAAGGC TTTGCCGAAG CCATTAAGAA TGCGCCAAAT
ATCAAAATCA TCCGCTCGCA GTCAGGTGAC TTCACCCGCA GTAAAGGCAA AGAAGTTATG
GAGAGCTTTA TCAAAGCGGA AAACAACGGC AAAAACATCT GCATGGTTTA CGCCCATAAC
GATGACATGG TGATTGGTGC AATTCAGGCA ATTAAAGAAG CGGGCCTGAA ACCGGGCAAA
GATATCCTCA CGGGTTCCAT TGACGGCGTA CCGGATATCT ATAAAGCGAT GATTGATGGC
GAAGCGAACG CCAGCGTTGA ACTGACGCCG AACATGGCAG GCCCCGCTTT TGACGCGCTG
GAGAAATACA AAAAAGACGG CACCATGCCT GAAAAGCTGA CCCTGACCAA ATCCACCCTT
TATCTGCCTG ATACCGCAAA AGAAGAGTTA GAGAAGAAGA AAAATATGGG GTATTGA
 
Protein sequence
MWKRLLVVSA VSAAMSSMAL AAPLTVGFSQ VGSESGWRAA ETNVAKSEAE KRGITLKIAD 
GQQKQENQIK AVRSFVAQGV DAIFIAPVVA TGWEPVLKEA KDAEIPVFLL DRSIDVKDKS
LYMTTVTADN ILEGKLIGDW LVKEVNGKPC NVVELQGTVG ASVAIDRKKG FAEAIKNAPN
IKIIRSQSGD FTRSKGKEVM ESFIKAENNG KNICMVYAHN DDMVIGAIQA IKEAGLKPGK
DILTGSIDGV PDIYKAMIDG EANASVELTP NMAGPAFDAL EKYKKDGTMP EKLTLTKSTL
YLPDTAKEEL EKKKNMGY
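
The sequence blocks above are printed in groups of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be stripped of whitespace, checked for GC content, and translated. The helper names (clean, gc_content, translate) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1; the alternative start codons of table 11 are not handled here.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above, as printed in the record.
first_line = ("ATGTGGAAAC GCTTACTTGT AGTCTCTGCA "
              "GTCTCGGCAG CCATGTCGTC TATGGCGTTG ")
dna = clean(first_line)
print(len(dna))          # 60
print(translate(dna))    # MWKRLLVVSAVSAAMSSMAL
print(f"{gc_content(dna):.0%}")
```

The translated fragment matches the first twenty residues of the protein sequence above. Passing the full gene sequence through the same functions should give the complete protein and a GC content that can be compared with the GC content field of the record.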