Gene EcHS_A1425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1425 
Symbol 
ID5592338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1419359 
End bp1420651 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content50% 
IMG OID640920580 
Productputative sugar ABC transporter, periplasmic sugar-binding protein 
Protein accessionYP_001458139 
Protein GI157160821 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAT CAAAAATCGT GCTGTTATCA GCACTGGTTT CATGCGCCCT GATTTCAGGC 
TGTAAAGAAG AAAATAAAAC GAATGTATCC ATCGAATTTA TGCATTCTTC GGTGGAGCAG
GAGCGCCAGG CCGTTATCAG TAAATTGATT GCCCGTTTTG AAAAAGAAAA CCCTGGCATC
ACCGTTAAGC AAGTGCCCGT GGAAGAAGAT GCCTATAACA CTAAAGTCAT TACTCTTTCA
CGTAGCGGTT CGCTGCCGGA AGTGATCGAA ACCAGCCATG ACTACGCCAA AGTGATGGAC
AAAGAGCAGC TTATCGATCG CAAAGCGGTT GCCACAGTCA TCAGCAACGT TGGTGAAGGC
GCGTTTTACG ATGGCGTACT GCGTATTGTG CGTACCGAAG ATGGTAGCGC ATGGACCGGT
GTTCCTGTCA GCGCCTGGAT TGGCGGTATC TGGTATCGCA AAGATGTGCT GGCAAAAGCG
GGGCTTGAGG AGCCGAAAAA CTGGCAACAG CTGCTGGACG TTGCACAGAA ACTGAATGAC
CCGGCGAATA AAAAATACGG CATTGCGCTG CCTACAGCAG AAAGCGTGTT GACGGAACAA
TCCTTCTCCC AGTTTGCGTT ATCCAACCAG GCTAACGTCT TTAACGCCGA AGGCAAAATC
ACCCTTGATA CACCAGAGAT GATGCAGGCA CTGACCTATT ACCGCGACCT TGCTGCCAAC
ACTATGCCGG GTTCTAACGA CATCATGGAA GTGAAAGACG CCTTTATGAA CGGCACCGCG
CCGATGGCGA TTTACTCCAC CTATATCCTT CCGGCTGTGA TTAAAGAAGG CGACCCGAAA
AACGTCGGTT TCGTGGTGCC AACCGAGAAA AACTCTGCGG TCTACGGCAT GTTGACCTCG
CTGACCATTA CCGCCGGGCA AAAGACCGAA GAGACGGAAG CAGCAGAAAA ATTTGTCACC
TTTATGGAGC AGGCAGACAA CATTGCCGAC TGGGTGATGA TGTCGCCAGG TGCTGCGCTG
CCGGTGAATA AAGCGGTGGT GACTACCGCC ACCTGGAAAG ACAACGACGT TATTAAGGCG
CTGGGTGAAC TACCGAATCA GCTAATCGGT GAACTGCCAA ATATTCAGGT TTTTGGCGCA
GTAGGGGATA AAAACTTTAC CCGCATGGGT GATGTGACGG GTTCTGGCGT GGTGAGTTCA
ATGGTGCATA ACGTCACCGT GGGTAAAGCC GATCTCTCTA CTACGCTGCA AGCGAGCCAG
AAAAAACTGG ATGAACTGAT CGAACAGCAC TAA
 
Protein sequence
MIKSKIVLLS ALVSCALISG CKEENKTNVS IEFMHSSVEQ ERQAVISKLI ARFEKENPGI 
TVKQVPVEED AYNTKVITLS RSGSLPEVIE TSHDYAKVMD KEQLIDRKAV ATVISNVGEG
AFYDGVLRIV RTEDGSAWTG VPVSAWIGGI WYRKDVLAKA GLEEPKNWQQ LLDVAQKLND
PANKKYGIAL PTAESVLTEQ SFSQFALSNQ ANVFNAEGKI TLDTPEMMQA LTYYRDLAAN
TMPGSNDIME VKDAFMNGTA PMAIYSTYIL PAVIKEGDPK NVGFVVPTEK NSAVYGMLTS
LTITAGQKTE ETEAAEKFVT FMEQADNIAD WVMMSPGAAL PVNKAVVTTA TWKDNDVIKA
LGELPNQLIG ELPNIQVFGA VGDKNFTRMG DVTGSGVVSS MVHNVTVGKA DLSTTLQASQ
KKLDELIEQH