Gene EcHS_A2700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2700 
Symbol 
ID5595290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2715941 
End bp2716885 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content47% 
IMG OID640921817 
Productputative sugar ABC transporter, periplasmic sugar-binding protein 
Protein accessionYP_001459341 
Protein GI157162023 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACGC TATTAGGTAG CGCACTATTT GCCAGGGCTG CGGATAAAGA AATGACCATT 
GGTGCAATAT ACCTTGATAC CCAGGGATAT TACGCTGGAG TGCGTCAGGG CGTTCAGGAT
GCGGCAAAAG ATTCTTCAGT ACAGGTACAG TTAATTGAAA CTAACGCCCA GGGTGATATT
TCGAAAGAAT GTACCTTTGT TGATACCCTC GTGGCGCGTA ATGTCGATGC CATTATTTTA
TCGGCAGTGT CTGAAAATGG CAGTAGCCGT ACCGTTCGTC GCGCCAGTGA AGCGGGTATT
CCGGTGATTT GCTACAACAC CTGTATTAAT CAAAAGGGTG TCGATAAATA TGTCTCGGCG
TATCTGGTCG GCGATCCACT GGAATTTGGT AAAAAACTGG GTAACGCTGC CGCCGATTAT
TTTATTGCCA ATAAAATTGA CCAGCCGAAA ATTGCCGTCA TCAATTGCGA AGCCTTTGAA
GTTTGTGTGC AACGACGTAA AGGGTTTGAA GAAGTATTAA AAGCCCGCGT TCCCGGCGCG
CAAATTGTCG CTAATCAGGA AGGGACTGTT TTAGATAAAG CGATTTCCGT TGGTGAAAAA
CTGATTATCT CCACGCCGGA TCTCAACGCC ATTATGGGGG AATCGGGCGG TGCGACACTC
GGCGCGGTAA AAGCGGTACG TAATCAAAAT CAGGCCGGAA AAATTGCTGT TTTCGGTTCG
GATATGACAA CCGAAATTGC TCAGGAGCTG GAAAACAATC AGGTGCTGAA AGCGGTAGTG
GATATTTCCG GTAAGAAAAT GGGCAATGCT GTTTTCGCGC AAACATTGAA GGTTATCAAT
AAACAAGCCG ACGGTGAAAA AGTGATTCAG GTGCCTATCG ATCTCTATAC CAAAACGGAA
GACGGTAAAC AGTGGCTGGC AACGCACGTT GATGGTCTGC CCTAA
 
Protein sequence
MATLLGSALF ARAADKEMTI GAIYLDTQGY YAGVRQGVQD AAKDSSVQVQ LIETNAQGDI 
SKECTFVDTL VARNVDAIIL SAVSENGSSR TVRRASEAGI PVICYNTCIN QKGVDKYVSA
YLVGDPLEFG KKLGNAAADY FIANKIDQPK IAVINCEAFE VCVQRRKGFE EVLKARVPGA
QIVANQEGTV LDKAISVGEK LIISTPDLNA IMGESGGATL GAVKAVRNQN QAGKIAVFGS
DMTTEIAQEL ENNQVLKAVV DISGKKMGNA VFAQTLKVIN KQADGEKVIQ VPIDLYTKTE
DGKQWLATHV DGLP