Gene EcHS_A1433 details


Gene Information

Locus tag: EcHS_A1433
Symbol:
ID: 5590935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1428271
End bp: 1429353
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 49%
IMG OID: 640920588
Product: sugar ABC transporter, ATP-binding protein
Protein accession: YP_001458147
Protein GI: 157160829
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
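
The gene and protein lengths above follow from the start and end coordinates. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    return gene_len_bp // 3 - 1


print(gene_length(1428271, 1429353))  # 1083
print(protein_length(1083))           # 360
```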


Plasmid Coverage information

Num covering plasmid clones: 68
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCAGT TTTCGTTACA ACATATTCAA AAAATCTACG ATAACCAGGT GCATGTGGTG 
AAGGACTTCA ACCTGGAAAT TGCCGATAAA GAGTTCATCG TGTTTGTCGG CCCGTCGGGC
TGCGGTAAGT CGACCACCCT GCGCATGATT GCCGGGCTTG AGGAGATCAG CGGCGGCGAT
CTGTTGATCG ACGGCAAACG AATGAATGAC GTTCCAGCCA AAGCACGCAA TATAGCGATG
GTGTTCCAGA ACTACGCGTT GTATCCGCAT ATGACGGTTT ACGACAACAT GGCGTTTGAT
CTGAAGATGC AAAAAATCGC CAAAGAGGTG ATTGATGAGC GGGTGAACTG GGCGGCGCAA
ATTCTCGGCC TGCGTGAGTA CCTGAAACGT AAGCCGGGGG CGCTTTCCGG CGGGCAACGT
CAGCGAGTGG CGCTTGGGCG GGCGATTGTA CGCGAAGCGG GCGTGTTTTT AATGGATGAA
CCGCTCTCTA ACCTTGATGC CAAGCTGCGC GTGCAAATGC GCGCAGAGAT CAGCAAGCTG
CATCAGAAAC TGAACACCAC CATGATCTAC GTGACCCACG ATCAGACCGA AGCGATGACC
ATGGCGACGC GGATTGTGAT TATGAAAGAC GGGATTGTTC AGCAAGTAGG TGCGCCGAAA
ACCGTTTATA ACCAACCCGC GAATATGTTT GTTTCCGGAT TTATTGGATC ACCAGCGATG
AATTTTATTC GCGGCACGAT CGATGGCGAT AAATTCGTTA CGGAAACGCT TAAATTAACC
ATTCCCGAAG AGAAATTAGC GGTTCTGAAA ACACAGGAAA GTTTGCATAA GCCCATCGTG
ATGGGAATAC GACCGGAAGA TATTCATCCG GACGCGCAAG AGGAAAATAA CATTTCCGCC
AAAATTAGCG TGGCAGAATT AACCGGTGCG GAATTTATGC TCTACACCAC GGTTGGGGGG
CACGAGTTAG TGGTCCGTGC TGGTGCGTTA AATGATTATC ATGCAGGAGA AAATATCACT
ATTCATTTTG ATATGACGAA ATGTCATTTC TTTGATGCAG AAACGGAAAT AGCAATTCGC
TAA
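
A GC content figure such as the one listed under Gene Information is typically the fraction of G and C bases in the sequence. The sketch below shows one way to compute it from space-separated 10-base blocks like those above. The helper name `gc_fraction` is hypothetical, and the rounding used by the source database is not known, so only the first block is shown as an example.

```python
def gc_fraction(dna):
    """Fraction of G/C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10-base block of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_fraction("ATGGCTCAGT"))  # 0.5
```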
 
Protein sequence
MAQFSLQHIQ KIYDNQVHVV KDFNLEIADK EFIVFVGPSG CGKSTTLRMI AGLEEISGGD 
LLIDGKRMND VPAKARNIAM VFQNYALYPH MTVYDNMAFD LKMQKIAKEV IDERVNWAAQ
ILGLREYLKR KPGALSGGQR QRVALGRAIV REAGVFLMDE PLSNLDAKLR VQMRAEISKL
HQKLNTTMIY VTHDQTEAMT MATRIVIMKD GIVQQVGAPK TVYNQPANMF VSGFIGSPAM
NFIRGTIDGD KFVTETLKLT IPEEKLAVLK TQESLHKPIV MGIRPEDIHP DAQEENNISA
KISVAELTGA EFMLYTTVGG HELVVRAGAL NDYHAGENIT IHFDMTKCHF FDAETEIAIR
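
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table in plain Python. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCTCAGT TTTCGTTACA ACATATTCAA"))  # MAQFSLQHIQ
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should give the complete protein.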