Gene EcHS_A1431 details


Gene Information

Locus tag: EcHS_A1431
Symbol:
ID: 5591895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1425334
End bp: 1427601
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 54%
IMG OID: 640920586
Product: glycosyl hydrolase family protein
Protein accession: YP_001458145
Protein GI: 157160827
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
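
The length fields above follow from the coordinates by simple arithmetic. The sketch below illustrates that relationship under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1425334, 1427601)
print(length)                  # 2268
print(protein_length(length))  # 755
```

Both values agree with the Gene Length and Protein Length fields of this record.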


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT 
GCATCGCTGA TGGCGCAGGG GAACGGCTAT CTTGGGCTTC GCGCCAGCCA TGAAGAAGAT
TACACGCGCC AGACGCGAGG GATGTATCTG GCGGGGCTGT ATCATCGGGC GGGAAAAGGT
GAAATCAACG AACTGGTGAA CCTGCCTGAT GTCGTGGGGA TGGAGATTGC CATAAATGGT
GAGGTTTTCT CGTTATCCCA CGAAGCCTGG CAGCGTGAGC TTGACTTTGC CAGTGGCGAA
TTACGCCGCA ATGTTGTCTG GCGTACCAGC AACGGCTCAG GTTACACCAT CGCCAGCCGT
CGCTTTGTTT CGGCAGACCA ACTACCGCTC ATTGCGCTGG AAATCACTAT TACGCCACTG
GACGCCGACG CGTCAGTGCT GATTTCAACA GGCATTGACG CCACGCAAAC CAATCACGGT
CGCCAACATC TCGACGAAAC CCAGGTGCGG GTGTTTGGTC AGCATCTGAT GCAGGGGAGC
TACACCACCC AGGATGGACG CAGTGATGTG GCCATCAGCT GTTGCTGTAA GGTGAGCGGT
GATGTGCAGC AATGCTATAC CGCCAAAGAG CGCCGTTTAC TGCAACATAC CAGTGCGCAG
CTTCATGCAG GCGAGACAAT GACGTTGCAA AAACTGGTGT GGATCGACTG GCGGGATGAC
AGGCAAGCTG CTTTAGACGA GTGGGGCAGC GCGTCGCTTC GCCAGCTTGA AATGTGCGCG
CAGCAGAGTT ACGACCAACT TCTTGCAGCA TCAACAGAAA ACTGGCGTCA ATGGTGGCAG
AAACGTCGTA TCACGGTAAA TGGCGGCGAA GCGCACGATC AGCAAGCGTT AGATTATGCG
CTTTATCATC TGCGCATCAT GACGCCTGCC CACGACGAGC GCAGCAGCAT TGCGGCAAAA
GGCTTAACCG GCGAAGGCTA CAAAGGCCAC GTTTTCTGGG ATACAGAAGT ATTTTTGTTA
CCGTTTCATC TGTTTAGCGA TCCGACGGTT GCCCGAAGTT TACTGCGTTA TCGCTGGCAC
AACTTGCCAG GCGCGCAGGA GAAAGCGCGA CGCAACGGCT GGCAGGGCGC GCTATTTCCG
TGGGAAAGCG CGCGCAGCGG CGAAGAAGAG ACGCCGGAAT TTGCCGCCAT TAACATTCGC
ACCGGGCTGC GGCAAAAAGT GGCCTCGGCG CAGGCGGAAC ATCATCTGGT GGCCGATATC
GCCTGGGCGG TTATTCAATA CTGGCAGACC ACGGGGGATG AAAGTTTCAT TGCGCATGAA
GGCATGGCGC TACTTCTGGA GACGGCAAAG TTCTGGATTA GCCGCGCGGT GAGAGTTAAC
GATCGTCTGG AAATTCATGA TGTTATTGGG CCAGACGAAT ATACCGAACA TGTCAATAAT
AATGCATACA CCAGCTATAT GGCCCGCTAC AACGTTCAAC AGGCGCTGAA TATTGCCCGC
CAGTTCGGCT GTAGCGACGA TGCGTTTATC CATCGCGCCG AAATGTTCCT CAAAGAGCTA
TGGATGCCAG AAATTCAGCC CGACGGCGTT TTGCCGCAGG ATGATTCGTT TATGGCTAAG
CCGGCGATTA ATCTGGCGAA ATACAAAGCG GCGGCGGGGA AGCAAACCAT ACTGCTGGAT
TATTCACGCG CAGAAGTGAA CGAGATGCAG ATCCTCAAAC AAGCTGATGT GGTGATGCTC
AATTACATGC TGCCGGAGCA GTTCTCAGCG GCATCGTGTC TTGCCAATCT GCAATTTTAT
GAACCGCGCA CTATTCACGA CTCGTCATTA AGTAAAGCAA TCCACGGCAT TGTTGCCGCA
CGCTGTGGCC TGCTGACCCA AAGTTATCAG TTCTGGCGCG AGGGGACTGA AATCGATCTT
GGTGCTGATC CGCATAGTTG TGATGATGGT ATCCATGCTG CCGCAACTGG CGCTATCTGG
CTGGGGGCGA TTCAGGGTTT TGCCGGGGTG AGCGTGCGTG ACGGTGAATT GCATCTCAAT
CCGGCGTTAC CTGAGCAGTG GCAACAGTTG TCTTTCCCTC TGTTCTGGCA GGGCTGCGAA
TTACAGGTCA CTCTTGACGC GCAGCGTATT GCGATTCGAA CTTCTGCGCC CGTTTCACTG
CGTTTGAACG GTCAGCTTAT AACCGTGGCT GAAGAATCTG TTTTCTGTTT GGGTGATTTT
ATTTTGCCCT TCAATGGGAC CGCTACCAAA CATCAGGGGG ATGAATGA
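
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any statistic such as GC content can be recomputed. The following is a minimal sketch of that cleaning step and a GC percentage calculation. The helper names are hypothetical, and the sketch makes no claim about how the database itself derives or rounds its GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a possibly blocked sequence string."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.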
 
Protein sequence
MTRPVTLSEP HFSQHTLNKY ASLMAQGNGY LGLRASHEED YTRQTRGMYL AGLYHRAGKG 
EINELVNLPD VVGMEIAING EVFSLSHEAW QRELDFASGE LRRNVVWRTS NGSGYTIASR
RFVSADQLPL IALEITITPL DADASVLIST GIDATQTNHG RQHLDETQVR VFGQHLMQGS
YTTQDGRSDV AISCCCKVSG DVQQCYTAKE RRLLQHTSAQ LHAGETMTLQ KLVWIDWRDD
RQAALDEWGS ASLRQLEMCA QQSYDQLLAA STENWRQWWQ KRRITVNGGE AHDQQALDYA
LYHLRIMTPA HDERSSIAAK GLTGEGYKGH VFWDTEVFLL PFHLFSDPTV ARSLLRYRWH
NLPGAQEKAR RNGWQGALFP WESARSGEEE TPEFAAINIR TGLRQKVASA QAEHHLVADI
AWAVIQYWQT TGDESFIAHE GMALLLETAK FWISRAVRVN DRLEIHDVIG PDEYTEHVNN
NAYTSYMARY NVQQALNIAR QFGCSDDAFI HRAEMFLKEL WMPEIQPDGV LPQDDSFMAK
PAINLAKYKA AAGKQTILLD YSRAEVNEMQ ILKQADVVML NYMLPEQFSA ASCLANLQFY
EPRTIHDSSL SKAIHGIVAA RCGLLTQSYQ FWREGTEIDL GADPHSCDDG IHAAATGAIW
LGAIQGFAGV SVRDGELHLN PALPEQWQQL SFPLFWQGCE LQVTLDAQRI AIRTSAPVSL
RLNGQLITVA EESVFCLGDF ILPFNGTATK HQGDE
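
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below shows one way to reproduce that in plain Python. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons), and it stops at the first stop codon. The function name `translate` and the table construction are illustrative, not the database's own method.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (standard assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(text: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACCAGGC CAGTAACGTT ATCAGAACCC"))  # MTRPVTLSEP
```

The output matches the first ten residues of the protein sequence listed above.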