Gene EcE24377A_1527 details


Gene Information

Locus tag: EcE24377A_1527
Symbol:
ID: 5586196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1509486
End bp: 1511753
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 54%
IMG OID: 640925218
Product: glycosyl hydrolase family protein
Protein accession: YP_001462623
Protein GI: 157155852
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
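
The reported gene and protein lengths can be cross-checked against the Start bp and End bp values above. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the reported 2268 bp) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1509486
end_bp = 1511753

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2268 755
```

Both values agree with the Gene Length and Protein Length fields of the record.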


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT 
GCATCGCTGA TGGCGCAGGG GAACGGCTAT CTTGGGCTTC GCGCCAGCCA TGAAGAAGAT
TACACGAGCC AGACGCGAGG GATGTATCTG GCGGGGCTGT ATCATCGGGC GGGAAAAGGT
GAAATCAACG AACTGGTGAA CCTGCCTGAT ATCTTGGGGA TGGAGATTGC CATAAATGGT
GAGGTTTTCT CGTTATCCCG CGAAGCCTGG CAGCGTGAAC TTGACTTTGC CAGTGGAGAA
TTACGCCGTA GCGTTGTCTG GCGTACCAGC AACGGCACAG GTTACACCAT CGCCAGCCGT
CGCTTTGTTT CGGCAGACCA ACTGCCGCTC ATTGTGCTGG AAATCACTAT TACGCCACTG
GACGCCGACG CGTCAGTGCT GATTTCAACA GGTATCGACG CCACGCAAAC CAACCACGGT
CGCCAACATC TCGACGAAAC CCAGGTGCGG GTGTTTGGTC AGCATCTGAT GCAGGGGATC
TACACCACCC AGGATGGACG CAGTGATGTG GCCATCAGCT GTTGCTGTAT GGTGAGCGGT
GATGTGCAGC AATGCTATAC CGCCAAAGAG CGCCGTTTGC AGCAACATAC CAGTGCGCAG
CTTCATGCAG GCGAGACAGT GACGTTGCAA AAACTGGTGT GGATCGACTG GCGGGATGAC
AGGCAAGCCG TTTTAGACGA GTGGGGCAGC GCGTCGCTTC GCCAGCTTGA AATGTGCGCG
CAGCAGAGTT ACGACCAACT TCTTGCAGTA TCAACAGAAA ACTGGCGTCA ATGGTGGCAG
AAACGTCGTA TCACGGTAAA TGGCGGCGAA GCGCACGATC AGCAAGCGTT AGATTATGCG
CTTTATCATC TGCGCATCAT GACGCCTGCC CACGACGAGC GCAGCAGCAT TGCGGCAAAA
GGCTTAACCG GCGAAGGCTA CAAAGGCCAC GTTTTCTGGG ATACAGAAGT ATTTTTGTTA
CCGTTTCATC TGTTTAGCGA TCCGACGGTT GCCCGAAGTT TACTGCGTTA TCGCTGGCAC
AACTTGCCAG GCGCGCAGGA GAAAGCGCGA CGCAACGGCT GGCAGGGCGC GCTATTTCCG
TGGGAAAGCG CGCGCAGCGG CGAAGAAGAG ACGCCGGAAT TTGCCGCCAT TAACATTCGC
ACCGGGCTGC GGCAAAAAGT GGCCTCGGCG CAGGCGGAAC ATCATCTGGT GGCCGATATC
GCCTGGGCGG TTATTCAATA CTGGCAGACC ACGGGGGATG AAAGTTTCAT TGCGCATGAA
GGCATGGCGC TACTTCTGGA AACTGCAAAG TTCTGGATTA GCCGCGCGGT GAGGGTTAAC
GACCGTCTGG AAATTCATGA TGTTATTGGG CCTGACGAAT ATACCGAACA TGTCAATAAT
AACGCCTTCA CCAGCTATAT GGCGTATTAC AACGTCCAGC AGGCGCTGAG TATTGCCCGC
CAGTTTGGCT GTAGCGACGA TGCGTTTATC CATCGCGCCG AAATGTTCCT TAAAGAACTG
CGGCTGCCAG AAATTCAGCC CGACGGCGTT TTGCCGCAGG ATGATTCGTT TATGGCTAAG
CCGGCGATTA ATCTGGCGAA ATACAAAGCG GCGGCGGGGA AGCAAACCAT TCTGCTGGAT
TATTCACGCG CAGAAGTGAA CGAGATGCAA ATCCTCAAAC AAGCTGATGT GGTGATGCTC
AATTACATGC TGCCGGAGCA GTTCTCAGCG GCATCGTGTC TTGCCAATCT GCAATTTTAT
GAACCGCGCA CTATTCACGA CTCGTCATTA AGTAAAGCAA TCCACGGCAT TGTTGCCGCA
CGCTGTGGCC TGCTGACCCA AAGTTATCAG TTCTGGCGCG AGGGGACTGA AATCGATCTT
GGTGCTGATC CGCATAGTTG TGATGATGGT ATCCATGCTG CCGCAACTGG CGCTATCTGG
CTGGGGGCGA TTCAGGGTTT TGCCGGGGTG AGCGTGCGTG ACGGTGAATT GCATCTCAAT
CCGGCGTTAC CTGAGCAGTG GCAACAGTTG TCTTTCCCTC TGTTCTGGCA GGGCTGCGAA
TTACAGGTCA CTCTTGACGC GCAGCGTATT GCGATTCGAA CTTCTGCGCC CGTTTCACTG
CGTTTGAACG GTCAGCTTAT ATCCGTGGCT GAAGAATCTG TTTTCTGTTT GGGTGATTTT
ATTTTGCCCT TCAATGGGAC CGCTACCACA CATCAGGAGG ATGAATGA
 
Protein sequence
MTRPVTLSEP HFSQHTLNKY ASLMAQGNGY LGLRASHEED YTSQTRGMYL AGLYHRAGKG 
EINELVNLPD ILGMEIAING EVFSLSREAW QRELDFASGE LRRSVVWRTS NGTGYTIASR
RFVSADQLPL IVLEITITPL DADASVLIST GIDATQTNHG RQHLDETQVR VFGQHLMQGI
YTTQDGRSDV AISCCCMVSG DVQQCYTAKE RRLQQHTSAQ LHAGETVTLQ KLVWIDWRDD
RQAVLDEWGS ASLRQLEMCA QQSYDQLLAV STENWRQWWQ KRRITVNGGE AHDQQALDYA
LYHLRIMTPA HDERSSIAAK GLTGEGYKGH VFWDTEVFLL PFHLFSDPTV ARSLLRYRWH
NLPGAQEKAR RNGWQGALFP WESARSGEEE TPEFAAINIR TGLRQKVASA QAEHHLVADI
AWAVIQYWQT TGDESFIAHE GMALLLETAK FWISRAVRVN DRLEIHDVIG PDEYTEHVNN
NAFTSYMAYY NVQQALSIAR QFGCSDDAFI HRAEMFLKEL RLPEIQPDGV LPQDDSFMAK
PAINLAKYKA AAGKQTILLD YSRAEVNEMQ ILKQADVVML NYMLPEQFSA ASCLANLQFY
EPRTIHDSSL SKAIHGIVAA RCGLLTQSYQ FWREGTEIDL GADPHSCDDG IHAAATGAIW
LGAIQGFAGV SVRDGELHLN PALPEQWQQL SFPLFWQGCE LQVTLDAQRI AIRTSAPVSL
RLNGQLISVA EESVFCLGDF ILPFNGTATT HQEDE
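
The gene and protein sequences above can be checked against each other, and the GC content recomputed, with a few lines of Python. This is a hedged sketch rather than the database's own method. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in its permitted start codons). Only the first line of the gene sequence is used here for brevity.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT"
print(translate(first_line))  # prints: MTRPVTLSEPHFSQHTLNKY
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field of the record.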