Gene EcolC_2309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2309 
Symbol 
ID6068708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2546480 
End bp2548747 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content54% 
IMG OID641601712 
ProductKojibiose phosphorylase 
Protein accessionYP_001725271 
Protein GI170020317 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1554] Trehalose and maltose hydrolases (possible phosphorylases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT 
GCATCGCTGA TGGCGCAGGG GAACGGCTAT CTTGGGCTTC GCGCCAGCCA TGAAGAAGAT
TACACGCGCC AGACGCGAGG GATGTATCTG GCGGGGCTGT ATCATCGGGC GGGAAAAGGT
GAAATCAACG AACTGGTGAA CCTGCCTGAT GTCGTGGGGA TGGAGATTGC CATAAATGGT
GAGGTTTTCT CGTTATCCCA CGAAGCCTGG CAGCGTGAGC TTGACTTTGC CAGTGGCGAA
TTACGCCGCA ATGTTGTCTG GCGTACCAGC AACGGCTCAG GTTACACCAT CGCCAGCCGT
CGCTTTGTTT CGGCAGACCA ACTGCCGCTC ATTGCGCTGG AAATCACTAT TACGCCACTG
GACGCCGACG CGTCAGTGCT GATTTCAACA GGCATTGACG CCACGCAAAC CAATCACGGT
CGCCAACATC TCGACGAAAC CCAGGTGCGG GTGTTTGGTC AGCATCTGAT GCAGGGGAGC
TACACCACCC AGGATGGACG CAGTGATGTG GCCATCAGCT GTTGCTGTAA GGTGAGCGGT
GATGTGCAGC AATGCTATAC CGCCAAAGAG CGCCGTTTAC TGCAACATAC CAGTGCGCAG
CTTCATGCAG GCGAGACAAT GACGTTGCAA AAACTGGTGT GGATCGACTG GCGGGATGAC
AGGCAAGCTG CTTTAGACGA GTGGGGCAGC GCGTCGCTTC GCCAGCTTGA AATGTGCGCG
CAGCAGAGTT ACGACCAACT TCTTGCAGCA TCAACAGAAA ACTGGCGTCA ATGGTGGCAG
AAACGTCGTA TCACGGTAAA TGGCTGCGAA GCGCACGATC AGCAAGCGTT AGATTATGCG
CTTTATCATC TGCGCATCAT GACGCCTGCC CACGACGAGC GCAGCAGCAT TGCGGCAAAA
GGCTTAACCG GCGAAGGCTA CAAAGGCCAC GTTTTCTGGG ATACAGAAGT ATTTTTGTTA
CCGTTTCATC TGTTTAGCGA TCCGACGGTT GCCCGAAGTT TACTGCGTTA TCGCTGGCAC
AACTTGCCAG GCGCGCAGGA GAAAGCGCGA CGCAACGGCT GGCAGGGCGC GCTATTTCCG
TGGGAAAGCG CGCGCAGCGG CGAAGAAGAG ACGCCGGAAT TTGCCGCCAT TAACATTCGC
ACCGGGCTGC GGCAAAAAGT GGCCTCGGCG CAGGCGGAAC ATCATCTGGT GGCCGATATC
GCCTGGGCGG TTATTCAATA CTGGCAGACC ACGGGGGATG AAAGTTTCAT TGCGCATGAA
GGCATGGCGC TACTTCTGGA GACGGCAAAG TTCTGGATTA GCCGCGCGGT GAGAGTTAAC
GATCGTCTGG AAATTCATGA TGTTATTGGG CCAGACGAAT ATACCGAACA TGTCAATAAT
AATGCATACA CCAGCTATAT GGCCCGCTAC AACGTTCAAC AGGCGCTGAA TATTGCCCGC
CAGTTCGGCT GTAGCGACGA TGCGTTTATC CATCGCGCCG AAATGTTCCT CAAAGAGCTA
TGGATGCCAG AAATTCAGCC CGACGGCGTT TTGCCGCAGG ATGATTCGTT TATGGCTAAG
CCGGCGATTA ATCTGGCGAA ATACAAAGCG GCGGCGGGGA AGCAAACCAT ACTGCTGGAT
TATTCACGCG CAGAAGTGAA CGAGATGCAG ATCCTCAAAC AAGCTGATGT GGTGATGCTC
AATTACATGC TGCCGGAGCA GTTCTCAGCG GCATCGTGTC TTGCCAATCT GCAATTTTAT
GAACCGCGCA CTATTCACGA CTCGTCATTA AGTAAAGCAA TCCACGGCAT TGTTGCCGCA
CGCTGTGGCC TGCTGACCCA AAGTTATCAG TTCTGGCGCG AGGGGACTGA AATCGATCTT
GGTGCTGATC CGCATAGTTG TGATGATGGT ATCCATGCTG CCGCAACTGG CGCTATCTGG
CTGGGGGCGA TTCAGGGTTT TGCCGGGGTG AGCGTGCGTG ACGGTGAATT GCATCTCAAT
CCGGCGTTAC CTGAGCAGTG GCAACAGTTG TCTTTCCCTC TGTTCTGGCA GGGCTGCGAA
TTACAGGTCA CTCTTGACGC GCAGCGTATT GCGATTCGAA CTTCTGCGCC CGTTTCACTG
CGTTTGAACG GTCAGCTTAT AACCGTGGCT GAAGAATCTG TTTTCTGTTT GGGTGATTTT
ATTTTGCCCT TCAATGGGAC CGCTACCAAA CATCAGGAGG ATGAATGA
 
Protein sequence
MTRPVTLSEP HFSQHTLNKY ASLMAQGNGY LGLRASHEED YTRQTRGMYL AGLYHRAGKG 
EINELVNLPD VVGMEIAING EVFSLSHEAW QRELDFASGE LRRNVVWRTS NGSGYTIASR
RFVSADQLPL IALEITITPL DADASVLIST GIDATQTNHG RQHLDETQVR VFGQHLMQGS
YTTQDGRSDV AISCCCKVSG DVQQCYTAKE RRLLQHTSAQ LHAGETMTLQ KLVWIDWRDD
RQAALDEWGS ASLRQLEMCA QQSYDQLLAA STENWRQWWQ KRRITVNGCE AHDQQALDYA
LYHLRIMTPA HDERSSIAAK GLTGEGYKGH VFWDTEVFLL PFHLFSDPTV ARSLLRYRWH
NLPGAQEKAR RNGWQGALFP WESARSGEEE TPEFAAINIR TGLRQKVASA QAEHHLVADI
AWAVIQYWQT TGDESFIAHE GMALLLETAK FWISRAVRVN DRLEIHDVIG PDEYTEHVNN
NAYTSYMARY NVQQALNIAR QFGCSDDAFI HRAEMFLKEL WMPEIQPDGV LPQDDSFMAK
PAINLAKYKA AAGKQTILLD YSRAEVNEMQ ILKQADVVML NYMLPEQFSA ASCLANLQFY
EPRTIHDSSL SKAIHGIVAA RCGLLTQSYQ FWREGTEIDL GADPHSCDDG IHAAATGAIW
LGAIQGFAGV SVRDGELHLN PALPEQWQQL SFPLFWQGCE LQVTLDAQRI AIRTSAPVSL
RLNGQLITVA EESVFCLGDF ILPFNGTATK HQEDE