Gene Information |
Locus tag | Acid345_3051 |
Symbol | |
ID | 4071958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3623528 |
End bp | 3626212 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985070 |
Product | glycogen/starch/alpha-glucan phosphorylase |
Protein accession | YP_592126 |
Protein GI | 94970078 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0058] Glucan phosphorylase |
TIGRFAM ID | [TIGR02093] glycogen/starch/alpha-glucan phosphorylases |
|
|
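The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither convention is stated in the record. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length_bp = gene_length(3623528, 3626212)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (2685 bp) and Protein Length (894 aa) fields listed in the record.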
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.501405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAC CGCTCGGGGT AAAAGTGGCG TATCTGTATC GTTGGTCGAA CGAAACAGGT GAGCCCCGCG CGAAACGGGA AGGACCTCTA AGAAGGACCT CTCAAAGCTC GGCCGTTACA GGCCTGGAGT CCATGGCTTC CTCTTCGAAC TTACTGCCGA ACGTTCGGCC TGCCGAGCTG CTTCGCGAAG CGATTTACCG TCATATCCGG TACACATTGG TTCTGCGCAA TCCGCGCCTT CTGGGGCCGG TTGAGCTGCT GACGCCCGTG TCGCTTGCCG TCCGCGATCG CATTGTGGAC CGCATGATCG AGACCGAAGA GCGCGTCCGC AGTAAAGACT CGAAGCGCCT GTACTACCTT TCCATGGAGT TTCTGATGGG AAGGTCCCTG AACGATAACC TGCACAACCT TGGCATTACG GAACTGATGC GCGAGGTGCT CGCGAGCATA GGGATGTCGT TGGACGACGT CCTGGCGTGC GAACTCGATG CTGGTCTCGG GAACGGTGGC TTGGGTCGTC TTGCGGCGTG CTTCCTGGAG TCTCTGGCGA CCTTGGGAAT GCCGGGGTAT GGGTATGGCA TTGACTACGA GTACGGATTG TTCCGGCAGG AGATCAACGG TGGCTTCCAG CGCGAGAAGC CGGACCGCTG GAAGGCCAAC GGCACACCGT TCGAGATTGG ACGCCCGGAT CGCGCGCTGA GCATTCCCCT TTATGGCCGC GTAGAGACCT CACGTGACAA CCACGGCAAT CTGCGGCAGA TCTGGACGAG CCAGAAATTC GTGCTCGGCA TTCCAAGCGA TATGCCGGTG GTCGGATGGG GCGGCGAGAC CGTCAATTTC CTGCGACTGT TCGCAGCGCG CGCTTCGGAA GATTTCGACA TTGAGATCTT TAACCGCGGC GATTACATAC GCGCCGTCGA GCAGAAGATC GGCAACGAGA ACATTTCGCG CGTGCTGTAT CCGTCGGACT CGGTGATGTC GGGCAAGGAA CTGCGATTGG TGCAGGAGTA CTTCCTGGTC GCATGTGCCA TCGGCGACCT GGTGCGCCGC TACGATCTCG ATCACAAAGG CTACGAGCAG TTCCCGTCGA AGGTCGCGAT CCAGATGAAC GATACGCATC CGAGCCTTGC GGTCGCGGAG TTGATGCGCG TGTTCATTGA CGAGAAGGGG CTGGAGTGGG AACAGGCGTG GGAACTGACC CAAGCCACCT GCGCGTACAC CAACCACACG TTGCTGCCGG AAGCATTGGA ACGCTGGTCG GTATCGCTGA TGGAGCGCGT GCTGCCGCGG CACATGCAGC TTATCTACGG AATCAATCAC CAGTTCCTGC GTAGCATTTC GGAATATTCC TTCACCGATC CCGACTTGAT GCGGCGGACC TCCATCATCG AGGAAGCCGC GGACAAGCAA GTGCGCATGG CGCACCTGGC GATCATCGGA AGCCATGCCG TGAATGGCGT GGCTGCATTG CATAGCGAGC TCGTCAAATC CACGCTGGTG CCGGATTTTG CGAAGTTGTG GCCGGAGCGT TTCAGCAATA AGACAAACGG CGTGGCCCCA CGGCCGTGGC TGCACAAATC GAATCCCGGG CTCGCTGCGC TTCTCACAGA AACCATAGGT GAGCGATGGG TCACTGATCT CTCGCTGCTG CGGGGGCTCC AGAAGTTTGC CGATGACGCG GCGTTCCGCG CGAAATTCCG TGAAGTAAAA CAGCACAACA AAAACCGGCT GGCGAAGGTC ATCTTCGACC AGACTGGCGT GCAGGTCTAT ACGTCGTCGC TATTCGATGT GCAGATCAAA 
CGCATTCACG AGTACAAGCG GCAACTGCTG AACGTGATGC GCATTATCGA TCAGTATCTG CAATGCGTGG ACCATGGTGT GGAGATCAAG GTCCCGCGAA CGTTTGTGTT TGCAGGTAAA GCGGCACCGG GATACTGGGC AGCGAAACAG ATCATCAAGC TCATCCACAA CGTAGCGTCA GTGGTGAACA GCGATCCGCG GATGAAAGAC CGGATCAAGG TTGCGTTCTT GCCGGACTAT CGAGTCTCGC TGGCTGAGAT CATCATTCCG GCCGCCGACG TCAGCGAGCA GATTTCGACC GCAGGCATGG AAGCCTCCGG TACCGGCAAC ATGAAGCTCT CGATGAACGG AGCGCTGACC GTCGGAACCT ACGATGGCGC CAACATCGAG ATCCTTGAGG AAGTCGGGGA AGAGAATTTT TATCTCTTCG GATTGCGGGT AGAGCAGATC GAGGAGATGA AGCGCAAGGG CCTCTACAAC GCGCAGGAAT ACTACGCCAA GAACGAGCGC ACCAAGGCCG TAATGGATTC CCTGGGCAGC GATCGCTTCT CACCGCAGGA GCCTGGGTTA TTCCGCTGGA TTGTGGACGA AATTCTTTAT CGCGGCGATC GTTATTTCCA TCTTGCGGAC TTGCCCTCGT ACGTCGAGAT CAATGAGTCG GTGGACGAGG ACTACCTCGA CCAGGAGCTA TGGTCGCGCA AAGCTGCGAT CAATGTCGCG CGAATCGGGA AATTCTCGAG CGATCGCACG ATCCTCGAAT ACGCGCGAGA TATCTGGCAC ATCGGGCCAT TCGAGCAGCC GTGGGTTCCC AAACCCAAAC CAGAGGCCAT TCTCACAGCA GCGCCGCCGG CATCTGCGTC AGAGTCCAGC TCCGTCGTCG GATAG
|
Protein sequence | MNAPLGVKVA YLYRWSNETG EPRAKREGPL RRTSQSSAVT GLESMASSSN LLPNVRPAEL LREAIYRHIR YTLVLRNPRL LGPVELLTPV SLAVRDRIVD RMIETEERVR SKDSKRLYYL SMEFLMGRSL NDNLHNLGIT ELMREVLASI GMSLDDVLAC ELDAGLGNGG LGRLAACFLE SLATLGMPGY GYGIDYEYGL FRQEINGGFQ REKPDRWKAN GTPFEIGRPD RALSIPLYGR VETSRDNHGN LRQIWTSQKF VLGIPSDMPV VGWGGETVNF LRLFAARASE DFDIEIFNRG DYIRAVEQKI GNENISRVLY PSDSVMSGKE LRLVQEYFLV ACAIGDLVRR YDLDHKGYEQ FPSKVAIQMN DTHPSLAVAE LMRVFIDEKG LEWEQAWELT QATCAYTNHT LLPEALERWS VSLMERVLPR HMQLIYGINH QFLRSISEYS FTDPDLMRRT SIIEEAADKQ VRMAHLAIIG SHAVNGVAAL HSELVKSTLV PDFAKLWPER FSNKTNGVAP RPWLHKSNPG LAALLTETIG ERWVTDLSLL RGLQKFADDA AFRAKFREVK QHNKNRLAKV IFDQTGVQVY TSSLFDVQIK RIHEYKRQLL NVMRIIDQYL QCVDHGVEIK VPRTFVFAGK AAPGYWAAKQ IIKLIHNVAS VVNSDPRMKD RIKVAFLPDY RVSLAEIIIP AADVSEQIST AGMEASGTGN MKLSMNGALT VGTYDGANIE ILEEVGEENF YLFGLRVEQI EEMKRKGLYN AQEYYAKNER TKAVMDSLGS DRFSPQEPGL FRWIVDEILY RGDRYFHLAD LPSYVEINES VDEDYLDQEL WSRKAAINVA RIGKFSSDRT ILEYARDIWH IGPFEQPWVP KPKPEAILTA APPASASESS SVVG
|
| |
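The relationship between the gene sequence and the protein sequence can be checked by translating codons. The following is a minimal sketch, not the database's own method: it hard-codes the amino acid assignments that translation table 11 shares with the standard genetic code, ignores alternative start codons, and treats the listed gene sequence as the coding strand read 5' to 3' (which the leading ATG and trailing TAG suggest). The helper name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(dna: str) -> str:
    """Translate a coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their original spacing.
print(translate("ATGAACGCAC CGCTCGGGGT AAAAGTGGCG"))
```

The first ten codons translate to MNAPLGVKVA, matching the start of the listed protein sequence, and the final codon TAG is a stop, which is why the protein is one residue shorter than the codon count.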