Gene Acid345_2329 details


Gene Information

Locus tag: Acid345_2329
Symbol:
ID: 4071483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2759738
End bp: 2762713
Gene Length: 2976 bp
Protein Length: 991 aa
Translation table: 11
GC content: 59%
IMG OID: 637984345
Product: Alpha-1,6-glucosidases, pullulanase-type
Protein accession: YP_591404
Protein GI: 94969356
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases
TIGRFAM ID: [TIGR02103] alpha-1,6-glucosidases, pullulanase-type
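
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2759738, 2762713)
print(length)                  # 2976
print(protein_length(length))  # 991
```

Under these assumptions the values agree with the Gene Length (2976 bp) and Protein Length (991 aa) fields.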


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0783894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0180008
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCGCA GACTTTCTCT GCCCGTGCTC ATTTTGCTCT GCACGCTTGC GGGTTACGCG 
CAAGGACCAA CTCAAGCTCG CATTCACTAC TATCGTCCCG ACGCCCAGTT CGCTGGCTGG
GGCCTGCATG TTTGGAACAA CACCACCGAC AACGTCACGT GGACGACTCC GCTGCAACCC
GCTGGCAGCG ATAGCTATGG AATCTATTTC GACGTGGATT TGTTCGACGC AACCACGCCG
CTGGGATTCA TCATCCACAA GGGCGATACC AAGGATCCCG GCCCCGATAT GTACATGGAT
GTGACGCAAC CTGGGCGGAA TGCCTGGATC ATCTCGGGCG ACTCGACCAT TTACTACACG
CAGCCGACGA AGTCGCAGTT GCTGTCGGCG AACTTCCATC GTCTGCAGGC GTACTGGATT
GATCGCAACA CGATCGCCAT CCAGGCCGCG TATTTCCATA GCGGCGACAC GGTGAGGTTG
CACTCGGACT TGAGCGCAGG GCTGGCGATT ACGGACACTG GATTGACCGG GGGCCAAAGC
ACCCTGCTCA CTCCGGATCC GAGCGGACTC ACGCCGGCGC AGTTGGCGAC GTTCCCGCAA
CTCAATGGCT ACGCCGCTTT CCATCTGCAG AACGCGAGGA GCTTCAACTA CGCAATGTAT
TTGAAGGGGC AAGTCGCAGT ATCCGATGTG GATAGCGCCG GAAATCTAAC CTACGCGACA
GGAGTACAAA CGCCCGGCGT GCTCGACGAT TTGTACTTTT ACAGCGGGAA ACTGGGGCCG
TCGTTCCGCG GGACCGTGCC AACGGTCAGC GTGTGGGCTC CGACAGCGCA ATCCGTCGCT
CTACAACTCT TCAACGCCGC AACCGACGCG ACTCCAACTC AGGTCGTGCC GATGCACGAG
AGCAACGGCG TGTGGAGCGT GCAGGGCAAG CCACAGTGGA AGAACAAGTA CTACCTCTTC
AATGTGAAGG TCTACACGCC GTTCACGTTC TCGGTTGTAG AGAATGTCGT CACGGATCCG
TGGTCGCTGG GCCTGTCGCT GAACAGCACG AAGAGCCAGA TCATCGACCT CGATGACGCG
TCCAACAAGC CATTGGGATG GGACCTGCTT CCCTCTCCGC CGCTGGCGAG TTGGAACGAC
TTGAGCATCT ATGAACTGCA CATCCGCGAC TTCAGCGCGA CGGACAGCAC GATTCCAGCG
GTGCAGCGAG GAACCTATCT GGCGTTCACC GACCAAACGT CGAATGGCAT GAAGCACCTG
CGCAGCCTAT CGCAGTCGGG ATTGAGAGCA GTCCATCTGC TGCCGAATTT CGACATTGCC
AGTGTGAATG AAGACAAGAC TACCTGGAAG ACGACGGGAG ATCTCAGCGT CTATCCGCCA
GACTCCGACC AGCAGCAGGC GGCGGTGGCA GCCATCCAGG GGCAGGACGG CTTCAACTGG
GGCTACGATC CTCTGCACTA TCTCACGCCA GAGGGCAGCT ACGCAGTGAA TGCCGCCAGT
CGCACGAAGG AATATCGTGC GATGGTCGAG TCGCTGCATG CAAACGGGCT GCGCGTGATT
CAGGACGTGG TCTTCAATCA CACCAGCAGC TTCGGACAAA ATCCAAACTC CGTCCTCGAT
GAGGTTGTGC CCGATTACTA CTATCGTCTC GACGCCAACG GCGCGAACTA CTTCGCAAGC
TGCTGCGCCG ACACGGCGAC CGAGCATCGC ATGATGGAAA AGCTGATGAT CGACGCGGTG
ACCACCTTCG CCAAGGAATA CAAGGTGGAT GGCTTCCGCT TCGACATCAT GTCCTTCCAT
TTGCTGTCGA ACATCCAGCA CGTACGGCAG GCGCTTGACC AGCTCACTCT ACGGAACGAT
GGCGTAGATG GCCGCAAGGT TTACCTGTAC GGTGAGGGTT GGAACTTCGG CGAAACCGGC
AACAACGCGC TCGGCAAGAA CGCCATGCAG AGCAATCTGT ATGGCACCGG CATTGGATCG
TTCAACGACC GCACGCGCGA TGGCGTGCGT GGTGGTGGAC CATTCAACGA TGTCCGCGAG
CAAGGCTTCG CTACCGGCTT ATTCACCGAC CCGAACACCA CGTTCTCCAT CGGCGACAGC
GACACACAGA AAGCAAAGCT GCTGCAGGAA AGCGATTGGA TCCGCATCGG CCTGACCGGA
AACCTGCGCG ATTACAGCTT CACGGATAGC AATGGCAACA CCGTGACCGG CGGCAGCGTG
GACTACAACG GACAGCACGC AGGCTACACC GCGCAGCCGA TCGAAGACAT CAGCTACGCA
TCCGCGCACG ACAATCAAAC CATCTTCGAT GCGGTGCAAT TGAAGTCGAA CATCACCGAC
TCAATTGCCG ACCGCACCCG TCGCCAGAAC CTGGCAAACA GTCTTATCCT GCTCGGTCAG
GGTATTCCCT TCTTCCACGC AGGCGACGAC ATCCTGCGTT CGAAGTCGGG TGACAACAAC
AGCTACGACT CCGGCGACTG GTTCAACAAG ATCGACTGGA CCCTGCAAAC CGACAACTGG
GGCGTGGGCC TGCCGATCGC GAGCCAGAAC AGCTTCCAAT GGAGCGACCT GCAACCACTA
CTCGCCGACC CAGCACTGAT GCCGCAACCG GCGAACATCC AGCGCGCATA CGACCACTTC
TGCGAGATGT TGCGCGTGCG CTACAGCTCA CCGCTGTTCC GCATGAGTAC CGAGCCGCAA
ATCCAAGCGA ACTTGCGGTT CCTGAATGTC GGTCCGAACC AGATCCCGGG CGTAATTGCC
GTGGTGCTGG GAACGGGACG TCAGCAGATC GTGGTGGTGT TCAACGGCAG TAATGCGTCG
CAGACGATCA GCGATGCGTC GCTGGCGAAT TTGCGTCTTC AACTCCATCC GGTGCTGCAA
CACTCCACGG ACCCGGTCGT GAAGCAGTCC ACCTACAACA GCAACGGGAG CGTAACGGTG
CCGGCCCTGA CGACGGCGGT GTTCACGCCG CGTTGA
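
The gene sequence is printed in space-separated blocks of 10 bases. A minimal sketch for joining the blocks into one string and computing its length and GC fraction follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCTCCGCA GACTTTCTCT GCCCGTGCTC ATTTTGCTCT GCACGCTTGC GGGTTACGCG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this first line only
```

Passing the full sequence text to the same functions should allow a comparison with the Gene Length and GC content fields of the record.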
 
Protein sequence
MLRRLSLPVL ILLCTLAGYA QGPTQARIHY YRPDAQFAGW GLHVWNNTTD NVTWTTPLQP 
AGSDSYGIYF DVDLFDATTP LGFIIHKGDT KDPGPDMYMD VTQPGRNAWI ISGDSTIYYT
QPTKSQLLSA NFHRLQAYWI DRNTIAIQAA YFHSGDTVRL HSDLSAGLAI TDTGLTGGQS
TLLTPDPSGL TPAQLATFPQ LNGYAAFHLQ NARSFNYAMY LKGQVAVSDV DSAGNLTYAT
GVQTPGVLDD LYFYSGKLGP SFRGTVPTVS VWAPTAQSVA LQLFNAATDA TPTQVVPMHE
SNGVWSVQGK PQWKNKYYLF NVKVYTPFTF SVVENVVTDP WSLGLSLNST KSQIIDLDDA
SNKPLGWDLL PSPPLASWND LSIYELHIRD FSATDSTIPA VQRGTYLAFT DQTSNGMKHL
RSLSQSGLRA VHLLPNFDIA SVNEDKTTWK TTGDLSVYPP DSDQQQAAVA AIQGQDGFNW
GYDPLHYLTP EGSYAVNAAS RTKEYRAMVE SLHANGLRVI QDVVFNHTSS FGQNPNSVLD
EVVPDYYYRL DANGANYFAS CCADTATEHR MMEKLMIDAV TTFAKEYKVD GFRFDIMSFH
LLSNIQHVRQ ALDQLTLRND GVDGRKVYLY GEGWNFGETG NNALGKNAMQ SNLYGTGIGS
FNDRTRDGVR GGGPFNDVRE QGFATGLFTD PNTTFSIGDS DTQKAKLLQE SDWIRIGLTG
NLRDYSFTDS NGNTVTGGSV DYNGQHAGYT AQPIEDISYA SAHDNQTIFD AVQLKSNITD
SIADRTRRQN LANSLILLGQ GIPFFHAGDD ILRSKSGDNN SYDSGDWFNK IDWTLQTDNW
GVGLPIASQN SFQWSDLQPL LADPALMPQP ANIQRAYDHF CEMLRVRYSS PLFRMSTEPQ
IQANLRFLNV GPNQIPGVIA VVLGTGRQQI VVVFNGSNAS QTISDASLAN LRLQLHPVLQ
HSTDPVVKQS TYNSNGSVTV PALTTAVFTP R
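
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds the codon table from the conventional TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it permits, which this sketch does not model. The name `translate` is illustrative, and the code assumes an uppercase A/C/G/T sequence with no ambiguity codes.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCTCCGCA GACTTTCTCT G"))  # MLRRLSL
print(translate("CCGCGTTGA"))                # PR
```

Both results match the start (MLRRLSL...) and end (...PR) of the protein sequence listed above.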