Gene Acid345_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0820 
Symbol 
ID4072346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1016227 
End bp1018755 
Gene Length2529 bp 
Protein Length842 aa 
Translation table11 
GC content55% 
IMG OID637982829 
Productglycogen/starch/alpha-glucan phosphorylase 
Protein accessionYP_589899 
Protein GI94967851 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR02093] glycogen/starch/alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAGA CAGATATCTG GAGCGAGAAC ATGAGCAATA CGCCAGTCAC CGCACAACCC 
TCCAACGACG CTCATACCTT GCTCAAACAA TATGGATGTG GCCCTATCCC GTTCTCCGGG
ACCGACAACG CATTTTACGA GCGTCATTTT GTTTTCGACA AAGTCCTCGA TAAACGCGAG
AGTACGGCGC GTGATCAATT CGAGGCGTTC GCACGGTCCG TTCGTGATGT CCTTTCACAA
CGATGGGTGC TGACAGAACA GATTTACGGG CGACAGAACG CGAAACGCAT TTACTACGTC
TCCATGGAGT TTCTGATCGG TCGTTCGCTG GCCAACAATG TCACTAATCT TTTGCTGGAT
CCTCTCATCC AGGATTCCCT CAAACAAAAG AAGCTGGATT GGATCGAATT GATCGAGCAA
GAACCCGATG CCGGTCTCGG CAACGGCGGA TTAGGTCGAC TGGCGGCGTG CTTCCTTGAT
TCAATGGCGA CTCTGCAATT GGCCGCCATG GGATATGGAC TGCGCTATGA ATACGGAATT
TTCAAGCAAT CTATAAACGA TGGTTGGCAG GAGGAGACCG CCGACAATTG GTTGCGCCAT
CCGGATCCAT GGGAGGTCGC TCGGCCGAAC GAGGCCGTCG AGATCAAGCT GAACTGCTCG
TTTGAGTTGC ACAACGGTGT CTTGCGGGCG ATCCCCGGTC GCCCTTCAAC CATGATCGGG
ATCCCGTATG ACCGCCCGGT TGTCGGCTAC GGCGGCAAGA CCGTGAATAC CCTTCGGCTC
TGGGCCGCAT CTGCGGCCGA CTACTTCGAT TTTAGAGAAT TCAGCGGTGG CGATTTCGTC
GGCGCGTTGG CTGAGACTTT GACCGCTGAG ACGCTGACAC GCGTTCTCTA TCCCGACGAT
TCCACCTCTT TCGGGCAGGC TCTTCGCTTG GTGCAGGAAT ATTTCCTGGC AGCGTGTTCG
GTGGCGGACT TGATCCGGCG TTTCCGCAAG CACAACAGTG ATTGGAATTT GCTTCCCGAA
AAAGTTGCGG TGCAGCTGAA TGATACGCAT CCAGCACTGA CGGTCCCTGA ACTGATGAGA
GTCCTACTGG ACGAAGCGAA TCTCGGCTGG GAAACGGCTT GGGACCTCAC CCGACGTACC
CTCGCTTACA CGAACCATAC GCTCTTGCCC GAGGCGATGG AGAAGTGGCC GGTTGCCTGG
TTCGAACTCA TGCTGCCGCG ACATCTCCAG ATCATGCTTG AGATCAATCA ACGTCATCTC
GACGCTGTCC GGACCAAATT CCCGGGCGAG AACGAACGAT GCACCCGGAT GAGTCTGCTC
GAGGAAGGCT CCCCAAAGAA GCTCCGCATG GCCAACCTCG CGATTGTTGG ATCGCACAGT
ACGAATGGCG TTGCAGCCCT CCATTCGCAA CTCCTCAGGA CGACGACCCT GAAGGACTTT
GGAGAGATGT TTCCTGATCG ATTTAACAAT AAGACCAATG GAGTCACGCC GCGCCGTTGG
CTCCTATTAG CAAATCCGGC GCTTGCGCGA AACATTACGG AGGCGATCGG TGATGGGTGG
ATCAGGGATC TCGATCAACT TATCAAACTC AAGCCGCTCG CCGAGGACTG CGCCTTCTTG
GCAGCGATTC GCAAATCGAA GTACCAGGCA AAATCCGAAT TTGCAAATTG GCTCCTTCGA
ACCAGTGGGG TGAAGCTCGA TCCTGACACG ATTTTCGATA GCCAGGTGAA ACGGATTCAC
GAATACAAAC GGCAACTGTT GAACGCATTA CGAATCGTGG TCCTTTATAA CCGGCTGCGA
CAGAACCCCG AACTCGCAAT GGCGCCCCGA ACATTTCTGT TTGCGGGCAA AGCTGCACCT
GCTTATCACT TTGCGAAGTT GGTCATCAAG TTCATCAACA ATCTTGCAGG CACAATCGAG
GGCGATCCGG TCGTTCGGGG GAGACTCCGT GTCGTGTTCC TGCCCGACTA TTCCGTTTCC
ATGGCCGAGC ACCTGATTCC GGCTACCGAG GTATCGAACC AGATTTCTAC TGCCGGTTAC
GAAGCCAGCG GCACCAGCAA CATGAAGTTC ATGATGAACG GAGCACTGAC GATCGGAACG
CGCGACGGTG CAACCATCGA GATGGCCGAG GCCGCCGGCG AAGAAAACTT TTTCCTGTTT
GGCCTAACCG CCGATCAGGT ATCGCAGAAT CGCACATGGT ATTCCCCGCG CTGGCATTAT
GAGAACGAGC CGGAGACGCG TGCAGCATTG GAACTGATCT TCTCCAACCA CTTCAGCCGC
CACGAGCCGA ATGTCTTCGA GCCGTTCCGC CAACTACTTC TGGATAAGGG CGATTACTAC
ATGCACCTTG CTGATTTAGG AAGTTATCTC GCGGCGGACC AGCAACTCAC TGCGCTGTAC
AAGATTACTG ACGCATGGGC TAGCAAGGCC GTTTTGAACG TCGCCCACGC GGGCAGATTC
TCCAGCGATC GAACAATTGC GGAATACGCG GCGGACATTT GGGACGCCAA ACCGTGCCCG
GTGTCATAG
 
Protein sequence
MDKTDIWSEN MSNTPVTAQP SNDAHTLLKQ YGCGPIPFSG TDNAFYERHF VFDKVLDKRE 
STARDQFEAF ARSVRDVLSQ RWVLTEQIYG RQNAKRIYYV SMEFLIGRSL ANNVTNLLLD
PLIQDSLKQK KLDWIELIEQ EPDAGLGNGG LGRLAACFLD SMATLQLAAM GYGLRYEYGI
FKQSINDGWQ EETADNWLRH PDPWEVARPN EAVEIKLNCS FELHNGVLRA IPGRPSTMIG
IPYDRPVVGY GGKTVNTLRL WAASAADYFD FREFSGGDFV GALAETLTAE TLTRVLYPDD
STSFGQALRL VQEYFLAACS VADLIRRFRK HNSDWNLLPE KVAVQLNDTH PALTVPELMR
VLLDEANLGW ETAWDLTRRT LAYTNHTLLP EAMEKWPVAW FELMLPRHLQ IMLEINQRHL
DAVRTKFPGE NERCTRMSLL EEGSPKKLRM ANLAIVGSHS TNGVAALHSQ LLRTTTLKDF
GEMFPDRFNN KTNGVTPRRW LLLANPALAR NITEAIGDGW IRDLDQLIKL KPLAEDCAFL
AAIRKSKYQA KSEFANWLLR TSGVKLDPDT IFDSQVKRIH EYKRQLLNAL RIVVLYNRLR
QNPELAMAPR TFLFAGKAAP AYHFAKLVIK FINNLAGTIE GDPVVRGRLR VVFLPDYSVS
MAEHLIPATE VSNQISTAGY EASGTSNMKF MMNGALTIGT RDGATIEMAE AAGEENFFLF
GLTADQVSQN RTWYSPRWHY ENEPETRAAL ELIFSNHFSR HEPNVFEPFR QLLLDKGDYY
MHLADLGSYL AADQQLTALY KITDAWASKA VLNVAHAGRF SSDRTIAEYA ADIWDAKPCP
VS