Gene Acel_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0521 
Symbol 
ID4485034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp555555 
End bp557948 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content63% 
IMG OID639729288 
Productputative phosphoketolase 
Protein accessionYP_872280 
Protein GI117927729 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3957] Phosphoketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.544074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC CGACCGCCGG CCCTGGTGAT GCCGTGCTCT CCGATCAAGA ACTCGCTGCC 
ATCGACGCGT ACTGGCGGGC CGCGAATTAC TTATGTGTCG GCCAACTCTA CCTGCTGGAC
AATCCGCTCC TGCGCGAACC GCTGCGTCCG GAACACATCA AGCCGCGGCT GCTCGGCCAC
TGGGGGACCA CGCCCGGCCT CAACTTCATC TACACGCACA TGAACCGCTT CATTCGCCGT
GACGATTGGA ACGCCATTTT CATTACCGGG CCGGGACACG GCGGCCCGGC TCCGGTTGCG
CAGGCGTATC TCGAAGGGAC GTACAGCGAA ATCTACCCGT CCATCACGCG GGACGAAGAA
GGATTGCGCC GGCTCTTCCG GCAGTTCTCT TTCCCGGGCG GTATCCCGAG TCACGCGGCG
CCGGAGACGC CCGGGTCCAT CCATGAGGGC GGCGAATTAG GGTACAGCCT GGCGCACGCT
TTCGGCGCGG TGTTCGACAA TCCCGACCTG CTCGCCGTCT GCGTCGTCGG CGACGGTGAA
GCGGAGACCG GGCCGCTGGC CACGAGCTGG CACTCCAACA AGTTTCTCAA TCCGCGGCAC
GACGGTGCGG TGCTGCCGAT CCTGCATCTC AACGGCTACA AAATAGCCGG GCCGACCGTT
CTCGGCCGGA TACCGCGTTC GGAACTGCTC GCCCTCTTTG AGGGCTATGG TTATCGCCCG
CTGCTGGTGG CCGGCGACGA TCCGGCGGCC ATGCACCAGT TGATGGCCCG GACCCTCGAG
GAGGCGCTGG ACGACATCCG GGAGATCCAG CGGCGTGCGC GCAGCGGCGT TCCCGGCCGG
CCCACCTGGC CGATGATTAT TCTCGAAACA CCCAAGGGGT GGACCGGTCC CTGCGAGGTT
GACGGCCGCC CGGTCGAGGG CAGTTTCCGC TCGCACCAGA TACCTCTTAC CGATCCCCGC
GGTAACGCCG AGCATCTCGC AATGCTTGAA GCATGGATGC GCAGCTACCG ACCGGAAGAG
CTCTTCGACG CCCACGGCGC CGTCCGACCT GAACTCACCG ACCTCGCGCC GCGGGGAGCC
CGCCGGATGA GTGCCAATCC GCATGCGAAC GGCGGTGTCT TGCTCCGTGA CCTCTTCCTC
CCTGATTTCA CCGCATATGC CGTTCCGGTG GAAAAGCCGG CGACGACGAC TAGCGAGGCG
ACTCGGGTGC TCGGCCGCTA CCTGCGCGAC GTGCTTGCCC TCAACGCGGA GAGTCGGAAT
TTCCGGATCT TTGGTCCGGA TGAGACGGCG TCGAACCGTC TCGACGCCGT TTTCGACGTC
GAGAACCGCA TGCTCAACGC GGACATTCTG CCGACCGACG ATCACATTGC GCCCGACGGA
CGGGTTATGG AGGTGCTCTC CGAGCACCTC TGCCAGGGTT GGCTTGAAGG GTATCTCCTC
ACCGGCCGTC ATGGGCTATT CAATTCCTAT GAAGCGTTCA TCCACATCGT TGATTCGATG
TTCAATCAGC ATGCGAAGTG GCTCAAGGTG ACCCGCAGCC TCCCCTGGCG GCGTCCGATC
GCGAGCCTGA ACTATCTCCT CTCGAGCCAC GTGTGGCGGC AGGATCACAA CGGCTTCTCG
CACCAGGATC CGGGCTTCAT CGACTTGGTC GTCAACAAGA AGGCTGAAGT CATCCGGGTC
TATCTGCCGC CCGACACGAA CACGCTGCTC TCAGTGATGG ATCATTGCCT GCGGACGAGG
AATTACGTCA ACGTCGTCAT CGCGGGCAAG CAGCCGGCGC TCAACTACCT TTCCATGCCG
GACGCCATCG CACATTGCAC CCGGGGAATC GGCGTGTGGA GCTGGGCCAG CAACGACATG
GGTGACGACG GCAGAGGGGA ACCGGACGTC GTCATGGCCG CGGCAGGTGA CGTGCCCACG
CTTGAGGCTT TAGCAGCCAC CGCACTGCTG CGGGAATTCT TCCCGGATTT ACGGGTCCGC
TTCATCAATG TCGTTGACCT TATGCGGCTG CAGCCGAGCA GCGAGCACCC GCACGGGCTG
ACTGACGCAG AGTTCGACGC GTTGTTCACC ACCGACAAGC CGGTCATCTT TGCCTTCCAC
GGTTACCCAT GGCTCATCCA TCGGCTCACC TACCGCAGGA CGAACCACAA GAACATTCAC
GTGCGCGGCT ACAAGGAAGA GGGGACGACG ACCACGCCCT TCGACATGGC CATGCTCAAC
GACATCGATC GCTATCACTT GGTGATGGAT GTCATCGATC GGGTACCGAG CCTGGGCACC
CGTGCCTATC ACATTCGACA GCGCATGGCC GATGAACGTC TGGCGAAACG GCGTTACACC
CGGGACGTGG GTGACGACCA TCCGGACGTC AAGAACTGGG TGTGGCCGTG GTAG
 
Protein sequence
MTEPTAGPGD AVLSDQELAA IDAYWRAANY LCVGQLYLLD NPLLREPLRP EHIKPRLLGH 
WGTTPGLNFI YTHMNRFIRR DDWNAIFITG PGHGGPAPVA QAYLEGTYSE IYPSITRDEE
GLRRLFRQFS FPGGIPSHAA PETPGSIHEG GELGYSLAHA FGAVFDNPDL LAVCVVGDGE
AETGPLATSW HSNKFLNPRH DGAVLPILHL NGYKIAGPTV LGRIPRSELL ALFEGYGYRP
LLVAGDDPAA MHQLMARTLE EALDDIREIQ RRARSGVPGR PTWPMIILET PKGWTGPCEV
DGRPVEGSFR SHQIPLTDPR GNAEHLAMLE AWMRSYRPEE LFDAHGAVRP ELTDLAPRGA
RRMSANPHAN GGVLLRDLFL PDFTAYAVPV EKPATTTSEA TRVLGRYLRD VLALNAESRN
FRIFGPDETA SNRLDAVFDV ENRMLNADIL PTDDHIAPDG RVMEVLSEHL CQGWLEGYLL
TGRHGLFNSY EAFIHIVDSM FNQHAKWLKV TRSLPWRRPI ASLNYLLSSH VWRQDHNGFS
HQDPGFIDLV VNKKAEVIRV YLPPDTNTLL SVMDHCLRTR NYVNVVIAGK QPALNYLSMP
DAIAHCTRGI GVWSWASNDM GDDGRGEPDV VMAAAGDVPT LEALAATALL REFFPDLRVR
FINVVDLMRL QPSSEHPHGL TDAEFDALFT TDKPVIFAFH GYPWLIHRLT YRRTNHKNIH
VRGYKEEGTT TTPFDMAMLN DIDRYHLVMD VIDRVPSLGT RAYHIRQRMA DERLAKRRYT
RDVGDDHPDV KNWVWPW