Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0521 |
Symbol | |
ID | 4485034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 555555 |
End bp | 557948 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639729288 |
Product | putative phosphoketolase |
Protein accession | YP_872280 |
Protein GI | 117927729 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3957] Phosphoketolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.544074 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAC CGACCGCCGG CCCTGGTGAT GCCGTGCTCT CCGATCAAGA ACTCGCTGCC ATCGACGCGT ACTGGCGGGC CGCGAATTAC TTATGTGTCG GCCAACTCTA CCTGCTGGAC AATCCGCTCC TGCGCGAACC GCTGCGTCCG GAACACATCA AGCCGCGGCT GCTCGGCCAC TGGGGGACCA CGCCCGGCCT CAACTTCATC TACACGCACA TGAACCGCTT CATTCGCCGT GACGATTGGA ACGCCATTTT CATTACCGGG CCGGGACACG GCGGCCCGGC TCCGGTTGCG CAGGCGTATC TCGAAGGGAC GTACAGCGAA ATCTACCCGT CCATCACGCG GGACGAAGAA GGATTGCGCC GGCTCTTCCG GCAGTTCTCT TTCCCGGGCG GTATCCCGAG TCACGCGGCG CCGGAGACGC CCGGGTCCAT CCATGAGGGC GGCGAATTAG GGTACAGCCT GGCGCACGCT TTCGGCGCGG TGTTCGACAA TCCCGACCTG CTCGCCGTCT GCGTCGTCGG CGACGGTGAA GCGGAGACCG GGCCGCTGGC CACGAGCTGG CACTCCAACA AGTTTCTCAA TCCGCGGCAC GACGGTGCGG TGCTGCCGAT CCTGCATCTC AACGGCTACA AAATAGCCGG GCCGACCGTT CTCGGCCGGA TACCGCGTTC GGAACTGCTC GCCCTCTTTG AGGGCTATGG TTATCGCCCG CTGCTGGTGG CCGGCGACGA TCCGGCGGCC ATGCACCAGT TGATGGCCCG GACCCTCGAG GAGGCGCTGG ACGACATCCG GGAGATCCAG CGGCGTGCGC GCAGCGGCGT TCCCGGCCGG CCCACCTGGC CGATGATTAT TCTCGAAACA CCCAAGGGGT GGACCGGTCC CTGCGAGGTT GACGGCCGCC CGGTCGAGGG CAGTTTCCGC TCGCACCAGA TACCTCTTAC CGATCCCCGC GGTAACGCCG AGCATCTCGC AATGCTTGAA GCATGGATGC GCAGCTACCG ACCGGAAGAG CTCTTCGACG CCCACGGCGC CGTCCGACCT GAACTCACCG ACCTCGCGCC GCGGGGAGCC CGCCGGATGA GTGCCAATCC GCATGCGAAC GGCGGTGTCT TGCTCCGTGA CCTCTTCCTC CCTGATTTCA CCGCATATGC CGTTCCGGTG GAAAAGCCGG CGACGACGAC TAGCGAGGCG ACTCGGGTGC TCGGCCGCTA CCTGCGCGAC GTGCTTGCCC TCAACGCGGA GAGTCGGAAT TTCCGGATCT TTGGTCCGGA TGAGACGGCG TCGAACCGTC TCGACGCCGT TTTCGACGTC GAGAACCGCA TGCTCAACGC GGACATTCTG CCGACCGACG ATCACATTGC GCCCGACGGA CGGGTTATGG AGGTGCTCTC CGAGCACCTC TGCCAGGGTT GGCTTGAAGG GTATCTCCTC ACCGGCCGTC ATGGGCTATT CAATTCCTAT GAAGCGTTCA TCCACATCGT TGATTCGATG TTCAATCAGC ATGCGAAGTG GCTCAAGGTG ACCCGCAGCC TCCCCTGGCG GCGTCCGATC GCGAGCCTGA ACTATCTCCT CTCGAGCCAC GTGTGGCGGC AGGATCACAA CGGCTTCTCG CACCAGGATC CGGGCTTCAT CGACTTGGTC GTCAACAAGA AGGCTGAAGT CATCCGGGTC TATCTGCCGC CCGACACGAA CACGCTGCTC TCAGTGATGG ATCATTGCCT GCGGACGAGG AATTACGTCA ACGTCGTCAT CGCGGGCAAG CAGCCGGCGC TCAACTACCT TTCCATGCCG GACGCCATCG CACATTGCAC CCGGGGAATC GGCGTGTGGA GCTGGGCCAG CAACGACATG GGTGACGACG GCAGAGGGGA ACCGGACGTC GTCATGGCCG CGGCAGGTGA CGTGCCCACG CTTGAGGCTT TAGCAGCCAC CGCACTGCTG CGGGAATTCT TCCCGGATTT ACGGGTCCGC TTCATCAATG TCGTTGACCT TATGCGGCTG CAGCCGAGCA GCGAGCACCC GCACGGGCTG ACTGACGCAG AGTTCGACGC GTTGTTCACC ACCGACAAGC CGGTCATCTT TGCCTTCCAC GGTTACCCAT GGCTCATCCA TCGGCTCACC TACCGCAGGA CGAACCACAA GAACATTCAC GTGCGCGGCT ACAAGGAAGA GGGGACGACG ACCACGCCCT TCGACATGGC CATGCTCAAC GACATCGATC GCTATCACTT GGTGATGGAT GTCATCGATC GGGTACCGAG CCTGGGCACC CGTGCCTATC ACATTCGACA GCGCATGGCC GATGAACGTC TGGCGAAACG GCGTTACACC CGGGACGTGG GTGACGACCA TCCGGACGTC AAGAACTGGG TGTGGCCGTG GTAG
|
Protein sequence | MTEPTAGPGD AVLSDQELAA IDAYWRAANY LCVGQLYLLD NPLLREPLRP EHIKPRLLGH WGTTPGLNFI YTHMNRFIRR DDWNAIFITG PGHGGPAPVA QAYLEGTYSE IYPSITRDEE GLRRLFRQFS FPGGIPSHAA PETPGSIHEG GELGYSLAHA FGAVFDNPDL LAVCVVGDGE AETGPLATSW HSNKFLNPRH DGAVLPILHL NGYKIAGPTV LGRIPRSELL ALFEGYGYRP LLVAGDDPAA MHQLMARTLE EALDDIREIQ RRARSGVPGR PTWPMIILET PKGWTGPCEV DGRPVEGSFR SHQIPLTDPR GNAEHLAMLE AWMRSYRPEE LFDAHGAVRP ELTDLAPRGA RRMSANPHAN GGVLLRDLFL PDFTAYAVPV EKPATTTSEA TRVLGRYLRD VLALNAESRN FRIFGPDETA SNRLDAVFDV ENRMLNADIL PTDDHIAPDG RVMEVLSEHL CQGWLEGYLL TGRHGLFNSY EAFIHIVDSM FNQHAKWLKV TRSLPWRRPI ASLNYLLSSH VWRQDHNGFS HQDPGFIDLV VNKKAEVIRV YLPPDTNTLL SVMDHCLRTR NYVNVVIAGK QPALNYLSMP DAIAHCTRGI GVWSWASNDM GDDGRGEPDV VMAAAGDVPT LEALAATALL REFFPDLRVR FINVVDLMRL QPSSEHPHGL TDAEFDALFT TDKPVIFAFH GYPWLIHRLT YRRTNHKNIH VRGYKEEGTT TTPFDMAMLN DIDRYHLVMD VIDRVPSLGT RAYHIRQRMA DERLAKRRYT RDVGDDHPDV KNWVWPW
|
| |