Gene Acel_0208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0208 
Symbol 
ID4485294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp222802 
End bp223803 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content73% 
IMG OID639728971 
Product2-amino-4-hydroxy-6- hydroxymethyldihydropteridine pyrophosphokinase 
Protein accessionYP_871968 
Protein GI117927417 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0801] 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase
[COG1539] Dihydroneopterin aldolase 
TIGRFAM ID[TIGR00525] dihydroneopterin aldolase
[TIGR00526] FolB domain
[TIGR01498] 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0493459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGTC CGGCCGGCAA CCCGCCACGG CCTGGCGCGG CGGCGTCCCG GTACGGCCGG 
ATCGAGCTGC GGGGAGTGCG GGCCGTCGGC CGGCACGGCG TCTATCCGGA GGAACGAGCC
GACGGGCAGG AGTTTGTGGT GGACGCCGTC CTCGAACTCG ACCTGACGCC TGCGGTCGAG
ACGGATTCCG TGGCTGCCAC GGTGAATTAC GCGGACCTCG CGGACCGCAT CGCGCGGCGC
ATCGAGGGTG AACCCGTTAA CCTGATCGAG ACCCTCGCTG ACGAGATTGC CGTGGACTGC
CTGGCGGACC CTCGAGTCCG CGGGGTCGAG GTGACGGTGC ACAAGCCGCA CGCCCCGGTT
CCCCGGACGG TCAGCGACGT CGCCGTACGG GTCTACCGGC GACGGCCGAT CCAGGTGGTC
GTCGCATTGG GGGCAAACCT CGGCGATCGG CCGGCCGCGC TGCAGCGGGC GGTCGACGCG
CTGGCGGCCG CGCATCCGGT CGTGGCGGTC TCGCCGGTGT ACGAGACCGA ACCCGTTGGC
GGACCGCCGC AGCCGCCGTA CCTGAACGCC GTCGCTCTGC TGGAGGCCGC CGCCGGCCCG
TACGACATCC TCACCCTGGC CCAGATCATC GAAGCCGCGG CCGGTCGCAC CCGCGAGGTG
CGCTGGGGAC CACGCACGCT GGACATCGAC GTGATCTGCT ACGGTGACCT TGTCCTCGAC
GACCCCCGGC TCACCCTTCC GCACCCCCGC GCCGCGGAGC GCGCCTTCGT CCTCGCCCCC
TGGCATGCGG TCGATCCGGC TGCCGTACTG CCCGGGCACG GCCGGGTCGC TGATCTCCTG
CGCCGGCTGG ACACCAGCGG CATTCGCCGG CGGGACGACC TGCACCTTGC AGTGCCTGCC
GGCACCGGCG CTCCGGTGCC GGGTGGTACG GCTGCCCAGG CGGGCGGGTC GTCCGGCCAT
GCGGCCGGCC ATACGGCCGT TGCACCGGGA GCCTTGCCGT GA
 
Protein sequence
MSSPAGNPPR PGAAASRYGR IELRGVRAVG RHGVYPEERA DGQEFVVDAV LELDLTPAVE 
TDSVAATVNY ADLADRIARR IEGEPVNLIE TLADEIAVDC LADPRVRGVE VTVHKPHAPV
PRTVSDVAVR VYRRRPIQVV VALGANLGDR PAALQRAVDA LAAAHPVVAV SPVYETEPVG
GPPQPPYLNA VALLEAAAGP YDILTLAQII EAAAGRTREV RWGPRTLDID VICYGDLVLD
DPRLTLPHPR AAERAFVLAP WHAVDPAAVL PGHGRVADLL RRLDTSGIRR RDDLHLAVPA
GTGAPVPGGT AAQAGGSSGH AAGHTAVAPG ALP