Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0208 |
Symbol | |
ID | 4485294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 222802 |
End bp | 223803 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639728971 |
Product | 2-amino-4-hydroxy-6- hydroxymethyldihydropteridine pyrophosphokinase |
Protein accession | YP_871968 |
Protein GI | 117927417 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0801] 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase [COG1539] Dihydroneopterin aldolase |
TIGRFAM ID | [TIGR00525] dihydroneopterin aldolase [TIGR00526] FolB domain [TIGR01498] 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0493459 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGTC CGGCCGGCAA CCCGCCACGG CCTGGCGCGG CGGCGTCCCG GTACGGCCGG ATCGAGCTGC GGGGAGTGCG GGCCGTCGGC CGGCACGGCG TCTATCCGGA GGAACGAGCC GACGGGCAGG AGTTTGTGGT GGACGCCGTC CTCGAACTCG ACCTGACGCC TGCGGTCGAG ACGGATTCCG TGGCTGCCAC GGTGAATTAC GCGGACCTCG CGGACCGCAT CGCGCGGCGC ATCGAGGGTG AACCCGTTAA CCTGATCGAG ACCCTCGCTG ACGAGATTGC CGTGGACTGC CTGGCGGACC CTCGAGTCCG CGGGGTCGAG GTGACGGTGC ACAAGCCGCA CGCCCCGGTT CCCCGGACGG TCAGCGACGT CGCCGTACGG GTCTACCGGC GACGGCCGAT CCAGGTGGTC GTCGCATTGG GGGCAAACCT CGGCGATCGG CCGGCCGCGC TGCAGCGGGC GGTCGACGCG CTGGCGGCCG CGCATCCGGT CGTGGCGGTC TCGCCGGTGT ACGAGACCGA ACCCGTTGGC GGACCGCCGC AGCCGCCGTA CCTGAACGCC GTCGCTCTGC TGGAGGCCGC CGCCGGCCCG TACGACATCC TCACCCTGGC CCAGATCATC GAAGCCGCGG CCGGTCGCAC CCGCGAGGTG CGCTGGGGAC CACGCACGCT GGACATCGAC GTGATCTGCT ACGGTGACCT TGTCCTCGAC GACCCCCGGC TCACCCTTCC GCACCCCCGC GCCGCGGAGC GCGCCTTCGT CCTCGCCCCC TGGCATGCGG TCGATCCGGC TGCCGTACTG CCCGGGCACG GCCGGGTCGC TGATCTCCTG CGCCGGCTGG ACACCAGCGG CATTCGCCGG CGGGACGACC TGCACCTTGC AGTGCCTGCC GGCACCGGCG CTCCGGTGCC GGGTGGTACG GCTGCCCAGG CGGGCGGGTC GTCCGGCCAT GCGGCCGGCC ATACGGCCGT TGCACCGGGA GCCTTGCCGT GA
|
Protein sequence | MSSPAGNPPR PGAAASRYGR IELRGVRAVG RHGVYPEERA DGQEFVVDAV LELDLTPAVE TDSVAATVNY ADLADRIARR IEGEPVNLIE TLADEIAVDC LADPRVRGVE VTVHKPHAPV PRTVSDVAVR VYRRRPIQVV VALGANLGDR PAALQRAVDA LAAAHPVVAV SPVYETEPVG GPPQPPYLNA VALLEAAAGP YDILTLAQII EAAAGRTREV RWGPRTLDID VICYGDLVLD DPRLTLPHPR AAERAFVLAP WHAVDPAAVL PGHGRVADLL RRLDTSGIRR RDDLHLAVPA GTGAPVPGGT AAQAGGSSGH AAGHTAVAPG ALP
|
| |