Gene Tneu_1919 details


Gene Information

Locus tag: Tneu_1919
Symbol: purP
ID: 6165059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1691576
End bp: 1692610
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 62%
IMG OID: 641669082
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase
Protein accession: YP_001795280
Protein GI: 171186361
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
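
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another under the usual conventions. The sketch below shows that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1691576, 1692610)
print(length, "bp")                     # 1035 bp
print(protein_length_aa(length), "aa")  # 344 aa
```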


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.750635
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAATTA GTGGCGCCAA TCCGACGCAC GTGTCGGCCG CGTTGAAGAG ATACGACGTG 
GAGAAGCTGG CGGTCGCCAC GGTGGCGTCG CACACCGCGT TGCAGATCTT GAGGGGGGCG
AAGAGGTTCG GCTTCAGGAC CATCGCCGTG GCCGGCAGGG CAGACGCCGC CGAGTTCTAC
AGGCAGTTCG GCTTCATAGA CGAGGTGTGG ACAGCCGACT TTAGAAACTT CGTGAAGACG
GCGGAGAAGC TCGTGGAGGC CAACGCGGTG CTCGTGCCCC ACGGCTCCTA CGTGGAATAC
GTAGGCTGGA GGCAGGCGCT TGAGGCGCCG GTCCCCACCC TCGGCTGCAG AGAGCTGATA
AGGTGGGAGG CGGATCAGTA CAAGAAGATG GAGCTCCTCC AGAGAGCCGG CGTCCCCACG
CCGAGGGTCT ACAAAACGCC GGAGGAGGTG GATAGGCCCG TCATAGTTAA GCTCTTCGGC
GCCAAGGGGG GCAGGGGGTA CTTCCTCGCC AGAGATAGAG AGGAGCTGAG GAGGAGGCTG
GCCGGCCTAG GCGAGTACAT CATCCAGGAG TACGTATTCG GCGTGCCGGC CTACTACCAC
TTCTTCTCCT CCCCCGTATA CGGCAGGGTG GAGGTCTTCG GCGCAGACAT CAGGTACGAA
TCTAACGTAG ACGGGAGGAC CTTCGGCTGG GTGGAGCCCA CCTTCGTCGT CGTTGGCAAC
CTCCCCCTGG TCCTCAGGGA GTCTCTGCTC CCAACTATAT GGAAATACGG CGTCCAGTTC
GCCAAAGCCG TCGAGGAGGC GGTCGGCTGC AGGCTGGCGG GGCCCTACTG CCTGGAGTCC
ATAATAAGGG ACGACATGTC CATCTCGGTC TTCGAGTTCT CAGGGCGTAT CGTGGCTGGG
ACCAACATAT ACATGGGCTA CGGCTCGCCC TACTCGGTCC TCTACTTCGA CAGGCCTATG
GACATGGGGG AGAGGATAGC CCACGAGATA AGGGAGGCCG CCCGGAGGGG CCGCCTAGAG
GACCTCTTCA CGTAG
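
The gene sequence is printed in space-separated blocks of 10 bases. The sketch below strips that layout and computes a GC fraction, which is the quantity the GC content field summarizes. It assumes the blocks have been pasted into a plain string. The helper names are illustrative, and only the first line of the sequence is used, so the printed percentage is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGTTAATTA GTGGCGCCAA TCCGACGCAC GTGTCGGCCG CGTTGAAGAG ATACGACGTG"
seq = clean_sequence(first_line)
print(len(seq))                   # number of bases on that line
print(f"{gc_fraction(seq):.0%}")  # GC content of that line only
```

Running the same two functions over all lines of the gene sequence would give the value to compare with the GC content field.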
 
Protein sequence
MLISGANPTH VSAALKRYDV EKLAVATVAS HTALQILRGA KRFGFRTIAV AGRADAAEFY 
RQFGFIDEVW TADFRNFVKT AEKLVEANAV LVPHGSYVEY VGWRQALEAP VPTLGCRELI
RWEADQYKKM ELLQRAGVPT PRVYKTPEEV DRPVIVKLFG AKGGRGYFLA RDREELRRRL
AGLGEYIIQE YVFGVPAYYH FFSSPVYGRV EVFGADIRYE SNVDGRTFGW VEPTFVVVGN
LPLVLRESLL PTIWKYGVQF AKAVEEAVGC RLAGPYCLES IIRDDMSISV FEFSGRIVAG
TNIYMGYGSP YSVLYFDRPM DMGERIAHEI REAARRGRLE DLFT
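
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below assumes that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence (the latter is in frame).
print(translate("ATGTTAATTA GTGGCGCCAA TCCGACGCAC"))  # MLISGANPTH
print(translate("GACCTCTTCA CGTAG"))                  # DLFT
```

The two outputs match the first block and the last four residues of the protein sequence shown above.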