Gene Dgeo_1488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1488 
SymbolpurU 
ID4057374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1572380 
End bp1573270 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content64% 
IMG OID641230506 
Productformyltetrahydrofolate deformylase 
Protein accessionYP_604952 
Protein GI94985588 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0788] Formyltetrahydrofolate hydrolase 
TIGRFAM ID[TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent
[TIGR00655] formyltetrahydrofolate deformylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.973322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCC CCTCCTCTTC CTCCAGGCTT GATCCCCTCA ACACCGCCGT CCTCACCATC 
ACCTGCCCGG ACCGGGGCGG CATCGTGGCG GCGGTGTCGC AGTTTCTTTT TAGCCACGGC
GCGAACATCC TCCACTCCGA CCAGCACTCC ACTGACCCCG CAGGCGGCAC CTTTTTCATG
CGGATGGAGT TTCACCTCGA TGGCCTCGAT CTGGCGCGCG AGCCGTTCGA GCGGGCCTTT
GCGCAGGTCA TCGCCGCCCC CTTTGGCATG GACTGGCGCC TGAGCTACAC GGCTCAGCCC
AAGCGCATGG CGATTTTGGT GAGCCGCTAC GACCACTGCT TTTTGGATCT GCTGTGGCGC
AGGCGCCGGG GCGAACTGAA TGTGGAAATT CCCCTCGTGA TCAGTAACCA CCCGGACCTC
GCCCGAGACG CCGACATGTT CGGCATTCCC TTTCACGTGG TCCCCGTAAC GCGGGAGAAC
AAGGCAGAGG CCGAAGCCGA GCAGGTGCGG TTGCTGCAGG AAGCCGGAGC CGACTTCGCC
GTTCTCGCGC GCTACATGCA GATTCTCAGC GGTGACTTCC TGCGCGAGTT TGGGCGTCCG
GTCATCAACA TCCACCACTC GTTCCTGCCG GCCTTTGTGG GAGCCAACCC CTACCGCGCC
GCCTTTCAGC GCGGCGTAAA GCTCATTGGC GCGACCAGCC ACTACGTGAC GGAAGAACTC
GACGCCGGGC CGATCATCGC CCAGGACGTG ATTCCCGTGA CCCACCGTGA GACTCCCGAC
ACCCTGATGC GCCTGGGCCG CGACGTGGAA CGCCAGGTGC TCGCTCGCGC CGTCAAGGCC
CACGTGGAAG ACCGGGTGCT GGTGCACGGC AACAAGACGG TGGTGTTTTA G
 
Protein sequence
MTAPSSSSRL DPLNTAVLTI TCPDRGGIVA AVSQFLFSHG ANILHSDQHS TDPAGGTFFM 
RMEFHLDGLD LAREPFERAF AQVIAAPFGM DWRLSYTAQP KRMAILVSRY DHCFLDLLWR
RRRGELNVEI PLVISNHPDL ARDADMFGIP FHVVPVTREN KAEAEAEQVR LLQEAGADFA
VLARYMQILS GDFLREFGRP VINIHHSFLP AFVGANPYRA AFQRGVKLIG ATSHYVTEEL
DAGPIIAQDV IPVTHRETPD TLMRLGRDVE RQVLARAVKA HVEDRVLVHG NKTVVF