Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_1488 |
Symbol | purU |
ID | 4057374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | - |
Start bp | 1572380 |
End bp | 1573270 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641230506 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_604952 |
Protein GI | 94985588 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.973322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCCC CCTCCTCTTC CTCCAGGCTT GATCCCCTCA ACACCGCCGT CCTCACCATC ACCTGCCCGG ACCGGGGCGG CATCGTGGCG GCGGTGTCGC AGTTTCTTTT TAGCCACGGC GCGAACATCC TCCACTCCGA CCAGCACTCC ACTGACCCCG CAGGCGGCAC CTTTTTCATG CGGATGGAGT TTCACCTCGA TGGCCTCGAT CTGGCGCGCG AGCCGTTCGA GCGGGCCTTT GCGCAGGTCA TCGCCGCCCC CTTTGGCATG GACTGGCGCC TGAGCTACAC GGCTCAGCCC AAGCGCATGG CGATTTTGGT GAGCCGCTAC GACCACTGCT TTTTGGATCT GCTGTGGCGC AGGCGCCGGG GCGAACTGAA TGTGGAAATT CCCCTCGTGA TCAGTAACCA CCCGGACCTC GCCCGAGACG CCGACATGTT CGGCATTCCC TTTCACGTGG TCCCCGTAAC GCGGGAGAAC AAGGCAGAGG CCGAAGCCGA GCAGGTGCGG TTGCTGCAGG AAGCCGGAGC CGACTTCGCC GTTCTCGCGC GCTACATGCA GATTCTCAGC GGTGACTTCC TGCGCGAGTT TGGGCGTCCG GTCATCAACA TCCACCACTC GTTCCTGCCG GCCTTTGTGG GAGCCAACCC CTACCGCGCC GCCTTTCAGC GCGGCGTAAA GCTCATTGGC GCGACCAGCC ACTACGTGAC GGAAGAACTC GACGCCGGGC CGATCATCGC CCAGGACGTG ATTCCCGTGA CCCACCGTGA GACTCCCGAC ACCCTGATGC GCCTGGGCCG CGACGTGGAA CGCCAGGTGC TCGCTCGCGC CGTCAAGGCC CACGTGGAAG ACCGGGTGCT GGTGCACGGC AACAAGACGG TGGTGTTTTA G
|
Protein sequence | MTAPSSSSRL DPLNTAVLTI TCPDRGGIVA AVSQFLFSHG ANILHSDQHS TDPAGGTFFM RMEFHLDGLD LAREPFERAF AQVIAAPFGM DWRLSYTAQP KRMAILVSRY DHCFLDLLWR RRRGELNVEI PLVISNHPDL ARDADMFGIP FHVVPVTREN KAEAEAEQVR LLQEAGADFA VLARYMQILS GDFLREFGRP VINIHHSFLP AFVGANPYRA AFQRGVKLIG ATSHYVTEEL DAGPIIAQDV IPVTHRETPD TLMRLGRDVE RQVLARAVKA HVEDRVLVHG NKTVVF
|
| |