Gene Information |
Locus tag | GWCH70_1631 |
Symbol | purU |
ID | 7979109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1706719 |
End bp | 1707621 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644798514 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_002949686 |
Protein GI | 239827062 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
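The length fields in this record can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive start and end positions and a CDS that includes its stop codon; the record does not state either convention, but both are consistent with the values listed. The function names are hypothetical and serve only as illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1706719, 1707621)
print(length)                     # 903
print(protein_length_aa(length))  # 300
```

Both results match the Gene Length (903 bp) and Protein Length (300 aa) fields.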
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000043119 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATACAT TTCGCAAACA TCGATGGCAA TCATTTCTAC AAAACTATGA AAATCGTGCT CGTTTGTTAA TTTCATGTCC GGATAAACCG GGTATCGTCG CGTCTGTAAC ATCGTTTTTG TACGAACAAG GCGCAAATAT TGTCGAATCT AGTCAATATT CCACCGATCC GGAAGGTGGC ACCTTTTTCT TAAGAATCGA ATTTGATTGT CCGAATATCG CGTTGCGAAA GCAAGAAATT GAATCCGCTT TTCAGCCGAT CGCCGAGTCG TTTCATATGG ATTGGCGCCT TCGCTTACAT AATGATGTCA AACGAATTGC TATATTTGTT TCCAAGGCAG AACATTGTTT ACTAGAATTA TTATGGCAGT GGCAAGCGGG TGAATTAATT GCCGATATTG CGCTTGTGAT AAGCAATCAT GAACATCTCA GAAGCACAGT AGAATCGGTT GGCATTCCAT ATTTCCATAT TCCTGTTACG AAAGAAACGA AAGCGGAAGC CGAGCAAAAG CAAATCGAGC TGCTAAAAAA ATATGAAGTG GATACGATTG TGCTCGCCCG TTATATGCAA ATTTTATCCC CAGCATTTGT CGCTGAATTT CCGGGAAGAA TTATTAACAT TCATCATTCC TTTTTACCAG CGTTCATCGG AGCAAGACCA TACGAACGGG CGTATGAGCG AGGCGTGAAA CTAATTGGCG CAACTTCGCA TTACGTAACC GATGATTTAG ATGAAGGTCC GATCATCGAA CAAGACGTTG CCCGTGTTGA CCATCGCCAC CATCCCGATG ATTTAAAACG AATGGGAAGA ATTATTGAAA AAACCGTGCT TGCCAGAGCC TTAAAATGGC ATTTGGAAGA TCGTGTCATC ATTCATGGAA ACAAAACGAT TGTCTTTTAT TAA
|
Protein sequence | MHTFRKHRWQ SFLQNYENRA RLLISCPDKP GIVASVTSFL YEQGANIVES SQYSTDPEGG TFFLRIEFDC PNIALRKQEI ESAFQPIAES FHMDWRLRLH NDVKRIAIFV SKAEHCLLEL LWQWQAGELI ADIALVISNH EHLRSTVESV GIPYFHIPVT KETKAEAEQK QIELLKKYEV DTIVLARYMQ ILSPAFVAEF PGRIINIHHS FLPAFIGARP YERAYERGVK LIGATSHYVT DDLDEGPIIE QDVARVDHRH HPDDLKRMGR IIEKTVLARA LKWHLEDRVI IHGNKTIVFY
|
| |
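The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be removed before any computation. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGCATACAT TTCGCAAACA"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 35.0
```

Passing the full Gene sequence field through the same two helpers would give the value to compare against the GC content field.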