Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2111 |
Symbol | purU |
ID | 5713107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2239132 |
End bp | 2240037 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641268034 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_001533449 |
Protein GI | 159044655 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.229424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAGGCCT TGCTGTCAAC CTCCCGGAAT GCTGATGCCG GGACCCCGCC CGCGGCGGAG CCGCATTTCG TCCTGAACCT GTCCTGCGCC GCAGAGCCGG GGATCGTGGC CGCCGTCACC ACGGCGCTGG CCGGGCAGGG GGCGAACCTG GTGGAAACCG CCCAGTTCTG GGACCGTCAG AGCGACCGGT TCTTCCTGCG CGTGGCTTTC CTGGGCCAGC CGGGGACCGA TGTCGCCGGG ATCGAGGCCG CCCTCGCGCC GACCCGCGCG CGCTTCGGCA TGGATGTGAC GGTCCTGGAC TCGGCCCGCA AGCCGCGGAT CCTGATCATG GTGTCGCGCT TCGATCACGC GCTGCTGCAC CTGCTCTACC AGGTGCGGGT CGGCTGGCTC TCGGCCGAGG TGGTCGCGAT CGTGTCGAAC CACCCCGATG CCCGGCGCGT GGCCGAGCAT GAAGGCGTTC CGTTCCACCA TATCCCCGTC AGCCGCGACA CCAAGCCCGA GGCCGAGGCC CGCCTGAAGG CCCTGGTCGC CGAAACCGGC GCCGACCTGG TCGTGCTGGC CCGCTACATG CAGGTGTTGT CCGATGACTT CTCCCGGGTG CTGGCTGGCC GGGTCATCAA CATCCACCAT TCGTTCCTGC CCAGTTTCAA GGGGGCCAAA CCCTATCACC AGGCCCATGA GCGCGGGGTC AAGCTGATCG GCGCCACGGC CCATTACGTG ACAGCCGATC TCGACGAGGG ACCGATCATA GAACAAGAGG CCGAGCGCAT CACCCACTCG ATGACACCCG ACGATCTGGT GGCCGTGGGG CGCGATATCG AATCCCGTGT GCTGGCCCGC GCCGTCAAAC GCCACCTGGA GGGGCGGGTG ATGCTCAACG GGCAGAGAAC GGTCGTCTTC ACCTGA
|
Protein sequence | MEALLSTSRN ADAGTPPAAE PHFVLNLSCA AEPGIVAAVT TALAGQGANL VETAQFWDRQ SDRFFLRVAF LGQPGTDVAG IEAALAPTRA RFGMDVTVLD SARKPRILIM VSRFDHALLH LLYQVRVGWL SAEVVAIVSN HPDARRVAEH EGVPFHHIPV SRDTKPEAEA RLKALVAETG ADLVVLARYM QVLSDDFSRV LAGRVINIHH SFLPSFKGAK PYHQAHERGV KLIGATAHYV TADLDEGPII EQEAERITHS MTPDDLVAVG RDIESRVLAR AVKRHLEGRV MLNGQRTVVF T
|
| |