Gene Information |
Locus tag | Rsph17025_2971 |
Symbol | purU |
ID | 5085174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 3036243 |
End bp | 3037127 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640484542 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_001169162 |
Protein GI | 146279003 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
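
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(3036243, 3037127)
print(length_bp)                  # 885
print(protein_length(length_bp))  # 294
```

Under these assumptions, 885 bp corresponds to 295 codons, one of which is the stop codon, giving the 294 aa listed as the protein length.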
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.266607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAGT TCGTCCTCAC CGTCACCTGC CCGACCCGCC GGGGCATCGT CGCCGCCATC TCGAACTTCC TCGCCGACAA TGGCTGCAAC ATCACCGACT CGGCGCAGTT CGACGATCAG GAAACCGGGC GCTTCTTCAT GCGCGTGGGC TTCCAGTCCG AGACCGGCGC CACCCTTGAT GCGCTGAATG ACAGCTTCGC CCGCATCGCC CCGGACTTCG AGATGGGCTG GCAGATCTTC GACAGCTCCC GGAAACTCAA GGTCCTGCTG ATGGTGTCGA ACTTCGGCCA CTGCCTGAAC GACCTGCTCT ACCGCTGGCG GATCGGGGCG CTGCCGATCG AGATCGTGGG CGTCGTCTCG AACCACCTGA CCTACCAGAA GCTCGTGGTG AACCACGACA TTCCCTTCCA CCTCATCCGG GTCACGAAGG AGAACAAGCC CGACGCCGAG GCGCGCCTGC TTGCGCTGGT GGAAGAGACG GGCGCCGAAC TGGTGGTCCT CGCCCGCTAC ATGCAGGTCC TGTCGGATTC CTTCTGCGAA CGGATGTCGG GCCGGATCAT CAACATCCAC CATTCCTTCC TGCCCTCCTT CAAGGGCGCG AACCCCTACA AGCAGGCCTA CCAGCGCGGC GTGAAGCTGA TCGGGGCCAC CGCCCACTAT GTCACCGCCG ACCTCGACGA AGGCCCGATC ATCGAGCAGG ATACGGTCCG CATCACCCAC GCCCAGAGCC CGGACGATTA TGTGAGCCTC GGCCGCGACG TGGAAGCCTC GGTGCTCGCC CGCGCGATCC ACGCCCACAT CCACCACCGG GTCTTCCTGA ACGGCAACAA GACGGTGGTC TTTCCCGCCT CGCCCGGCGC CCACGCCTCC GAACGCATGG GCTGA
|
Protein sequence | MSKFVLTVTC PTRRGIVAAI SNFLADNGCN ITDSAQFDDQ ETGRFFMRVG FQSETGATLD ALNDSFARIA PDFEMGWQIF DSSRKLKVLL MVSNFGHCLN DLLYRWRIGA LPIEIVGVVS NHLTYQKLVV NHDIPFHLIR VTKENKPDAE ARLLALVEET GAELVVLARY MQVLSDSFCE RMSGRIINIH HSFLPSFKGA NPYKQAYQRG VKLIGATAHY VTADLDEGPI IEQDTVRITH AQSPDDYVSL GRDVEASVLA RAIHAHIHHR VFLNGNKTVV FPASPGAHAS ERMG
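
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a hedged Python sketch using only the standard library, not the method used by the database. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG) and uses the sense-codon assignments of translation table 11, which are the same as in the standard genetic code. Alternative start codons permitted by table 11 are not handled, and all names in the code are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base groups and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_30 = "ATGTCGAAGT TCGTCCTCAC CGTCACCTGC"
print(translate(first_30))  # MSKFVLTVTC
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.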