Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2603 |
Symbol | purU |
ID | 4897699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2742471 |
End bp | 2743355 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640113203 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_001044477 |
Protein GI | 126463363 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.426643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAGT TCGTCCTCAC CGTCACCTGC CCGACCCGCC GGGGCATCGT CGCCGCCATC TCGACCTTCC TCGCCGACCA TGGCTGCAAC ATCACCGACT CGGCCCAGTT CGACGATCAG GAAACCGGGC GCTTCTTCAT GCGCGTGGGC TTCCAGTCCG AGACCGGCGC CACGCTCGAC GGGCTCACCG CGGATTTCGC CGCCGTGGGC GAGACGCTCG AGGCCAACTG GCAGATCTTC GACAGCGCCA GCAAGATCAA GGTCCTGCTG ATGGTGTCGA ACTTCGGCCA CTGCCTCAAC GATCTGCTCT ACCGCTGGCG CATCGGCGCC CTGCCGATCG AGATCGTGGG CGTCGTCTCG AACCACCTGA CCTACCAGAA GGTCGTCGTG AACCACGACA TCCCCTTCCA TCTCATCAAG GTCACCAAGG ACAACAAGCC CGAGGCCGAG GCCCGCCTCA TGGCGCTGGT CGACGAGACC GGGGCCGAGC TCGTCGTGCT CGCCCGCTAC ATGCAGGTCC TGTCGGATGC CTTCTGCGCC CGCATGTCGG GCCGGATCAT CAACATCCAC CATTCGTTCC TGCCCTCCTT CAAGGGCGCG AACCCCTACA AGCAGGCCTA TCAGCGCGGC GTGAAGCTGA TCGGCGCCAC GGCGCACTAT GTCACCGCCG ATCTCGACGA GGGCCCGATC ATCGAGCAGG ACACGGTGCG CATCACCCAC GCCCAGAGCC CGGACGATTA TGTGAGCCTC GGCCGCGACG TCGAGGCCTC GGTCCTCTCC CGCGCGATCC ACGCCCATAT CCACCACCGG GTCTTCCTCA ACGGCAACAA GACCGTGGTC TTCCCGGCGT CCCCCGGCGC CCACGCCTCC GAACGGATGG GCTGA
|
Protein sequence | MPKFVLTVTC PTRRGIVAAI STFLADHGCN ITDSAQFDDQ ETGRFFMRVG FQSETGATLD GLTADFAAVG ETLEANWQIF DSASKIKVLL MVSNFGHCLN DLLYRWRIGA LPIEIVGVVS NHLTYQKVVV NHDIPFHLIK VTKDNKPEAE ARLMALVDET GAELVVLARY MQVLSDAFCA RMSGRIINIH HSFLPSFKGA NPYKQAYQRG VKLIGATAHY VTADLDEGPI IEQDTVRITH AQSPDDYVSL GRDVEASVLS RAIHAHIHHR VFLNGNKTVV FPASPGAHAS ERMG
|
| |