Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1628 |
Symbol | purU |
ID | 6980364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1656737 |
End bp | 1657633 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643396353 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_002281144 |
Protein GI | 209549227 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0459247 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.542854 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA CGGCTCCTCG CTACGCTCTT CGTGTGGCAT GCCCGTCAAT CCGCGGCGTG ACGGCCGCCA TCGCCACCTA TCTGTCGCAG AGCGGTTGCA ACATCTCGGA CAGTGCGCAA TTCGATGACG CGGACACCGG AAGATATTTC ATGCGGATCA GCTTCCAGCC GCAGGACGGA CATACGCTTG GGCAGCTTCG TGACGGCTTC GAGCCGATCG CCGACAGGTT CGAGGCGAAC GCCGAATTCT TCGACGAAGC CGAAAAGAAG AAGGTGATCC TCATGGTCAG CCGCTTCGGA CACTGCCTGA ACGATCTTCT CTACCGCTGG CGGATCGGGG CGCTTCCGAT CGACATCGTC GGCGTGATCT CCAACCACAT GGATTACCAG CGGATCGTCG TGAACCACGA CATCCCGTTC CACTGCATCA AGGTCACTAG GGAAAATAAG CCGGAGGCCG AGGCGAAACA GATGCAGATC GTCGAGGGCT CGGGAGCCGA ACTCGTCGTA CTGGCTCGAT ATATGCAGGT CCTGTCCGAC GAAATGTGCC GCAAGATGTC CGGCAGGATC ATCAATATTC ACCACTCGTT TCTGCCGAGC TTCAAGGGCG CCAATCCTTA CAAGCAGGCG TTCGAACGGG GCGTGAAGCT CATCGGCGCG ACGTCGCACT ACGTGACGGC AGACCTCGAC GAAGGTCCGA TCATCGAGCA GGATATCGTC CGCGTCACGC ATGCGCAGAG CGGCGAGGAC TATGTGAGCC TCGGCCGCGA CGTCGAAAGC CAGGTACTTG CCCGAGCCAT CCACGCCCAC ATCCATGGCC GTGTGTTCAT CAACGGCAAC AAGACAGTTG TATTCCCCGC TTCACCGGGC TCCTACGCAT CGGAGCGCAT GGGCTGA
|
Protein sequence | MTTTAPRYAL RVACPSIRGV TAAIATYLSQ SGCNISDSAQ FDDADTGRYF MRISFQPQDG HTLGQLRDGF EPIADRFEAN AEFFDEAEKK KVILMVSRFG HCLNDLLYRW RIGALPIDIV GVISNHMDYQ RIVVNHDIPF HCIKVTRENK PEAEAKQMQI VEGSGAELVV LARYMQVLSD EMCRKMSGRI INIHHSFLPS FKGANPYKQA FERGVKLIGA TSHYVTADLD EGPIIEQDIV RVTHAQSGED YVSLGRDVES QVLARAIHAH IHGRVFINGN KTVVFPASPG SYASERMG
|
| |