Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1789 |
Symbol | purU |
ID | 6375476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1934979 |
End bp | 1935908 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642684282 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_001960188 |
Protein GI | 189500718 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00390951 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTATCTG TAGCAAACCC GATCAGCAAG GCGATTTTCG ATGATAACAT TTTAACCCGA AGCGATGTGA CGCATACTTC TCAAAAAGCA GTTCTCCTGC TCTCCTGTCC GGACCGCATC GGGCTCGTTT CACGGATCTC GAATTTTATC TTCGAACGAA GAGGAAATAT TCTCGATCTC GATGAACATG TGGATATTGC ATCAGGCATG TTTTTTATCA GGGTGTCCTG GAGCAGGGAT GATGTATCCA TAACGACGGC TGATCTTCAA GGTGCATTCA GTCCGCTCGC CCTGGAGCTG GGGGCTGACT GGAAAATTTA TGTGATTCCT GAAAAACCGC GCGTGGCTGT GTTTGTCTCC AGGTATGATC ACTGTCTGCA GGATCTGTTA TGGCGATACA AGACCGGGGA ATTTGCTATG GAAATCCCCT TGATTATATC CAATCACCGG GATCTGGAGG ATCTTGCCGC ACAGTATTCC ATCCCTTTTC ATGTGTTCCC GAAAACTCGT GAAAACAAGC TGGAGCAGGA AACGAAGGAA CTTGAATTGC TCAAGGAAAA CCGTGTCGAC ACGATTGTTC TTGCCCGGTA TATGCAGGTT CTTTCTCAAC GGTTTGTCGA TGCGTATCCT GACAGGATCA TCAACATCCA TCACTCGTTT CTTCCTGCCT TTTCAGGCGG CAGTCCTTAT AAACAGGCCT TTGAAAGGGG GGTCAAAATA ATCGGCGCTA CCAGTCACTA TGTGACCGGA GAACTCGATG AAGGTCCGAT AATCGAGCAG GATATCATCA GAATCACGCA CAAGGACACT CTCGGCGATC TTATACGAAA AGGTCGGGAC CTCGAGCGTC TGGTTCTTTC AAGGGCGATC AGTTCGCATG TAGACCACCG GGTTCTGGTA AACGGCCGTA AAACCATTAT TTTTACCTGA
|
Protein sequence | MLSVANPISK AIFDDNILTR SDVTHTSQKA VLLLSCPDRI GLVSRISNFI FERRGNILDL DEHVDIASGM FFIRVSWSRD DVSITTADLQ GAFSPLALEL GADWKIYVIP EKPRVAVFVS RYDHCLQDLL WRYKTGEFAM EIPLIISNHR DLEDLAAQYS IPFHVFPKTR ENKLEQETKE LELLKENRVD TIVLARYMQV LSQRFVDAYP DRIINIHHSF LPAFSGGSPY KQAFERGVKI IGATSHYVTG ELDEGPIIEQ DIIRITHKDT LGDLIRKGRD LERLVLSRAI SSHVDHRVLV NGRKTIIFT
|
| |