Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2073 |
Symbol | purU |
ID | 4026535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2342073 |
End bp | 2342939 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637967272 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_574123 |
Protein GI | 92114195 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0345334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGCTC GCATGTCCCA TTCGTATCGT CTTATCGTCT CGTGCCCCGA CCGCGTGGGC ATCGTCTCTC GTGTGTCCAG TTACATCGCC GGCCATGGCG GCTGGATCAC CGAGGCCAAT CAGCATTCGG ATCTGGTTAC CGGACGGTTT TTCATGCGTT ACGAAATCAA GGCCGAATCG CTGGCGCTGG ATATCGAGGC GCTGCGCGAC GATTTCGCCG CCATTGCCGA CGAGTTCTCG ATGGAGTGGG CATTGAGCGA TACCGCGCGG CGCAAGCGGG TCGTGCTGAT GGCATCGCGT GCCTCGCATT GCCTGGTCGA CCTGTTGTAT CGCTGGAATG CGGGTGAACT CGATTGCGAC ATTCCCTGCG TGATTTCTAA TCATGAATCG CTGCGCCCCT TGGTCGAGTG GCACGGCATT CCTTTCTATC ATGTCCCGGT CGAGCCGCAC GACAAGGCGG CAGCGTTCGC GCGTGTCGAA GCACTGGTCG AAGAAGCTCG CGCGGATGCC GTGGTGCTGG CGCGCTACAT GCAGATCCTG CCGCCCAACC TGTGCCAGCG TTACGCGGGG CGCGTGATCA ATATCCACCA TAGCTTTCTG CCTTCCTTCG CTGGTGCCAA GCCGTATCAC CAGGCCTACG AGCGCGGCGT CAAGCTGATC GGCGCCACCT GTCACTATGT CACCGAGGAA CTCGATGCCG GTCCGATCAT CGAGCAGGAC ATCCAGCGCG TGACGCACTG CCATACCGCC GATGACCTGG TGCGTCTGGG GCGCGACGTC GAGAAGGCGG TGCTGGCGCG TGGCCTGCGC TGGCACCTGC AGGACCGGGT GTTGATCCAC GGCAACAAGA CGGTCGTCTT CGCGTGA
|
Protein sequence | MVARMSHSYR LIVSCPDRVG IVSRVSSYIA GHGGWITEAN QHSDLVTGRF FMRYEIKAES LALDIEALRD DFAAIADEFS MEWALSDTAR RKRVVLMASR ASHCLVDLLY RWNAGELDCD IPCVISNHES LRPLVEWHGI PFYHVPVEPH DKAAAFARVE ALVEEARADA VVLARYMQIL PPNLCQRYAG RVINIHHSFL PSFAGAKPYH QAYERGVKLI GATCHYVTEE LDAGPIIEQD IQRVTHCHTA DDLVRLGRDV EKAVLARGLR WHLQDRVLIH GNKTVVFA
|
| |