Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1791 |
Symbol | |
ID | 4571153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2042800 |
End bp | 2043804 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766374 |
Product | chlorophyll synthesis pathway, BchC |
Protein accession | YP_912232 |
Protein GI | 119357588 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.916125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCAA AAGCCATCGT TTTCAGCGGC TTGAGGGAGA TTATCTTGCG GGAGGTCACC CTCAAACCGC TCTCTTCAAC TGACATCCTT GTTGAAACCT ACTGGTCGTC GATCAGTACC GGTACAGAAA AAATGGCCTT TAACGGCCTG ATACCCTCTC CGCCCTTCAT TTTCCCTTTT ATACCAGGGT ATGAAACCGT TGGCAAAATT ATTGATGCGG GCGAACATGT CAACCAGAGC CTTATCGGCA AGTTTGCCTA TGTTGCCGGA TCGTTCGGCT ATGAGGATGT GAATGCGGCT TTCGGCGGAG CTTCACAGTT CATCGTCTGC CCTGTTGACA GCATTACCGT TCTTGATACG CTTGATAACC CGCAATGCGG CATAGCCCTG CCACTTGGTG CTACGGCACT TCACATTGTG GATCTTGCCA ATGTTCAAAA CAAAAAAGTA TTGGTGCTCG GTCAGGGTGC AGTCGGCATT CTTGCCGCTG AACTTGCCGG ATTGATGGGC GCGCAACTCG TTGCAGCTAC AGAACCGCAT CAGAATCGCC TTAACCTCTC TTCAGCCGAC CTTAAAGTAA ATCCGGAAAC GCAGGATGCT TCGGCAGTTC TTGCCGGTCA CGAGTTCGAT GTGCTCATCG ACAGCACCGG AATCATGAGC GCCATTGATA CAGGTCTTCG CTTTCTCAAG TTTCACGGGG TTGTTATTTT CGGCGGCTAC TATCAGCGAG TGAACATCGA CTACTCGCAA GCTTTTCAAA AAGAGCTTTC CTTTATTGCT GCAAGGCAGT GGGCACATGG CGATCTTGAC CGTGTCAAGG ATCTCATAGG CCGGCACAAG CTTAATGCTG AAAAAATCTT CACCCACCAG AGCCAGGTAG ACGAAAACAT AACATCGGTC TACATGCAGG CATTCAGCGA TCCTGACTGC CTGAAGATGA TTCTTCACTG GAAAACAGAC AAAGAAGAGC AGGATACCGC CTGCTATATT GCAAGCACTT CATAA
|
Protein sequence | MKSKAIVFSG LREIILREVT LKPLSSTDIL VETYWSSIST GTEKMAFNGL IPSPPFIFPF IPGYETVGKI IDAGEHVNQS LIGKFAYVAG SFGYEDVNAA FGGASQFIVC PVDSITVLDT LDNPQCGIAL PLGATALHIV DLANVQNKKV LVLGQGAVGI LAAELAGLMG AQLVAATEPH QNRLNLSSAD LKVNPETQDA SAVLAGHEFD VLIDSTGIMS AIDTGLRFLK FHGVVIFGGY YQRVNIDYSQ AFQKELSFIA ARQWAHGDLD RVKDLIGRHK LNAEKIFTHQ SQVDENITSV YMQAFSDPDC LKMILHWKTD KEEQDTACYI ASTS
|
| |