Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0401 |
Symbol | |
ID | 3747779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 465941 |
End bp | 466939 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637772929 |
Product | 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide |
Protein accession | YP_378717 |
Protein GI | 78188379 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.632471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCAA TGAAAGCCAA AGCAATTGTA TTTAGCGGCG TTCGGCAAAT TGAACTTGCT GATGTAAAGC TTAAACCGCT CTCATCCACC GATGTGTTGG TTGAAACATG GTGGTCATCT ATTAGCACGG GCACTGAAAA AATGGCATGG AATGGTTTAA TTCCATCACC CCCATTTATC TTTCCTTTTA TTCCGGGCTA TGAAACCGTT GGCAAAATTA TTGCCGTTGG CGCTCATGTA AACGATAATT TGATTGGACG CTTTGCCTAT GTTGCAGGCT CGTTTGGCTA CGAAGGGGTA AATGCTGCAT TTGGCGGCGC ATCGGAATTT ATTGCCTGCC CTGTGGATAG CTTAACCGTG CTTGATAACA TTGAGCATCC TGAAGCAGGC ATTGCTCTAC CGCTTGGCGC TACGGCACTA CATATTGTGG ATTTAGCTCA TGTGGAAGCC AAAAAAGTGT TGGTGCTTGG GCAAGGTGCC GTCGGTATTC TTGCGGCGGA ACTTGCCAAA CTGATGGGCG CAAAACTTGT TGCTGTTACC GAACCAAATT GTAACCGCTT AAAACTTTCG GCTGCCGACC TGAAAGTTAA CCCCGATCGT CAAGATGTTT CGGCGGCGCT TGCGGGGCAT GAATTTGATG TGTTGATTGA TAGTACCGGT ATTATGAGCG CAATTGATAC AGGCTTACGG TTCTTGAAAT TCCAAGGCAC GGTAATTTTT GGTGGCTACT ACCAACGCAT CAACATTGAT TATTCTCAAG CCTTCCAAAA AGAGTTGTCG TTTATTGCCG CTAAACAGTG GGCAAAAGGC GATCTTGAAC GGGTGCGTGA GCTGATTGCA TCGCATAAGC TTAATGCCGA ACGGATTTTT ACCCACCACC ATACGGTTGG CAGTGGCAAC ATCACCGATG CTTATCAGCA AGCCTTTACC GATCAAGATT GCTTAAAAAT GGTGCTGCAT TGGAAACAAG CCAACGAAGA GCCAACCACA AGCAACTAA
|
Protein sequence | MKSMKAKAIV FSGVRQIELA DVKLKPLSST DVLVETWWSS ISTGTEKMAW NGLIPSPPFI FPFIPGYETV GKIIAVGAHV NDNLIGRFAY VAGSFGYEGV NAAFGGASEF IACPVDSLTV LDNIEHPEAG IALPLGATAL HIVDLAHVEA KKVLVLGQGA VGILAAELAK LMGAKLVAVT EPNCNRLKLS AADLKVNPDR QDVSAALAGH EFDVLIDSTG IMSAIDTGLR FLKFQGTVIF GGYYQRINID YSQAFQKELS FIAAKQWAKG DLERVRELIA SHKLNAERIF THHHTVGSGN ITDAYQQAFT DQDCLKMVLH WKQANEEPTT SN
|
| |