Gene Cag_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0401 
Symbol 
ID3747779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp465941 
End bp466939 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content46% 
IMG OID637772929 
Product2-desacetyl-2-hydroxyethyl bacteriochlorophyllide 
Protein accessionYP_378717 
Protein GI78188379 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.632471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCAA TGAAAGCCAA AGCAATTGTA TTTAGCGGCG TTCGGCAAAT TGAACTTGCT 
GATGTAAAGC TTAAACCGCT CTCATCCACC GATGTGTTGG TTGAAACATG GTGGTCATCT
ATTAGCACGG GCACTGAAAA AATGGCATGG AATGGTTTAA TTCCATCACC CCCATTTATC
TTTCCTTTTA TTCCGGGCTA TGAAACCGTT GGCAAAATTA TTGCCGTTGG CGCTCATGTA
AACGATAATT TGATTGGACG CTTTGCCTAT GTTGCAGGCT CGTTTGGCTA CGAAGGGGTA
AATGCTGCAT TTGGCGGCGC ATCGGAATTT ATTGCCTGCC CTGTGGATAG CTTAACCGTG
CTTGATAACA TTGAGCATCC TGAAGCAGGC ATTGCTCTAC CGCTTGGCGC TACGGCACTA
CATATTGTGG ATTTAGCTCA TGTGGAAGCC AAAAAAGTGT TGGTGCTTGG GCAAGGTGCC
GTCGGTATTC TTGCGGCGGA ACTTGCCAAA CTGATGGGCG CAAAACTTGT TGCTGTTACC
GAACCAAATT GTAACCGCTT AAAACTTTCG GCTGCCGACC TGAAAGTTAA CCCCGATCGT
CAAGATGTTT CGGCGGCGCT TGCGGGGCAT GAATTTGATG TGTTGATTGA TAGTACCGGT
ATTATGAGCG CAATTGATAC AGGCTTACGG TTCTTGAAAT TCCAAGGCAC GGTAATTTTT
GGTGGCTACT ACCAACGCAT CAACATTGAT TATTCTCAAG CCTTCCAAAA AGAGTTGTCG
TTTATTGCCG CTAAACAGTG GGCAAAAGGC GATCTTGAAC GGGTGCGTGA GCTGATTGCA
TCGCATAAGC TTAATGCCGA ACGGATTTTT ACCCACCACC ATACGGTTGG CAGTGGCAAC
ATCACCGATG CTTATCAGCA AGCCTTTACC GATCAAGATT GCTTAAAAAT GGTGCTGCAT
TGGAAACAAG CCAACGAAGA GCCAACCACA AGCAACTAA
 
Protein sequence
MKSMKAKAIV FSGVRQIELA DVKLKPLSST DVLVETWWSS ISTGTEKMAW NGLIPSPPFI 
FPFIPGYETV GKIIAVGAHV NDNLIGRFAY VAGSFGYEGV NAAFGGASEF IACPVDSLTV
LDNIEHPEAG IALPLGATAL HIVDLAHVEA KKVLVLGQGA VGILAAELAK LMGAKLVAVT
EPNCNRLKLS AADLKVNPDR QDVSAALAGH EFDVLIDSTG IMSAIDTGLR FLKFQGTVIF
GGYYQRINID YSQAFQKELS FIAAKQWAKG DLERVRELIA SHKLNAERIF THHHTVGSGN
ITDAYQQAFT DQDCLKMVLH WKQANEEPTT SN