Gene Cphamn1_0763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0763 
Symbol 
ID6374428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp811774 
End bp812724 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content53% 
IMG OID642683270 
Productshort chain dehydrogenase 
Protein accessionYP_001959196 
Protein GI189499726 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.920637 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.460734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTTG TTAACGCTCT TTGTTTTCAG GGCAGAACAC TGATGACTAT GGCAAAGAAA 
GAACAATGGG ATACCCGGTT GATGCTCGAT CAGAGCGGGA AGGTGGCGAT CGTTACCGGA
GCGACGAGCG GGCTCGGGTA TGAAACTGCC AGAGCTCTTG CGGGAAAGGG AGCCAGGGTG
ATCATTGCTG CGCGCGATAC AGCAAAGGGA GAAAGCGCGA AAGAAAAACT TAAAAAAGAG
TATCCAGAGG CGGATGTTGC GGTTATGAAG CTCGATCTTG CTGATCTTCA GTCAGTGAGG
AAGTTCAGTG ATGATTTCAG CAAACGCTAC TCCCGTCTTG ACCTGCTGAT CAACAACGCG
GGGGTTATGG CTCCTCCCCA CGGAAAAACA GCGGATGGTT TCGAGCTGCA GTTCGGCACC
AACCATCTCG GTCACTTTGC GTTGACAATT CTTCTGCTCG AAATGCTGAA AAAAGTGCCT
GGAAGCAGGG TCGTGACGGT CAGTAGCGGT GCCCATGCGT TCGGGATGCT TGATTTTGAC
GATCTTAACT GGGAAAAGCG AAAGTATAAC AAGTGGCAGG CATATGGAGA CAGTAAGCTT
GCGAATCTGT ATTTTACGAG AGAGCTGCAG CGTCTTCTTG ACCAGGCCGG GGTAAACGTG
TTTTCCGTCG CGGCCCATCC CGGCTGGGCG GCAACGGAAC TCCAGCGATA TCAGGGATGG
CTTGTCTTGC TGAACAGTTT TTTCGCGCAG CCTCCTGGTA TGGGGGCGCT GCCGACGCTC
TACGCGGCGA CAGCGCCCGA TGTGCACGGA GGGGATTTTT TCGGTCCTGA CGGTTTCGGG
GAGATGCGCG GCTATCCGGT AAAAGTACAG TCAAGCAGGC GCTCACGCGA TATGGATGCT
GCCCGCAAGT TATGGGAGGT TTCTGAAAAA ATGACCGGGA TCAGGTGGTA G
 
Protein sequence
MSFVNALCFQ GRTLMTMAKK EQWDTRLMLD QSGKVAIVTG ATSGLGYETA RALAGKGARV 
IIAARDTAKG ESAKEKLKKE YPEADVAVMK LDLADLQSVR KFSDDFSKRY SRLDLLINNA
GVMAPPHGKT ADGFELQFGT NHLGHFALTI LLLEMLKKVP GSRVVTVSSG AHAFGMLDFD
DLNWEKRKYN KWQAYGDSKL ANLYFTRELQ RLLDQAGVNV FSVAAHPGWA ATELQRYQGW
LVLLNSFFAQ PPGMGALPTL YAATAPDVHG GDFFGPDGFG EMRGYPVKVQ SSRRSRDMDA
ARKLWEVSEK MTGIRW