Gene Cag_1642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1642 
Symbol 
ID3747949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2140412 
End bp2142529 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content48% 
IMG OID637774180 
Productshort chain dehydrogenase 
Protein accessionYP_379937 
Protein GI78189599 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAACC TTTGGAACGA TGCAGAATTA CAGGGCTTTG TGAGCAATGT GTGCCACGAG 
CCTGATGACC ATCCTGAGCT TGCAGCATTG GTGTATGCTT CGCGGTTGCT TGGGCGTGAA
CGTTCATTAG TAATGCACGG TGGTGGCAAT ACCTCCGTTA AGTGTGGGCT TACCGATATG
GTTGGCAACC ATGCGGAAGT GTTGCTTATT AAAGCAAGTG GTATTGATTT AAGCAATGTT
ACCTGTCGCG ATTATACCCC GCTCCGCTTA GGTCCACTGA GTAAACTTGT GGAGCTGTGC
AGCAGTAACG ATCCTATTCA TGCTGAGCGT GTTGAACGTT TTTCAACGAA AGAGTTTAAG
CATCTTTTAA TGCTGAATAT GTTCAGCCTT ACTGACCATA TGGCTGAAAA ACGTTTAACT
CCATCTATTG AAACGCTGCT CCATGCCTTT CTGCCTCATC GCTACATTTT ACATACCCAC
TCATTTGCCT TGCTTACCAT GAGCAATCAG CCAAACGGTG AAGCGCTCTG CCGCGAAACC
TTAGGCGAGG CATTTGGCTC GGTGCCTTAC ATTAAACCCG GTCTTGGTTT GGCGCGTGCC
GCCGCTGGTG TGTATGAAGC ACATCCCGCA ATTGAGGGGC TTGTGTTGCA AAAGCATGGC
TTAGTAACTT TTGGTGAAAC GGCTCAAGAG GCGTACAACC GCATGATTGA CGCTGTTACA
AAACTTGAAG AGCGCATTGC TTTGGCTGGT CGTAAACCAT TTACAACGGT TCCTTTGCCC
GAAGAAATTG CAAAAGTGGA AGATGTTGCT CCCATTATTC GTGGAGCTTG TGCTGAGGAA
AAAGAGGTTG GTCGTCGCGA TTATCAACAT TTGATTCTTG ATTTTCGTAC CTCCGATGAA
ATTCTCACTT ACGTCAATAG TGCCGATGTT GTGCGCATGA GTCAAAAAGG CTCCATGACG
CCCGATTTTA TTATTCGTAC AAAAAATAAG CCACTTGTGG TGCCTGCACC TGACGCAGCG
GATCTTAACG GATTTAAAGC TGCTGTTGAT GAAGCGGTGC AGCGCTATCG TGATGCCTAT
ATTGCCTACT TTAATGCGCA ACAGCAAGCT TCAGGTATGG AGGTTACCAT GCTTGATCCT
ATGCCGCGTG TGGTGTTGGT GCCAGGGCTT GGGCTTTTTG GTTTAGGAAA AAGTGCCGCG
GCAGCCGCAG TGAATGCTGA TATTGCTACC TGTACTGCCA CTGCTATTCT TGATGCTGAA
TCGGTGGGTT CTTTTGAATC CATTAGCGAG CGTGAAGCTT TTGATATTGA GTATTGGGAT
ATGGAGCAGG CAAAAATCAA TAAGGTGTAT CACGGGACGT TTGCGGGTAA AGTGGTGATG
GTAACGGGAG GAGCAAGTGG CATTGGGCTT GCTACAGCCA AAGCATTTCG TCAGCGTGGT
GCTGAGTTAG TGGTGTTAGA TCTTTCTCAA GAAGCGCTTG ATAAAGCGGC TGAAGAGATT
GGCGGTAATC CCTTAACGCT TACCTGCAAT GTTACCTCAC GTGCTGATAT TCGTGCGGCG
TATGATGCGG TTTGCAAGCG TTATGGCGGT GTTGATGTAA TTGTCTCCAA CGTTGGTGCG
GCTATTCAAG GGCGCATTGG CGATGTGTCG GATGAGTTGT TGCGCAAGAG TTTTGAAATT
AACTTTTTCT CCCACCACTA CATTGCTCAA GAAGCGGTAC GTGTGATGCG TTTGCAAGGC
ACGGGCGGTG TGTTGCTTTT TAATGTTTCA AAGCAAGCGG TTAATCCAGG TCCCGATTTT
GGACCTTACG GTTTACCAAA AGCTGCCACC ATGTTTCTTG TGCGCCAATA TGCACTTGAC
CACGGTCGTG ATGGCATTCG TGCAAACGGC ATTAATGCCG ACCGCATTCG CACCGGACTT
TTGACTGAAG AGATGATTAA ATCGCGCTCG GCGGCGCGTG GTTTAAGCGA GCACGAATAT
ATGGCTGGTA ATTTGTTGCA ACTTGAGGTA TATGCTGAAG ATGTGGCTGA AGCCTTTGTG
CATTTAGCCC AAGAAATTCG CACCAACGCC GCAATCATTA CCGTTGATGG TGGCAACATT
GCTGCTACGT TGCGGTAG
 
Protein sequence
MQNLWNDAEL QGFVSNVCHE PDDHPELAAL VYASRLLGRE RSLVMHGGGN TSVKCGLTDM 
VGNHAEVLLI KASGIDLSNV TCRDYTPLRL GPLSKLVELC SSNDPIHAER VERFSTKEFK
HLLMLNMFSL TDHMAEKRLT PSIETLLHAF LPHRYILHTH SFALLTMSNQ PNGEALCRET
LGEAFGSVPY IKPGLGLARA AAGVYEAHPA IEGLVLQKHG LVTFGETAQE AYNRMIDAVT
KLEERIALAG RKPFTTVPLP EEIAKVEDVA PIIRGACAEE KEVGRRDYQH LILDFRTSDE
ILTYVNSADV VRMSQKGSMT PDFIIRTKNK PLVVPAPDAA DLNGFKAAVD EAVQRYRDAY
IAYFNAQQQA SGMEVTMLDP MPRVVLVPGL GLFGLGKSAA AAAVNADIAT CTATAILDAE
SVGSFESISE REAFDIEYWD MEQAKINKVY HGTFAGKVVM VTGGASGIGL ATAKAFRQRG
AELVVLDLSQ EALDKAAEEI GGNPLTLTCN VTSRADIRAA YDAVCKRYGG VDVIVSNVGA
AIQGRIGDVS DELLRKSFEI NFFSHHYIAQ EAVRVMRLQG TGGVLLFNVS KQAVNPGPDF
GPYGLPKAAT MFLVRQYALD HGRDGIRANG INADRIRTGL LTEEMIKSRS AARGLSEHEY
MAGNLLQLEV YAEDVAEAFV HLAQEIRTNA AIITVDGGNI AATLR