Gene Cag_1964 details

Gene Information

Locus tag: Cag_1964
Symbol:
ID: 3747826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2491503
End bp: 2494361
Gene Length: 2859 bp
Protein Length: 952 aa
Translation table: 11
GC content: 50%
IMG OID: 637774500
Product: molybdenum enzyme related to thiosulfate reductase and polysulfide reductase, large subunit
Protein accession: YP_380255
Protein GI: 78189917
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
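
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2491503
END_BP = 2494361
GENE_LENGTH_BP = 2859
PROTEIN_LENGTH_AA = 952


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2859
print(expected_protein_length(GENE_LENGTH_BP))  # 952
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.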


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.664709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTACA CGCATAAACC AACCGTTATT GAAGCCATCG CTGAAAAGCT GCACCTTATT 
CCCAATCTTC ATGAGGGAGA TGGTGCAGCG CCGCTGCCTC GTATAGCGGC AGAGGGAAGC
GAAGTAAGTT GCCCACCTCC TGACCAATGG GATAATTGGG TGGAGTACGA TGCTAAGAGC
TGGCCTGAGC GTAAAAGCAA TGAGTATATG CTGATTCCAA CAGCCTGTTT TAATTGCGAA
GCGGCGTGTG GCTTGCTTGC TTATGTGGAT AAGCAAACCA TGCAAGTTCG TAAGCTTGTT
GGCAATCCAT ACCATCCCGC AAGTCGTGGA CGTAATTGCG CGAAAGGTCC AGCAACGCTC
AATCAGCTTG AGGATACCGA TCGCATTCTT TATCCCATGA AGCGGGTTGG TAAGCGCGGC
GAAGGTAAGT GGGAGCGCGT GAGTTGGGAT AGTGTGCTTG ACGATATTGC GGCTCGAATG
CGTAAAGCGT TGCTTGAAGG GCGCAATAAT GAAATTTCGT ACCACGTTGG TCGTCCCGGT
CATACGGGAT TTATGGATTG GGTGCTGCCT GCATGGAATG TTGATGGGCA CAATAGCCAC
ACCAATGTTT GCTCGTCGGG CGCTCGGTTT GGCTATGCTA TTTGGGAAGG GTACGATCGC
CCTTCGCCCG ATTACACCAA TACCAAGTTT ATGCTGTTGG TGAGTGCGCA TCTTGAATCG
GGGCACTATT TTAACCCCCA TGCTCAGCGC ATTATGGAGG CAAAAATGAA AGGTGCCAAG
CTTGCCGTGC TTGATCCTCG CCTTTCAAAC ACTGCCAGTA TGGCAGATTA TTGGTTGACC
ACTTTCCCTG GTAGTGAAGC GGCTGTGCTG CTTGCTTTAG CGAAAGTGCT GATTGATGAA
GGACTTTACA ATCGCGACTA TCTTGAAAAT TGGGTGAATT GGCAGGAGTA CCTTCAGAAG
GAATATCCAA AAGAGCCGGT AACCTTTGAG CGCTTTATTG AGGCGTTAAA GGCTGAGTAT
CGCGACTACA CGCCCGAGTT TGCTGAGCAA GAGAGTGGTG TAAAGGCTGC TACCATTGTG
GAGATTGCCC GTAACATTGG CGAGGCTGGA ACCCAATTTG CAACCCACGT GTGGCGTAGT
GCTTGTGCAG GCAACCTTGG TGGATGGGCA GTATCGCGCA CGTTGCACTT CTTGAACGTC
CTTACGGGTA GCGTAGGAAC GCCCGGTGGT ACGTCGCCAA GTTCATGGAA TAAATTTCAC
GTGCATGTTC ATGCAGAGCC AAAACCGCAA ACGTTTTGGA ATCCTTTGCA CATGCCTAAT
GAATATCCAT TAGCCCACTT TGAAGTCAGC ATGTTGCTTC CTCACTTCCT TAAAGAGGGA
CGTGGCAAGC TCGATGTTTA CTTCACTCGT GTGTTTAACC CCGTGTGGAC TTATCCCGAT
GGTTTTTCAT GGATTGAAGC GCTTGAAGAT GAATCAAAAA TTGGGCTTCA TGCGGCGCTT
ACGCCAACAT GGAGCGAAAC TGCTTACTTT GCCGATTATG TGCTGCCAAT GGGGCACTCA
ACCGAGCGCC ACGATATTAT TAGCTACGAA ACCCATGCCG CGCAGTGGAT TGGTTTTCGC
CAACCCATAT TGCGCGTTGC AAGCGAAAAA ATGGGTAAGC CGGTTACGTT CACTTACGAA
GCGAATCCGG GTGAAGTGTG GGAAGAGGAT GAATTTTGGA TTGAACTGAG CTGGCGCATT
GATCCTGATG GATCGCTTGG TATTCGCCAG CACTTTATGT CGCCCTATCG TCCGGGTGAA
AAAATCACCA TTAGCGAATA TTACCGTTAC ATTTTTGAGC ACACTCCAGG ATTGCCCGAA
AAAGCCGCTG AAGAGGGCTT AACGGCATTG GAGTACATGC AGAAGTATGG CGCGTTTGAA
GTTGAAACCA ACATCTATAA CTACAACGAA CGTCCACTCT CTCCTGACGA TCGTCAAGGG
GCAACGGTTG ATGCCGAAAG CGGTCTTATT TCCAAAAATG GAAAAGCGGT TGGGGTGAAG
ATAAATGGCA TGGAGTGTGC AGGTTTTCCA ACACCATCGC GCAAGCAAGA GTTTTACTCG
CAAACCATGG TGGATTGGAA GTGGCCTGAG TATCGCAAGC CAACCTATGT TAAAAGCCAT
GTGCATCATG AGCAAATGAA TCTTAGCAAT GGTGAGTTTG CGTTAGTTCC AACCTTCCGC
TTACCCGTGC TGATTCACTC GCGTTCAGGT AATGCAAAGT GGTTGGCTGA AATTGCCCAC
CGCAATCCGC TGTGGATGAA TGCTGCCGAT GCACGAATGC TTGGCTTTGA AGATGGCGAT
TTAGTTCGAG TGAATACCGA TATCGGTTAC TCGGTGAATC GCATTTGGGC AACCGAAGGT
ATTCGCAATG GCGTGGTAGC TTGCTCGCAC CACATTGGAC GCTGGCGCCG TAGCCAAGAC
CCTGAAGCGA ACCGTTGGGC AACCAATCGT GTAGCAATTA AGCGCGAAGG GGGGAGCACA
TGGAGAATGA GAGTTGAGGA AAGCATTGAG CCTTATGAAA GCAGCGATCC CGATTCATCA
CGTATTTTTT GGTCCGATGG TGGCGTCCAC CAAAACATAA CCTTCCCCGT GCACCCCGAT
CCCATTAGTG GAATGCACTG CTGGCACCAA AAAGTGCGCG TTGAAAAAGC GCATGACGGT
GACCAGTATG GTGATATTGT GGTTGATACC AATCGTTCGC ATGAAATTTA CCGAGAGTGG
CTTGCTATGA CGCGCCCAGC TCCCGGACCA AACGGCTTGC GCCGTCCATT GTGGCTTGGT
CGCCCTTATC GCCCCGATGA AAAAACCTAT TATATTTAA
 
Protein sequence
MSYTHKPTVI EAIAEKLHLI PNLHEGDGAA PLPRIAAEGS EVSCPPPDQW DNWVEYDAKS 
WPERKSNEYM LIPTACFNCE AACGLLAYVD KQTMQVRKLV GNPYHPASRG RNCAKGPATL
NQLEDTDRIL YPMKRVGKRG EGKWERVSWD SVLDDIAARM RKALLEGRNN EISYHVGRPG
HTGFMDWVLP AWNVDGHNSH TNVCSSGARF GYAIWEGYDR PSPDYTNTKF MLLVSAHLES
GHYFNPHAQR IMEAKMKGAK LAVLDPRLSN TASMADYWLT TFPGSEAAVL LALAKVLIDE
GLYNRDYLEN WVNWQEYLQK EYPKEPVTFE RFIEALKAEY RDYTPEFAEQ ESGVKAATIV
EIARNIGEAG TQFATHVWRS ACAGNLGGWA VSRTLHFLNV LTGSVGTPGG TSPSSWNKFH
VHVHAEPKPQ TFWNPLHMPN EYPLAHFEVS MLLPHFLKEG RGKLDVYFTR VFNPVWTYPD
GFSWIEALED ESKIGLHAAL TPTWSETAYF ADYVLPMGHS TERHDIISYE THAAQWIGFR
QPILRVASEK MGKPVTFTYE ANPGEVWEED EFWIELSWRI DPDGSLGIRQ HFMSPYRPGE
KITISEYYRY IFEHTPGLPE KAAEEGLTAL EYMQKYGAFE VETNIYNYNE RPLSPDDRQG
ATVDAESGLI SKNGKAVGVK INGMECAGFP TPSRKQEFYS QTMVDWKWPE YRKPTYVKSH
VHHEQMNLSN GEFALVPTFR LPVLIHSRSG NAKWLAEIAH RNPLWMNAAD ARMLGFEDGD
LVRVNTDIGY SVNRIWATEG IRNGVVACSH HIGRWRRSQD PEANRWATNR VAIKREGGST
WRMRVEESIE PYESSDPDSS RIFWSDGGVH QNITFPVHPD PISGMHCWHQ KVRVEKAHDG
DQYGDIVVDT NRSHEIYREW LAMTRPAPGP NGLRRPLWLG RPYRPDEKTY YI
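
The relationship between the gene sequence and the protein sequence listed above can be illustrated with a small translation sketch. It is a minimal example, not the database's own pipeline: it assumes the sequence is read in frame from its first base, it strips the spaces that separate the 10-base blocks, and it uses the standard codon assignments, which translation table 11 shares for internal codons. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons;
# it differs from table 1 in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 39 bases of the gene sequence shown above.
print(translate("ATGAGTTACA CGCATAAACC AACCGTTATT"))             # MSYTHKPTVI
print(translate("CGCCCTTATC GCCCCGATGA AAAAACCTAT TATATTTAA"))   # RPYRPDEKTYYI
```

The two printed fragments match the first and last residues of the protein sequence listed above. A gene that begins with an alternative start codon such as GTG or TTG would need its first residue set to M, which this sketch does not handle.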