Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1964 |
Symbol | |
ID | 3747826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2491503 |
End bp | 2494361 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637774500 |
Product | molybdenum enzyme related to thiosulfate reductase and polysulfide reductase, large subunit |
Protein accession | YP_380255 |
Protein GI | 78189917 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.664709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTACA CGCATAAACC AACCGTTATT GAAGCCATCG CTGAAAAGCT GCACCTTATT CCCAATCTTC ATGAGGGAGA TGGTGCAGCG CCGCTGCCTC GTATAGCGGC AGAGGGAAGC GAAGTAAGTT GCCCACCTCC TGACCAATGG GATAATTGGG TGGAGTACGA TGCTAAGAGC TGGCCTGAGC GTAAAAGCAA TGAGTATATG CTGATTCCAA CAGCCTGTTT TAATTGCGAA GCGGCGTGTG GCTTGCTTGC TTATGTGGAT AAGCAAACCA TGCAAGTTCG TAAGCTTGTT GGCAATCCAT ACCATCCCGC AAGTCGTGGA CGTAATTGCG CGAAAGGTCC AGCAACGCTC AATCAGCTTG AGGATACCGA TCGCATTCTT TATCCCATGA AGCGGGTTGG TAAGCGCGGC GAAGGTAAGT GGGAGCGCGT GAGTTGGGAT AGTGTGCTTG ACGATATTGC GGCTCGAATG CGTAAAGCGT TGCTTGAAGG GCGCAATAAT GAAATTTCGT ACCACGTTGG TCGTCCCGGT CATACGGGAT TTATGGATTG GGTGCTGCCT GCATGGAATG TTGATGGGCA CAATAGCCAC ACCAATGTTT GCTCGTCGGG CGCTCGGTTT GGCTATGCTA TTTGGGAAGG GTACGATCGC CCTTCGCCCG ATTACACCAA TACCAAGTTT ATGCTGTTGG TGAGTGCGCA TCTTGAATCG GGGCACTATT TTAACCCCCA TGCTCAGCGC ATTATGGAGG CAAAAATGAA AGGTGCCAAG CTTGCCGTGC TTGATCCTCG CCTTTCAAAC ACTGCCAGTA TGGCAGATTA TTGGTTGACC ACTTTCCCTG GTAGTGAAGC GGCTGTGCTG CTTGCTTTAG CGAAAGTGCT GATTGATGAA GGACTTTACA ATCGCGACTA TCTTGAAAAT TGGGTGAATT GGCAGGAGTA CCTTCAGAAG GAATATCCAA AAGAGCCGGT AACCTTTGAG CGCTTTATTG AGGCGTTAAA GGCTGAGTAT CGCGACTACA CGCCCGAGTT TGCTGAGCAA GAGAGTGGTG TAAAGGCTGC TACCATTGTG GAGATTGCCC GTAACATTGG CGAGGCTGGA ACCCAATTTG CAACCCACGT GTGGCGTAGT GCTTGTGCAG GCAACCTTGG TGGATGGGCA GTATCGCGCA CGTTGCACTT CTTGAACGTC CTTACGGGTA GCGTAGGAAC GCCCGGTGGT ACGTCGCCAA GTTCATGGAA TAAATTTCAC GTGCATGTTC ATGCAGAGCC AAAACCGCAA ACGTTTTGGA ATCCTTTGCA CATGCCTAAT GAATATCCAT TAGCCCACTT TGAAGTCAGC ATGTTGCTTC CTCACTTCCT TAAAGAGGGA CGTGGCAAGC TCGATGTTTA CTTCACTCGT GTGTTTAACC CCGTGTGGAC TTATCCCGAT GGTTTTTCAT GGATTGAAGC GCTTGAAGAT GAATCAAAAA TTGGGCTTCA TGCGGCGCTT ACGCCAACAT GGAGCGAAAC TGCTTACTTT GCCGATTATG TGCTGCCAAT GGGGCACTCA ACCGAGCGCC ACGATATTAT TAGCTACGAA ACCCATGCCG CGCAGTGGAT TGGTTTTCGC CAACCCATAT TGCGCGTTGC AAGCGAAAAA ATGGGTAAGC CGGTTACGTT CACTTACGAA GCGAATCCGG GTGAAGTGTG GGAAGAGGAT GAATTTTGGA TTGAACTGAG CTGGCGCATT GATCCTGATG GATCGCTTGG TATTCGCCAG CACTTTATGT CGCCCTATCG TCCGGGTGAA AAAATCACCA TTAGCGAATA TTACCGTTAC ATTTTTGAGC ACACTCCAGG ATTGCCCGAA AAAGCCGCTG AAGAGGGCTT AACGGCATTG GAGTACATGC AGAAGTATGG CGCGTTTGAA GTTGAAACCA ACATCTATAA CTACAACGAA CGTCCACTCT CTCCTGACGA TCGTCAAGGG GCAACGGTTG ATGCCGAAAG CGGTCTTATT TCCAAAAATG GAAAAGCGGT TGGGGTGAAG ATAAATGGCA TGGAGTGTGC AGGTTTTCCA ACACCATCGC GCAAGCAAGA GTTTTACTCG CAAACCATGG TGGATTGGAA GTGGCCTGAG TATCGCAAGC CAACCTATGT TAAAAGCCAT GTGCATCATG AGCAAATGAA TCTTAGCAAT GGTGAGTTTG CGTTAGTTCC AACCTTCCGC TTACCCGTGC TGATTCACTC GCGTTCAGGT AATGCAAAGT GGTTGGCTGA AATTGCCCAC CGCAATCCGC TGTGGATGAA TGCTGCCGAT GCACGAATGC TTGGCTTTGA AGATGGCGAT TTAGTTCGAG TGAATACCGA TATCGGTTAC TCGGTGAATC GCATTTGGGC AACCGAAGGT ATTCGCAATG GCGTGGTAGC TTGCTCGCAC CACATTGGAC GCTGGCGCCG TAGCCAAGAC CCTGAAGCGA ACCGTTGGGC AACCAATCGT GTAGCAATTA AGCGCGAAGG GGGGAGCACA TGGAGAATGA GAGTTGAGGA AAGCATTGAG CCTTATGAAA GCAGCGATCC CGATTCATCA CGTATTTTTT GGTCCGATGG TGGCGTCCAC CAAAACATAA CCTTCCCCGT GCACCCCGAT CCCATTAGTG GAATGCACTG CTGGCACCAA AAAGTGCGCG TTGAAAAAGC GCATGACGGT GACCAGTATG GTGATATTGT GGTTGATACC AATCGTTCGC ATGAAATTTA CCGAGAGTGG CTTGCTATGA CGCGCCCAGC TCCCGGACCA AACGGCTTGC GCCGTCCATT GTGGCTTGGT CGCCCTTATC GCCCCGATGA AAAAACCTAT TATATTTAA
|
Protein sequence | MSYTHKPTVI EAIAEKLHLI PNLHEGDGAA PLPRIAAEGS EVSCPPPDQW DNWVEYDAKS WPERKSNEYM LIPTACFNCE AACGLLAYVD KQTMQVRKLV GNPYHPASRG RNCAKGPATL NQLEDTDRIL YPMKRVGKRG EGKWERVSWD SVLDDIAARM RKALLEGRNN EISYHVGRPG HTGFMDWVLP AWNVDGHNSH TNVCSSGARF GYAIWEGYDR PSPDYTNTKF MLLVSAHLES GHYFNPHAQR IMEAKMKGAK LAVLDPRLSN TASMADYWLT TFPGSEAAVL LALAKVLIDE GLYNRDYLEN WVNWQEYLQK EYPKEPVTFE RFIEALKAEY RDYTPEFAEQ ESGVKAATIV EIARNIGEAG TQFATHVWRS ACAGNLGGWA VSRTLHFLNV LTGSVGTPGG TSPSSWNKFH VHVHAEPKPQ TFWNPLHMPN EYPLAHFEVS MLLPHFLKEG RGKLDVYFTR VFNPVWTYPD GFSWIEALED ESKIGLHAAL TPTWSETAYF ADYVLPMGHS TERHDIISYE THAAQWIGFR QPILRVASEK MGKPVTFTYE ANPGEVWEED EFWIELSWRI DPDGSLGIRQ HFMSPYRPGE KITISEYYRY IFEHTPGLPE KAAEEGLTAL EYMQKYGAFE VETNIYNYNE RPLSPDDRQG ATVDAESGLI SKNGKAVGVK INGMECAGFP TPSRKQEFYS QTMVDWKWPE YRKPTYVKSH VHHEQMNLSN GEFALVPTFR LPVLIHSRSG NAKWLAEIAH RNPLWMNAAD ARMLGFEDGD LVRVNTDIGY SVNRIWATEG IRNGVVACSH HIGRWRRSQD PEANRWATNR VAIKREGGST WRMRVEESIE PYESSDPDSS RIFWSDGGVH QNITFPVHPD PISGMHCWHQ KVRVEKAHDG DQYGDIVVDT NRSHEIYREW LAMTRPAPGP NGLRRPLWLG RPYRPDEKTY YI
|
| |