Gene Information |
Locus tag | Cag_1642 |
Symbol | |
ID | 3747949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2140412 |
End bp | 2142529 |
Gene Length | 2118 bp |
Protein Length | 705 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637774180 |
Product | short chain dehydrogenase |
Protein accession | YP_379937 |
Protein GI | 78189599 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only; [S] Function unknown |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases); [COG3347] Uncharacterized conserved protein |
TIGRFAM ID | |
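The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, which the record does not state but which is consistent with the listed gene length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2140412
END_BP = 2142529


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of codons minus the terminal stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2118
print(protein_length(length_bp))  # 705
```

Both results agree with the Gene Length (2118 bp) and Protein Length (705 aa) fields of the record.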
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAACC TTTGGAACGA TGCAGAATTA CAGGGCTTTG TGAGCAATGT GTGCCACGAG CCTGATGACC ATCCTGAGCT TGCAGCATTG GTGTATGCTT CGCGGTTGCT TGGGCGTGAA CGTTCATTAG TAATGCACGG TGGTGGCAAT ACCTCCGTTA AGTGTGGGCT TACCGATATG GTTGGCAACC ATGCGGAAGT GTTGCTTATT AAAGCAAGTG GTATTGATTT AAGCAATGTT ACCTGTCGCG ATTATACCCC GCTCCGCTTA GGTCCACTGA GTAAACTTGT GGAGCTGTGC AGCAGTAACG ATCCTATTCA TGCTGAGCGT GTTGAACGTT TTTCAACGAA AGAGTTTAAG CATCTTTTAA TGCTGAATAT GTTCAGCCTT ACTGACCATA TGGCTGAAAA ACGTTTAACT CCATCTATTG AAACGCTGCT CCATGCCTTT CTGCCTCATC GCTACATTTT ACATACCCAC TCATTTGCCT TGCTTACCAT GAGCAATCAG CCAAACGGTG AAGCGCTCTG CCGCGAAACC TTAGGCGAGG CATTTGGCTC GGTGCCTTAC ATTAAACCCG GTCTTGGTTT GGCGCGTGCC GCCGCTGGTG TGTATGAAGC ACATCCCGCA ATTGAGGGGC TTGTGTTGCA AAAGCATGGC TTAGTAACTT TTGGTGAAAC GGCTCAAGAG GCGTACAACC GCATGATTGA CGCTGTTACA AAACTTGAAG AGCGCATTGC TTTGGCTGGT CGTAAACCAT TTACAACGGT TCCTTTGCCC GAAGAAATTG CAAAAGTGGA AGATGTTGCT CCCATTATTC GTGGAGCTTG TGCTGAGGAA AAAGAGGTTG GTCGTCGCGA TTATCAACAT TTGATTCTTG ATTTTCGTAC CTCCGATGAA ATTCTCACTT ACGTCAATAG TGCCGATGTT GTGCGCATGA GTCAAAAAGG CTCCATGACG CCCGATTTTA TTATTCGTAC AAAAAATAAG CCACTTGTGG TGCCTGCACC TGACGCAGCG GATCTTAACG GATTTAAAGC TGCTGTTGAT GAAGCGGTGC AGCGCTATCG TGATGCCTAT ATTGCCTACT TTAATGCGCA ACAGCAAGCT TCAGGTATGG AGGTTACCAT GCTTGATCCT ATGCCGCGTG TGGTGTTGGT GCCAGGGCTT GGGCTTTTTG GTTTAGGAAA AAGTGCCGCG GCAGCCGCAG TGAATGCTGA TATTGCTACC TGTACTGCCA CTGCTATTCT TGATGCTGAA TCGGTGGGTT CTTTTGAATC CATTAGCGAG CGTGAAGCTT TTGATATTGA GTATTGGGAT ATGGAGCAGG CAAAAATCAA TAAGGTGTAT CACGGGACGT TTGCGGGTAA AGTGGTGATG GTAACGGGAG GAGCAAGTGG CATTGGGCTT GCTACAGCCA AAGCATTTCG TCAGCGTGGT GCTGAGTTAG TGGTGTTAGA TCTTTCTCAA GAAGCGCTTG ATAAAGCGGC TGAAGAGATT GGCGGTAATC CCTTAACGCT TACCTGCAAT GTTACCTCAC GTGCTGATAT TCGTGCGGCG TATGATGCGG TTTGCAAGCG TTATGGCGGT GTTGATGTAA TTGTCTCCAA CGTTGGTGCG GCTATTCAAG GGCGCATTGG CGATGTGTCG GATGAGTTGT TGCGCAAGAG TTTTGAAATT AACTTTTTCT CCCACCACTA CATTGCTCAA GAAGCGGTAC GTGTGATGCG TTTGCAAGGC ACGGGCGGTG TGTTGCTTTT TAATGTTTCA AAGCAAGCGG TTAATCCAGG TCCCGATTTT 
GGACCTTACG GTTTACCAAA AGCTGCCACC ATGTTTCTTG TGCGCCAATA TGCACTTGAC CACGGTCGTG ATGGCATTCG TGCAAACGGC ATTAATGCCG ACCGCATTCG CACCGGACTT TTGACTGAAG AGATGATTAA ATCGCGCTCG GCGGCGCGTG GTTTAAGCGA GCACGAATAT ATGGCTGGTA ATTTGTTGCA ACTTGAGGTA TATGCTGAAG ATGTGGCTGA AGCCTTTGTG CATTTAGCCC AAGAAATTCG CACCAACGCC GCAATCATTA CCGTTGATGG TGGCAACATT GCTGCTACGT TGCGGTAG
|
Protein sequence | MQNLWNDAEL QGFVSNVCHE PDDHPELAAL VYASRLLGRE RSLVMHGGGN TSVKCGLTDM VGNHAEVLLI KASGIDLSNV TCRDYTPLRL GPLSKLVELC SSNDPIHAER VERFSTKEFK HLLMLNMFSL TDHMAEKRLT PSIETLLHAF LPHRYILHTH SFALLTMSNQ PNGEALCRET LGEAFGSVPY IKPGLGLARA AAGVYEAHPA IEGLVLQKHG LVTFGETAQE AYNRMIDAVT KLEERIALAG RKPFTTVPLP EEIAKVEDVA PIIRGACAEE KEVGRRDYQH LILDFRTSDE ILTYVNSADV VRMSQKGSMT PDFIIRTKNK PLVVPAPDAA DLNGFKAAVD EAVQRYRDAY IAYFNAQQQA SGMEVTMLDP MPRVVLVPGL GLFGLGKSAA AAAVNADIAT CTATAILDAE SVGSFESISE REAFDIEYWD MEQAKINKVY HGTFAGKVVM VTGGASGIGL ATAKAFRQRG AELVVLDLSQ EALDKAAEEI GGNPLTLTCN VTSRADIRAA YDAVCKRYGG VDVIVSNVGA AIQGRIGDVS DELLRKSFEI NFFSHHYIAQ EAVRVMRLQG TGGVLLFNVS KQAVNPGPDF GPYGLPKAAT MFLVRQYALD HGRDGIRANG INADRIRTGL LTEEMIKSRS AARGLSEHEY MAGNLLQLEV YAEDVAEAFV HLAQEIRTNA AIITVDGGNI AATLR
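The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation routine. This is a minimal sketch, not the database's own method: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). It does not handle alternative start codons such as GTG or TTG, which is acceptable here because the listed sequence starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGCAGAACC TTTGGAACGA TGCAGAATTA"))  # MQNLWNDAEL
print(translate("TTGCGGTAG"))                         # LR
```

The first ten residues match the start of the listed protein sequence, and the final codons give the terminal LR followed by a stop. Although the gene lies on the minus strand, this also indicates that the record shows the sequence in coding orientation.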