Gene Information |
Locus tag | Cag_0240 |
Symbol | |
ID | 3747900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 271394 |
End bp | 272383 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637772765 |
Product | dihydroflavonol 4-reductase family protein |
Protein accession | YP_378559 |
Protein GI | 78188221 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
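The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete, unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(271394, 272383)
print(length)                  # 990
print(protein_length(length))  # 329
```

Both results match the Gene Length (990 bp) and Protein Length (329 aa) fields of this record.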
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00224634 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCAAAAC TCTCCATTGC TCTTACCGGT GCTACCGGCT ACATAGGTTC GCAAGTGCTG CTTGAGTTGC TCAAGCGTTT TAAGGGTGAA CTTGATTGCC GCGTGTTGGT GCGTGGCAGC TCCAATTATG CGTGGTTGGA GGCGTTGCCG GTGCAAGTAA TTGCTGCCGA TGTGCTTGAG CCAATTGCTT TGCATGAGGC GTTGCGTGGT GTGGATACGC TTTTTCATTG TGCGGGTTTG GTGTCGTGGA CGCGCCGTTT TCGCTCTCAG CTTTATGAGG TAAATGTGGT GGGCACGCGC AATGTGCTTC ATGCGGCGCT TTATAATGGT GTGCGCCGTG TGGTAATGAC CAGCTCCATT GCAGCGGTTG GCATGTCGGA AGATGGTGCG CCTGCTAACG AGGCGGCGTT GTTTAAAGAG TGGCAGCGAC GCAATGGGTA CATGGAGGCA AAACATCTTG CCGAGCTTGA GGCGTTGCGT GCGGTGGCTG AAGGGTTGGA TGTAGTGCTG TTGAATCCGG GCGTGGTGAT TGGTGTTGAC CACCACAATC CAGCATCCCT TAGCTCGTCA AACCGTACAC TTAGGCAAAT GTATGATGAA AAGCTATGGG TTGCACCTGC GGGTAGCACG GGTTTTGTAG ATGTGCGCGA TGTTGCCATG GCGCATATTG CGGCGTGGGA AAAAGGGAAA TCGGGGGAGC GTTACATTGT GGTAGGGCAT AACGTGAGCT TCCATGAATT ACTCAGCCGT TTATCAGCTC TTAACAATGG TGTTGCCGCT AAGGTGCTTA CGGTGCCCCG TTCGGTGGGA ATGGTTGCAG CCCTTGGTGG CGAAGCGTGG TCGCTCTTGA CGGGAAATCC TTCGTTTATT GCTTTTGAAA GTATTGGCAC CTCAGCGCGG CAGTTGGCAT ACAATAATGA GCGTTCACTT TGCGAGTTGG GCATTGCGTA TCACGATTTA GAAGAGACAT TTCAAACAAT TCTTAAATAA
Protein sequence | MSKLSIALTG ATGYIGSQVL LELLKRFKGE LDCRVLVRGS SNYAWLEALP VQVIAADVLE PIALHEALRG VDTLFHCAGL VSWTRRFRSQ LYEVNVVGTR NVLHAALYNG VRRVVMTSSI AAVGMSEDGA PANEAALFKE WQRRNGYMEA KHLAELEALR AVAEGLDVVL LNPGVVIGVD HHNPASLSSS NRTLRQMYDE KLWVAPAGST GFVDVRDVAM AHIAAWEKGK SGERYIVVGH NVSFHELLSR LSALNNGVAA KVLTVPRSVG MVAALGGEAW SLLTGNPSFI AFESIGTSAR QLAYNNERSL CELGIAYHDL EETFQTILK
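The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be normalised before computing length or GC fraction follows. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its GC value is not expected to reproduce the 52% reported for the full gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence from this record.
first_blocks = "ATGTCAAAAC TCTCCATTGC TCTTACCGGT"
seq = clean_sequence(first_blocks)
print(len(seq))          # 30
print(gc_fraction(seq))  # GC fraction of these 30 bases only
```

Applying the same two functions to the complete gene sequence gives a value to compare with the Gene Length and GC content fields.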