Gene Information |
Locus tag | Cag_1906 |
Symbol | |
ID | 3747651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2425075 |
End bp | 2426754 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637774443 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_380199 |
Protein GI | 78189861 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
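
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2425075, 2426754)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1680 559
```

Under these assumptions the computed values agree with the listed gene length of 1680 bp and protein length of 559 aa.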
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATCCG ACACCATAAA ATCGGGCTTT GAAAAAGCTC CCCATCGTAG CCTTTTAAAA GCTACAGGCG CAATCCGCTC AAGCAGCGAT TACCGCAAGC CGTTTATTGG CATCTGCAAT TCATATAATG AGTTAATTCC CGGTCATACC CATTTGCAAG AGCTGGGACG CATTGCGAAA GAGGCGGTAC GCGAAGCGGG CGGTGTGCCT TTTGAGTTCA ACACCATTGG CGTTTGTGAT GGCATTGCTA TGGGGCATAT TGGGATGCGC TACTCGCTTG CAAGCCGTGA GTTAATTGCT GATAGCGTTG AAACCGTTGC CGAAGCGCAT CGGCTTGATG GCTTAGTCTG TATTCCAAAT TGTGATAAAA TCACCCCGGG TATGATGATG GCTGCACTAC GCATTAACAT TCCTGTGATT TTTGTTTCAG GCGGACCAAT GAAAGCTGGT CATACTCCCG AAGGTAAAAC GGTGGACTTA ATTTCGGTTT TTGAAGCGGT TGGGCAATGC AGCAACGGCT CAATAACAGA AGGTGAACTG CAAAATATTG AGGAGCATGC CTGCCCGGGT TGCGGCTCAT GCTCAGGCAT GTTTACCGCA AATTCCATGA ACTGCTTAAG CGAAGCGCTT GGTTTTGCCT TACCGGGTAA CGGCACCATT GTCGCTGAAG ATCCTCGTCG GCTGGAGTTA GTAAAAGCTG CCTCACGCCG CATTGTGGAT TTAGTAGAGA ACAATGTGCG TCCACGCGAT ATTTTAACGC GCCAAGCGTT GCTCAATGCC TTTGCGCTCG ATTTTGCTAT GGGCGGCAGC ACCAACACTA TTTTGCATAC GCTTGCCATT GCGAATGAAG CGGGTTTGAG TTTCGACTTT AGCGAGTTAA ACGCTCTTTC AGCGAAAACG CCTTACATCT GCCAAGTAAG CCCGGCTACT ATGGCGGTGC ATATTGAGGA CGTTGATCGT GCGGGTGGCA TTTCCGCTAT TTTAAAAGAG TTAAGCTCGA TTGATGGGTT GCTTGATCTT TCAGCAATAA CGGTAACAGG TAAAACGTTA GGCGAAAATA TTGCCAACGC CGAAGTGCTC GACCGCAGCG TTATTCGCAG CATCAGCGAT CCCTATTCCG CAACGGGTGG CTTGGCGGTG CTTTATGGCA ATTTAGCGCC ACAAGGTGCG GTGGTAAAAA CGGGTGCGGT AAGCCCACAA ATGATGCAGC ATAGCGGTCC CGCTAAAGTG TATAATGCTC AAGATGATGC TATTAAAGGC ATTATGGAGG GTGATGTAAA AGCTGGCGAT GTGGTGGTAA TTCGCTACGA AGGTCCAAAA GGAGGTCCAG GAATGCCTGA AATGCTCTCG CCAACCAGCG CCATTATGGG GCGCGGACTT GGTGATTCTG TTGCACTCAT TACCGATGGA CGCTTTTCAG GCGGATCACG AGGAGCTTGC ATTGGGCACG TTTCCCCTGA AGCGGCAGAA CGTGGACCAA TTGCCGCCCT GCAAAATGGC GATATTATCA CCATTGATAT TCCTGCACGC ACCATGTCGG TTGCGTTGAG CGAATCAACT ATCAAGGAAC GCTTAGCACA ATTGCCGCCA TTTGAACCTA AAATTAAACG AGGCTATTTA GCTCGCTATG CGCAATTAGT AACCTCAGCC AACACGGGTG CAATTTTAGG GCACCTCTAA
|
Protein sequence | MRSDTIKSGF EKAPHRSLLK ATGAIRSSSD YRKPFIGICN SYNELIPGHT HLQELGRIAK EAVREAGGVP FEFNTIGVCD GIAMGHIGMR YSLASRELIA DSVETVAEAH RLDGLVCIPN CDKITPGMMM AALRINIPVI FVSGGPMKAG HTPEGKTVDL ISVFEAVGQC SNGSITEGEL QNIEEHACPG CGSCSGMFTA NSMNCLSEAL GFALPGNGTI VAEDPRRLEL VKAASRRIVD LVENNVRPRD ILTRQALLNA FALDFAMGGS TNTILHTLAI ANEAGLSFDF SELNALSAKT PYICQVSPAT MAVHIEDVDR AGGISAILKE LSSIDGLLDL SAITVTGKTL GENIANAEVL DRSVIRSISD PYSATGGLAV LYGNLAPQGA VVKTGAVSPQ MMQHSGPAKV YNAQDDAIKG IMEGDVKAGD VVVIRYEGPK GGPGMPEMLS PTSAIMGRGL GDSVALITDG RFSGGSRGAC IGHVSPEAAE RGPIAALQNG DIITIDIPAR TMSVALSEST IKERLAQLPP FEPKIKRGYL ARYAQLVTSA NTGAILGHL
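
The relationship between the gene sequence and the protein sequence can be illustrated with a small translator. This is a minimal sketch, not the pipeline that produced the record: it assumes the gene sequence is already given in the coding orientation (it starts with ATG even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAGATCCG ACACCATAAA ATCGGGCTTT"))  # MRSDTIKSGF
print(translate("GGGCACCTCTAA"))                      # GHL
```

The two fragments reproduce the first ten residues (MRSDTIKSGF) and the last three residues (GHL) of the listed protein sequence, with the terminal TAA acting as the stop codon.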