Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0854 |
Symbol | |
ID | 4570448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 977433 |
End bp | 979130 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639765452 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_911329 |
Protein GI | 119356685 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATCCG ATACCATAAA AAAAGGGTTT GATAAAGCCC CCCATCGCAG CCTTCTTAAA GCCACCGGAG TCATAACGTC GTCCGATGAC TATCAAAAGC CGTTTATCGG CATCTGCAAC TCATTCAACG AACTGATTCC CGGGCATGCC CACTTGCAGG AACTGGGAAG AATCGCAAAG AATGAGGTGC GCAAAGCCGG TGGAATTCCC TTTGAATTCA ATACTATCGG GGTCTGCGAC GGCATCGCCA TGGGCCATAT CGGCATGCGC TACTCGCTTG CAAGCCGTGA ACTCATTGCC GACAGCGTTG AAACCGTCGT TGAGGCGCAC CGGCTCGACG GAATCGTCTG TATACCGAAC TGCGACAAGA TCACGCCAGG CATGATGATG GCGGCACTCC GCGTCAACAT TCCGGTCATC TTTGTTTCCG GCGGACCGAT GAAAGCCGGT CATACTCCGG ATGGAAAAAC TGTAGATCTC ATTTCAGTCT TCGAAGCCGT TGGAAGGCAC AGCACAGCCG AAATCACCGA CGGCGAACTC CAGACGATCG AAGAGAACGC CTGTCCCGGT TGCGGATCAT GCTCAGGAAT GTTTACAGCC AACTCGATGA ACTGCCTTAG CGAAGCGCTC GGTCTGGCGC TGCCCGGAAA CGGAACCATC CTTGCCTCCG ACCCAAGGCG CAACGAGCTG GTAAAAGAAG CTTCGCGAAA AATCATCGAC CTTGTCAGGA GCAACACGAG GCCACGCGAC ATTCTTTCAA GAAAAGCGCT GCTCAATGCC TTTGCCCTCG ATTTTGCCAT GGGAGGCAGC ACCAATACCA TTCTGCACAC CCTGGCCATA GCAAATGAAG CCGAACTTGA CTTCGATTTC TCGGAGCTCA ACGCCCTCTC TGCAAAAACA CCTTATATCT GCAAGGTAAG CCCGGCAACC ATGGCTGTAC ATATTGAAGA CGTCGATCGG GCAGGTGGAG TTTCAGCAAT TCTCCTGGAG CTCAGCAAGA TAGATGGACT TCTCGATCTG TCGGCACCGA CAGTAAGCGG GAAAACCCTC GGCGAAAACA TTGCCGGCGC AGAGATCAAG GACGAAAAGG TCATTCGTAC CATCGACAAC CCTTACTCTG CCACAGGCGG TCTTGCCGTT CTTTACGGAA ACCTCGCACC CCAGGGAGCT GTGGTCAAAA CCGGCGCGGT AAGCCCATCG ATGATGCGGC ATACCGGACC GGCAAAGGTC TTCGACTGCC AGGATGACGC CATCAAAGGC ATCATGGAAG ACATCATCAA ACCGGGAGAT GTTGTTGTGA TCCGTTACGA AGGCCCAAAG GGCGGCCCCG GCATGCCGGA AATGCTCTCG CCGACAAGCG CAATCATGGG TCGGGGTCTT GGTGACTCGG TTGCTCTTAT CACCGACGGA CGATTCTCCG GGGGGTCAAG AGGAGCCTGT ATCGGCCACG TTTCTCCTGA AGCAGCCGAA AACGGCCCGA TCGCCGCACT GAAAAACGGC GACATGATCA CCATTGACAT TCCCGCAAGA ACCATTTCGG TCGATCTCTC AACAGAAGCA ATAAATGAAA GAATTGCTCT TCTGCCGGTT TTTGAGCCGA AAATCAAAAA AGGGTATCTG GCAAGATATG CGCAACTTGT CACGTCAGCC TGCACCGGCG CTATACTGAA AACATCTCCT TACTGTGAAC CAAAATAA
|
Protein sequence | MRSDTIKKGF DKAPHRSLLK ATGVITSSDD YQKPFIGICN SFNELIPGHA HLQELGRIAK NEVRKAGGIP FEFNTIGVCD GIAMGHIGMR YSLASRELIA DSVETVVEAH RLDGIVCIPN CDKITPGMMM AALRVNIPVI FVSGGPMKAG HTPDGKTVDL ISVFEAVGRH STAEITDGEL QTIEENACPG CGSCSGMFTA NSMNCLSEAL GLALPGNGTI LASDPRRNEL VKEASRKIID LVRSNTRPRD ILSRKALLNA FALDFAMGGS TNTILHTLAI ANEAELDFDF SELNALSAKT PYICKVSPAT MAVHIEDVDR AGGVSAILLE LSKIDGLLDL SAPTVSGKTL GENIAGAEIK DEKVIRTIDN PYSATGGLAV LYGNLAPQGA VVKTGAVSPS MMRHTGPAKV FDCQDDAIKG IMEDIIKPGD VVVIRYEGPK GGPGMPEMLS PTSAIMGRGL GDSVALITDG RFSGGSRGAC IGHVSPEAAE NGPIAALKNG DMITIDIPAR TISVDLSTEA INERIALLPV FEPKIKKGYL ARYAQLVTSA CTGAILKTSP YCEPK
|
| |