Gene Cpha266_0854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0854 
Symbol 
ID4570448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp977433 
End bp979130 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content53% 
IMG OID639765452 
Productdihydroxy-acid dehydratase 
Protein accessionYP_911329 
Protein GI119356685 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATCCG ATACCATAAA AAAAGGGTTT GATAAAGCCC CCCATCGCAG CCTTCTTAAA 
GCCACCGGAG TCATAACGTC GTCCGATGAC TATCAAAAGC CGTTTATCGG CATCTGCAAC
TCATTCAACG AACTGATTCC CGGGCATGCC CACTTGCAGG AACTGGGAAG AATCGCAAAG
AATGAGGTGC GCAAAGCCGG TGGAATTCCC TTTGAATTCA ATACTATCGG GGTCTGCGAC
GGCATCGCCA TGGGCCATAT CGGCATGCGC TACTCGCTTG CAAGCCGTGA ACTCATTGCC
GACAGCGTTG AAACCGTCGT TGAGGCGCAC CGGCTCGACG GAATCGTCTG TATACCGAAC
TGCGACAAGA TCACGCCAGG CATGATGATG GCGGCACTCC GCGTCAACAT TCCGGTCATC
TTTGTTTCCG GCGGACCGAT GAAAGCCGGT CATACTCCGG ATGGAAAAAC TGTAGATCTC
ATTTCAGTCT TCGAAGCCGT TGGAAGGCAC AGCACAGCCG AAATCACCGA CGGCGAACTC
CAGACGATCG AAGAGAACGC CTGTCCCGGT TGCGGATCAT GCTCAGGAAT GTTTACAGCC
AACTCGATGA ACTGCCTTAG CGAAGCGCTC GGTCTGGCGC TGCCCGGAAA CGGAACCATC
CTTGCCTCCG ACCCAAGGCG CAACGAGCTG GTAAAAGAAG CTTCGCGAAA AATCATCGAC
CTTGTCAGGA GCAACACGAG GCCACGCGAC ATTCTTTCAA GAAAAGCGCT GCTCAATGCC
TTTGCCCTCG ATTTTGCCAT GGGAGGCAGC ACCAATACCA TTCTGCACAC CCTGGCCATA
GCAAATGAAG CCGAACTTGA CTTCGATTTC TCGGAGCTCA ACGCCCTCTC TGCAAAAACA
CCTTATATCT GCAAGGTAAG CCCGGCAACC ATGGCTGTAC ATATTGAAGA CGTCGATCGG
GCAGGTGGAG TTTCAGCAAT TCTCCTGGAG CTCAGCAAGA TAGATGGACT TCTCGATCTG
TCGGCACCGA CAGTAAGCGG GAAAACCCTC GGCGAAAACA TTGCCGGCGC AGAGATCAAG
GACGAAAAGG TCATTCGTAC CATCGACAAC CCTTACTCTG CCACAGGCGG TCTTGCCGTT
CTTTACGGAA ACCTCGCACC CCAGGGAGCT GTGGTCAAAA CCGGCGCGGT AAGCCCATCG
ATGATGCGGC ATACCGGACC GGCAAAGGTC TTCGACTGCC AGGATGACGC CATCAAAGGC
ATCATGGAAG ACATCATCAA ACCGGGAGAT GTTGTTGTGA TCCGTTACGA AGGCCCAAAG
GGCGGCCCCG GCATGCCGGA AATGCTCTCG CCGACAAGCG CAATCATGGG TCGGGGTCTT
GGTGACTCGG TTGCTCTTAT CACCGACGGA CGATTCTCCG GGGGGTCAAG AGGAGCCTGT
ATCGGCCACG TTTCTCCTGA AGCAGCCGAA AACGGCCCGA TCGCCGCACT GAAAAACGGC
GACATGATCA CCATTGACAT TCCCGCAAGA ACCATTTCGG TCGATCTCTC AACAGAAGCA
ATAAATGAAA GAATTGCTCT TCTGCCGGTT TTTGAGCCGA AAATCAAAAA AGGGTATCTG
GCAAGATATG CGCAACTTGT CACGTCAGCC TGCACCGGCG CTATACTGAA AACATCTCCT
TACTGTGAAC CAAAATAA
 
Protein sequence
MRSDTIKKGF DKAPHRSLLK ATGVITSSDD YQKPFIGICN SFNELIPGHA HLQELGRIAK 
NEVRKAGGIP FEFNTIGVCD GIAMGHIGMR YSLASRELIA DSVETVVEAH RLDGIVCIPN
CDKITPGMMM AALRVNIPVI FVSGGPMKAG HTPDGKTVDL ISVFEAVGRH STAEITDGEL
QTIEENACPG CGSCSGMFTA NSMNCLSEAL GLALPGNGTI LASDPRRNEL VKEASRKIID
LVRSNTRPRD ILSRKALLNA FALDFAMGGS TNTILHTLAI ANEAELDFDF SELNALSAKT
PYICKVSPAT MAVHIEDVDR AGGVSAILLE LSKIDGLLDL SAPTVSGKTL GENIAGAEIK
DEKVIRTIDN PYSATGGLAV LYGNLAPQGA VVKTGAVSPS MMRHTGPAKV FDCQDDAIKG
IMEDIIKPGD VVVIRYEGPK GGPGMPEMLS PTSAIMGRGL GDSVALITDG RFSGGSRGAC
IGHVSPEAAE NGPIAALKNG DMITIDIPAR TISVDLSTEA INERIALLPV FEPKIKKGYL
ARYAQLVTSA CTGAILKTSP YCEPK