Gene Cag_1906 details


Gene Information

Locus tag: Cag_1906
Symbol:
ID: 3747651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2425075
End bp: 2426754
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 49%
IMG OID: 637774443
Product: dihydroxy-acid dehydratase
Protein accession: YP_380199
Protein GI: 78189861
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
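
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2425075
end_bp = 2426754
reported_gene_length = 1680     # bp
reported_protein_length = 559   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1680 0 559
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.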


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATCCG ACACCATAAA ATCGGGCTTT GAAAAAGCTC CCCATCGTAG CCTTTTAAAA 
GCTACAGGCG CAATCCGCTC AAGCAGCGAT TACCGCAAGC CGTTTATTGG CATCTGCAAT
TCATATAATG AGTTAATTCC CGGTCATACC CATTTGCAAG AGCTGGGACG CATTGCGAAA
GAGGCGGTAC GCGAAGCGGG CGGTGTGCCT TTTGAGTTCA ACACCATTGG CGTTTGTGAT
GGCATTGCTA TGGGGCATAT TGGGATGCGC TACTCGCTTG CAAGCCGTGA GTTAATTGCT
GATAGCGTTG AAACCGTTGC CGAAGCGCAT CGGCTTGATG GCTTAGTCTG TATTCCAAAT
TGTGATAAAA TCACCCCGGG TATGATGATG GCTGCACTAC GCATTAACAT TCCTGTGATT
TTTGTTTCAG GCGGACCAAT GAAAGCTGGT CATACTCCCG AAGGTAAAAC GGTGGACTTA
ATTTCGGTTT TTGAAGCGGT TGGGCAATGC AGCAACGGCT CAATAACAGA AGGTGAACTG
CAAAATATTG AGGAGCATGC CTGCCCGGGT TGCGGCTCAT GCTCAGGCAT GTTTACCGCA
AATTCCATGA ACTGCTTAAG CGAAGCGCTT GGTTTTGCCT TACCGGGTAA CGGCACCATT
GTCGCTGAAG ATCCTCGTCG GCTGGAGTTA GTAAAAGCTG CCTCACGCCG CATTGTGGAT
TTAGTAGAGA ACAATGTGCG TCCACGCGAT ATTTTAACGC GCCAAGCGTT GCTCAATGCC
TTTGCGCTCG ATTTTGCTAT GGGCGGCAGC ACCAACACTA TTTTGCATAC GCTTGCCATT
GCGAATGAAG CGGGTTTGAG TTTCGACTTT AGCGAGTTAA ACGCTCTTTC AGCGAAAACG
CCTTACATCT GCCAAGTAAG CCCGGCTACT ATGGCGGTGC ATATTGAGGA CGTTGATCGT
GCGGGTGGCA TTTCCGCTAT TTTAAAAGAG TTAAGCTCGA TTGATGGGTT GCTTGATCTT
TCAGCAATAA CGGTAACAGG TAAAACGTTA GGCGAAAATA TTGCCAACGC CGAAGTGCTC
GACCGCAGCG TTATTCGCAG CATCAGCGAT CCCTATTCCG CAACGGGTGG CTTGGCGGTG
CTTTATGGCA ATTTAGCGCC ACAAGGTGCG GTGGTAAAAA CGGGTGCGGT AAGCCCACAA
ATGATGCAGC ATAGCGGTCC CGCTAAAGTG TATAATGCTC AAGATGATGC TATTAAAGGC
ATTATGGAGG GTGATGTAAA AGCTGGCGAT GTGGTGGTAA TTCGCTACGA AGGTCCAAAA
GGAGGTCCAG GAATGCCTGA AATGCTCTCG CCAACCAGCG CCATTATGGG GCGCGGACTT
GGTGATTCTG TTGCACTCAT TACCGATGGA CGCTTTTCAG GCGGATCACG AGGAGCTTGC
ATTGGGCACG TTTCCCCTGA AGCGGCAGAA CGTGGACCAA TTGCCGCCCT GCAAAATGGC
GATATTATCA CCATTGATAT TCCTGCACGC ACCATGTCGG TTGCGTTGAG CGAATCAACT
ATCAAGGAAC GCTTAGCACA ATTGCCGCCA TTTGAACCTA AAATTAAACG AGGCTATTTA
GCTCGCTATG CGCAATTAGT AACCTCAGCC AACACGGGTG CAATTTTAGG GCACCTCTAA
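
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation such as GC content. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input, so the figure it prints is for that line alone and not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks, keep letters only, and uppercase them."""
    return "".join(ch for ch in text if ch.isalpha()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGAGATCCG ACACCATAAA ATCGGGCTTT GAAAAAGCTC CCCATCGTAG CCTTTTAAAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1680 bp sequence through the same two functions would give a value to compare against the reported GC content field.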
 
Protein sequence
MRSDTIKSGF EKAPHRSLLK ATGAIRSSSD YRKPFIGICN SYNELIPGHT HLQELGRIAK 
EAVREAGGVP FEFNTIGVCD GIAMGHIGMR YSLASRELIA DSVETVAEAH RLDGLVCIPN
CDKITPGMMM AALRINIPVI FVSGGPMKAG HTPEGKTVDL ISVFEAVGQC SNGSITEGEL
QNIEEHACPG CGSCSGMFTA NSMNCLSEAL GFALPGNGTI VAEDPRRLEL VKAASRRIVD
LVENNVRPRD ILTRQALLNA FALDFAMGGS TNTILHTLAI ANEAGLSFDF SELNALSAKT
PYICQVSPAT MAVHIEDVDR AGGISAILKE LSSIDGLLDL SAITVTGKTL GENIANAEVL
DRSVIRSISD PYSATGGLAV LYGNLAPQGA VVKTGAVSPQ MMQHSGPAKV YNAQDDAIKG
IMEGDVKAGD VVVIRYEGPK GGPGMPEMLS PTSAIMGRGL GDSVALITDG RFSGGSRGAC
IGHVSPEAAE RGPIAALQNG DIITIDIPAR TMSVALSEST IKERLAQLPP FEPKIKRGYL
ARYAQLVTSA NTGAILGHL
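
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, not the database's own method: it builds the standard codon table (translation table 11 uses the same codon-to-amino-acid assignments and differs in which codons may start translation), and the hypothetical `translate` helper does not handle alternative start codons or ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(ch for ch in dna if ch.isalpha()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 20 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGATCCG ACACCATAAA"))  # prints: MRSDTI
print(translate("GGGCACCTCTAA"))           # prints: GHL
```

The two fragments match the first six and last three residues of the protein sequence listed above.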