Gene Clim_0742 details


Gene Information

Locus tag: Clim_0742
Symbol:
ID: 6356023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 812770
End bp: 814467
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 55%
IMG OID: 642668367
Product: dihydroxy-acid dehydratase
Protein accession: YP_001942802
Protein GI: 189346273
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
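
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent if the coordinates are treated as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both of those are assumptions about how this record was generated; the sketch below only shows the arithmetic, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 812770
end_bp = 814467

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```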


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.853368
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATCTG ATACGGTAAA GACAGGATTT GAAAAAGCTC CGCACCGCAG CCTTCTGAAG 
GCTACCGGAG CTATCGCCTC CAACAATGAT TTCAGAAAGC CCTTCATCGG CATCTGCAAC
TCATTCAATG AACTTATACC CGGCCACGCC CACCTTCAGG AACTGGGAAG AATCGCAAAA
GAGGAGGTCC GCAAGGCAGG AGGCGTTCCT TTCGAGTTCA ATACCATCGG CGTTTGCGAC
GGCATCGCCA TGGGTCATAT CGGCATGCGC TACTCTCTTG CAAGCCGCGA GCTCATTGCC
GACAGTGTTG AAACCGTTGC CGAGGCACAT CGTCTTGACG GCCTGGTCTG CATACCCAAC
TGCGACAAGA TCACCCCCGG CATGATGATG GCTGCTCTGC GTATCAACAT ACCTGTTATT
TTCGTTTCGG GAGGTCCGAT GAAGGCCGGA TGTACCCCAT CGGGAAAAAC CGTTGACCTG
ATCTCGGTTT TCGAGGCCGT CGGACAGTGC AGTACCGGAG AGATCACGGA ATCCGAGCTT
GAGACCATCC AGGACAGCGC CTGTCCCGGA TGCGGATCGT GTTCCGGCAT GTTTACCGCC
AATTCAATGA ACTGCCTCTC GGAGGCTCTC GGCATAGCAC TTCCCGGTAA CGGAACGATT
CTTGCAATCG ATCCGAGACG TAACGAACTG GTCCGCGAAG CCTCGCGAAA AATTGTCGAT
CTGGTCAACA ACGACATACG GCCCAGGGAC ATCATAACCA GAAAATCCCT GCTCAACGCC
TTTGCCCTTG ATTTTGCCAT GGGGGGCAGC ACCAATACGA TCCTGCACAC CCTGGCCATC
GCCAATGAAG CGGAACTGGA TTTCGACTTC TCGGAACTCA ATGCCCTTTC GGCTAAAACG
CCGTATATCT GCAAAGTCAG TCCGGCCACC ATGGATGTCC ACATCGAGGA TGTCGATCGT
GCCGGCGGCA TTTCAGCCAT ACTGAAAGAA CTCAGCCGTG TCGACGGTCT TCTCGACCTC
TCGGCACCGA CCGTTACCGG AAAAACACTC GGAGAGAATA TTGCCGGCGC GGAAGTGCTT
GACAGAAACG TCATCAGAAG CATCGAAAAC CCATACTCCG CAACGGGCGG TCTGGCCGTT
CTTTACGGAA ATCTGGCCCC GCAGGGTGCC GTCATCAAAA CCGGCGCCGT CAGCCCTGAA
ATGATGACCC ATACCGGACC GGCAAAAGTT TACGACTCGC AGGATGAAGC CATCAAAGGC
ATCATGGACG GCGATATCTG CGCCGGAGAT GTGGTGGTTA TCCGATATGA GGGACCGAAA
GGCGGACCGG GAATGCCTGA AATGCTCTCC CCTACCAGTG CCATCATGGG GCGCGGTCTC
GGCGGTTCTG TCGCTCTCAT TACCGACGGC CGGTTCTCCG GCGGATCGAG GGGAGCATGC
ATCGGCCACG TCTCGCCGGA AGCTGCCGAA AAAGGACCGA TCGCCGCGCT GGAAAACGGC
GACATGATCA CCATCGACAT CCCGAACCGC TGTATCAGTG TCGATCTTCC GGAAACAGTC
ATTGCCGGAC GTATTGCCGC TCTCAAGCCT TTCGAGCCAA AAATCAAAAA AGGCTACCTT
GCGCGTTACG CACAACTTGT CACCTCGGCA AATACCGGGG CGATCATGAA AAACCCTGCT
TACTGTGAAT CAAAATAA
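
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAGATCTG ATACGGTAAA GACAGGATTT GAAAAAGCTC CGCACCGCAG CCTTCTGAAG"
seq = clean_sequence(first_row)

print(len(seq))
print(f"{gc_fraction(seq):.2%}")
```

Passing the complete sequence block through the same two functions would give values to compare against the 1698 bp and 55% reported in the record.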
 
Protein sequence
MRSDTVKTGF EKAPHRSLLK ATGAIASNND FRKPFIGICN SFNELIPGHA HLQELGRIAK 
EEVRKAGGVP FEFNTIGVCD GIAMGHIGMR YSLASRELIA DSVETVAEAH RLDGLVCIPN
CDKITPGMMM AALRINIPVI FVSGGPMKAG CTPSGKTVDL ISVFEAVGQC STGEITESEL
ETIQDSACPG CGSCSGMFTA NSMNCLSEAL GIALPGNGTI LAIDPRRNEL VREASRKIVD
LVNNDIRPRD IITRKSLLNA FALDFAMGGS TNTILHTLAI ANEAELDFDF SELNALSAKT
PYICKVSPAT MDVHIEDVDR AGGISAILKE LSRVDGLLDL SAPTVTGKTL GENIAGAEVL
DRNVIRSIEN PYSATGGLAV LYGNLAPQGA VIKTGAVSPE MMTHTGPAKV YDSQDEAIKG
IMDGDICAGD VVVIRYEGPK GGPGMPEMLS PTSAIMGRGL GGSVALITDG RFSGGSRGAC
IGHVSPEAAE KGPIAALENG DMITIDIPNR CISVDLPETV IAGRIAALKP FEPKIKKGYL
ARYAQLVTSA NTGAIMKNPA YCESK
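
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds a codon table by hand rather than relying on any library; it assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons, and `translate` is a hypothetical helper name. It is applied only to the first and last rows of the gene sequence.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
# Assumption: table 11 matches the standard code for these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAGATCTG ATACGGTAAA GACAGGATTT GAAAAAGCTC CGCACCGCAG CCTTCTGAAG"
last_row = "TACTGTGAAT CAAAATAA"

print(translate(first_row))  # MRSDTVKTGFEKAPHRSLLK
print(translate(last_row))   # YCESK
```

These two outputs correspond to the first 20 and last 5 residues of the protein sequence listed above.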