Gene Clim_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1778 
Symbol 
ID6354607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1955563 
End bp1956612 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content52% 
IMG OID642669381 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_001943796 
Protein GI189347267 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00040311 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATATAC TCGTCACCGG CGGCGCCGGA TTTATCGGAT CTCATGTAGT CCGGCATTTT 
TTGAGAGCAT ATCCTGACTA TACAGTCACC AATCTCGACA GTCTTACCTA TGCGGGCAAT
CTTGCGAATC TCCGGGATGT AGAGCACCAG CCGAACTATC GTTTTGTTAA AGGCGATATT
ACCGATGCCC TTTTTCTGGA GAGGCTTTTC GAAGAGTTCC GGTTCGACGG GGTGATCCAT
CTTGCGGCGG AGTCGCATGT GGACCGTTCG ATAGCCAGTC CTGTCGAATT CGTGACGACC
AATGTGCTCG GAACGGTGAA TCTGCTCAAT GCCGCCCGCC GCTGCTGGGC AGGAAGTTTC
GGGGGAAAGC GGTTCTATCA CATTTCGACC GATGAGGTTT ACGGTTCGCT TGGCAGCGAA
GGGATGTTCA CCGAAGAGAC CGCTTATGAT CCGCACAGTC CGTATTCGGC ATCGAAAGCG
TCGTCGGATC ATTTTGTCAG GGCCTGGCAC GATACCTACG GGCTGCCCGT CGTGATCAGC
AATTGTTCGA ACAACTACGG TTCGTACCAG TTTCCCGAGA AGCTGATTCC GCTTTTTATC
AACAACATCC GGACGAAAAA GCCCCTGCCG GTTTACGGAA AGGGTGAAAA TGTGCGCGAC
TGGCTCTGGG TGGTCGACCA CGCTTCGGCT ATCGACGTTA TTTATCATGA AGGTGTTGAT
GGTGAAACAT ATAATATCGG CGGGCACAAC GAGTGGAAGA ACATCGATCT GATTCTTCTG
TTGTGCAGGA TCATGGATGA AAAACTGGGA AGGAAACCGG GCGAGTCGGC AGAGCTGATA
ACCTATGTGA CCGATCGTGC AGGGCATGAT CTGCGTTATG CGATCGATTC TTCGAAACTG
CAGCGTGAGT TGGGATGGGA GCCCTCCATA CGGTTTGAGG AGGGGCTGGA GAAAACCGTT
GCATGGTATC TCGGTAACCA GGAGTGGCTC GATCAGGTAA CTTCCGGAGA TTATCGCGAT
TATTACGAAA ACATGTATGC TTCTTCCTGA
 
Protein sequence
MHILVTGGAG FIGSHVVRHF LRAYPDYTVT NLDSLTYAGN LANLRDVEHQ PNYRFVKGDI 
TDALFLERLF EEFRFDGVIH LAAESHVDRS IASPVEFVTT NVLGTVNLLN AARRCWAGSF
GGKRFYHIST DEVYGSLGSE GMFTEETAYD PHSPYSASKA SSDHFVRAWH DTYGLPVVIS
NCSNNYGSYQ FPEKLIPLFI NNIRTKKPLP VYGKGENVRD WLWVVDHASA IDVIYHEGVD
GETYNIGGHN EWKNIDLILL LCRIMDEKLG RKPGESAELI TYVTDRAGHD LRYAIDSSKL
QRELGWEPSI RFEEGLEKTV AWYLGNQEWL DQVTSGDYRD YYENMYASS