Gene Clim_2151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2151 
Symbol 
ID6355945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2370287 
End bp2371297 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content56% 
IMG OID642669742 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001944154 
Protein GI189347625 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.111318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAAG GTGCGGGTGT TGAAAAAGTG CTGGTTACCG GTTCGACCGG GTTTATCGGA 
AAGCGGCTGG TTGCTGCCCT GCTGGAAAAA GGATTCGTTG TCCGGGTATT CCTTCGCAAC
GAGAGCGTTT CGGAAGGTCT TTTTCCGGAA TCAGTAGAGG TGGTGCGCGG TGGCTACCAT
GATCGGGCAG CTCTTGCTGC GGCCGTGGAG GGCGTTCAGC GTATTATCCA TCTTGCCGGG
GTTACCAAGG CTGCCGATGA AGCCGGTTTT GATTCCGGCA ATGTCTATCC TGCAGAGCAG
ATGCTTGAAG CCGTGAAGCG GTATAATCCC GATCTGAAGC GTTTTCTGCT GGTATCGTCT
CTTGCTGCGG CCGGTCCGGC GCGTGAGGGA AGTGTCGGCC TGAGGGAGAG CGATGCCCCT
CAGCCGGTGA GCGCTTACGG ACAAAGCAAG CTCAGAGGCG AGGAGGTCTG CCTGAAACGT
GCAGGTTCCG TTCCTGTAAC CATTGTCAGG CCGCCGGCGG TATATGGTCC GGGAGACCGA
GATATACTGC AGATTTTTCA GATGATGCAG AAAGGTATGC TTGTTTCCGC CGGCAATGCA
TCGAAGCAGC GTTTCAGTAT GATCTATGTC GATGATCTTG TCGCCGGTAT CATCGTCGCA
GCCTGCGCGG AAAAGGCGGC AGGGCGGATA TATTACCTGG CCGCTCCACG GCCGTATTCG
TGGGAGGATC TCATTGCCGC CGTAAAGCCG GCTATTGGGT TCAGCCGACT GATGAGAATT
ACGCTTCCCA AACCTCTGGT GTTTGTGCTT GGCGCCATTC TTGGCACATT CGGAGCCATA
ACCGGAACGC TGCCGCTGAT CAATCGCGAC AAAGCCAATG AACTTGTTCA GGATTTCTGG
GTCTGTTCTC CTGACCGGGC GGCAGCGGAA CTTGGTTTTC ATGCTGAAAC CTCGCTTGAG
GATGGGGCGG CCAAAACGGT TGCCTGGTAT CGGGATAAGG GGTGGATGTA G
 
Protein sequence
MGKGAGVEKV LVTGSTGFIG KRLVAALLEK GFVVRVFLRN ESVSEGLFPE SVEVVRGGYH 
DRAALAAAVE GVQRIIHLAG VTKAADEAGF DSGNVYPAEQ MLEAVKRYNP DLKRFLLVSS
LAAAGPAREG SVGLRESDAP QPVSAYGQSK LRGEEVCLKR AGSVPVTIVR PPAVYGPGDR
DILQIFQMMQ KGMLVSAGNA SKQRFSMIYV DDLVAGIIVA ACAEKAAGRI YYLAAPRPYS
WEDLIAAVKP AIGFSRLMRI TLPKPLVFVL GAILGTFGAI TGTLPLINRD KANELVQDFW
VCSPDRAAAE LGFHAETSLE DGAAKTVAWY RDKGWM