Gene Clim_1588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1588 
Symbol 
ID6354236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1709836 
End bp1710846 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content53% 
IMG OID642669190 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001943612 
Protein GI189347083 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTTC TGGTTACCGG CGCAGCCGGT TTTATCGGTT CACATGTCTG TCAACGGCTT 
CTTGAAAGAG GAGAGCGTGT GACAGGGCTT GATAACCTGA ATGATTATTA TGATGTGAGC
CTGAAGGAGG CCCGTCTTGA CTGGCTCAGG CCATATGCTG ATTTCCGGTT TGTTAAAACC
GATCTTGCCG ACCGGCAGGG CATGGAAGAG CTTTTTCGCA AAGGCGGATT TGAAAAAGTG
GTTAATCTTG CCGCTCAGGC CGGGGTTCGT TATTCCATTG TCAATCCGCA CTCCTATGTC
GAAAGCAATA TTCTGGGATT TCTGAATATT CTCGAAGGGT GTCGTCATAA CGGCGTGGAG
CATCTCGTTT ATGCATCGTC AAGTTCGGTC TACGGCGCGA ACGAAACTAT GCCGTTTTCG
GTGCACGACA ATGTCGATCA CCCGCTCTCT CTATACGCAG CCAGCAAGAA AGCCAACGAA
CTGATGGCGC ATACATACAG CCATCTCTAC AACATTTCCG CAACAGGACT GCGCTTCTTT
ACCGTATATG GCCCGTGGGG ACGTCCCGAT ATGGCGCTCT TTCTCTTTAC CGATGCCATT
CTGAACAACC GCCCGATCAA GGTGTTCAAC TATGGCAAAC ACCGGCGAGA TTTCACCTAC
ATCGACGACA TCGTCGAGGG GGTGATCCGG ACGCTCGATC ACAATGCCGA AAGCAATCCT
GAGTGGTCCG GGCTGCACCC TGATCCCGGA TCGAGCCGTG CGCCGTGGAA GGTGTACAAC
ATCGGCAACA GCCAGCCGGT CAACCTGATG GACTACATCG GGGCGCTCGA ACGGCAGCTC
GGCAAAACAG CGGAAAAGGA GTTTCTGCCC ATGCAGCCGG GTGACGTGCC CGACACCTAT
GCCGATGTCG AGCAGCTCAT ACAGGATGTG CATTATAAAC CGGAAACTAC CGTGGAGGAA
GGTGTCAGAC GGTTTGTTGC CTGGTATCGG GATTATTATG ATGTCAGGTA G
 
Protein sequence
MNVLVTGAAG FIGSHVCQRL LERGERVTGL DNLNDYYDVS LKEARLDWLR PYADFRFVKT 
DLADRQGMEE LFRKGGFEKV VNLAAQAGVR YSIVNPHSYV ESNILGFLNI LEGCRHNGVE
HLVYASSSSV YGANETMPFS VHDNVDHPLS LYAASKKANE LMAHTYSHLY NISATGLRFF
TVYGPWGRPD MALFLFTDAI LNNRPIKVFN YGKHRRDFTY IDDIVEGVIR TLDHNAESNP
EWSGLHPDPG SSRAPWKVYN IGNSQPVNLM DYIGALERQL GKTAEKEFLP MQPGDVPDTY
ADVEQLIQDV HYKPETTVEE GVRRFVAWYR DYYDVR