Gene Clim_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1972 
Symbol 
ID6355476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2188655 
End bp2190772 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content56% 
IMG OID642669570 
Productshort chain dehydrogenase 
Protein accessionYP_001943983 
Protein GI189347454 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000839125 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATC TTTGGAACGA CACGGATCTT CGGTGCTCGG TGAACGGGCA GTGCAGTGCG 
GACGATATTC CTGCGGAACT TGCCGAACTG GTTTATGCCT CCCGTCTGCT TGGAAGGGAG
AGCAGTCTGG TGATGCATGG CGGCGGCAAT ACCTCGGTGA AAAGCGAGCT GCACGACATT
ATCGGTAACC GGGTTAACGT TATTTTCATC AAAGGAAGCG GTGTCGATCT GGCCGCTCTG
GACGCACACG ATTTCACACC GGTCAGGATC GAACCCCTGC AGAAGCTGCA GCATCTCTAT
GCTACCGGAG AGCGTCGCAG CGAGGAAGAT ATGCAGCGGT TTTCGACGAG GGAGTTCAAG
AACTTTCTCT ATCTGAATCT TTTCTATCTG ACCGATCACA TGGTGAACAA CTCGCTTTCA
CCTTCGATCG AAACTCTGCT GCATGCGTTT CTTCCCCACC GGTTTATTTT TCATACCCAT
TCGACAGCGC TGTTGACGCT CAGCAATCAG CCGAACGGCG CCGAACTCTG CAGGGAGGTG
CTCGGAGAGG AGTTCGGTCT TGTGCCCTAT ATCAAGCCGG GTCTCGGTCT TGCGCGTTCT
GCTGCAGAGG CATACGGAAA CGCTCCGGAC ATCAGGGGTC TTGTCCTTCA GAAACACGGT
CTTGTAACGC TGGCCGACAG TGCGGCTGCC GCCTATGACT GCATGATAGA ATGCGTTTCG
AAACTTGAAG AGCGCATAGC CAGGGCCGGA AGAAACGTTT TCCCCTCGAT CTCCCTGCCG
GAGACAGTCG CTCTGCTCGA AGATGTCGCA CCCGTTATCA GAGGCGCGGT TGTCGAAGAA
AAATCTCCCG GCACGTTCGA ATACAACCAG TTTGTGCTTG ATTTCCGTTC ATCCCCGGAT
ATACTGCAAT ACGTGAATGG TATCGAGCTT GAAGAGGTAA GCGGCAGAGG CGCCATGACG
CCGGATTTCA TTATCCGGAC AAAAAACCGG CCGCTCGTCG TTCCGGCTCC GGACGCTCTT
GATCCCGAGG GGTTCAAGAG TGCCGTTCAT GAGGCCGTCG AGCGGTATAA AGCGGAATAT
CTCGCTTATT TTCAGCGTCA GCAGCAGGCT TCGGGCATGC AGGTGACGAT GCTCGATCCG
CTTCCGAGGG TGGTTCTTGT CCCCGGGCTC GGACTGTTCG GTCTCGGAAG GACGGCCCAT
GCGGCCTCGG TCAATGCCGA TATCGCCGAA AGCACCGCCT CGGCGATTCT CGATGCGCAA
TCGGTCGGAA CCTTCGAGTC GATTACCGAG AGGGATGTTT TCAATATCGA GTACTGGGAG
ATGGAGCAGG CGAAAATGAA GAAAGTCCGT CACGATGTAT TTGCCGGCAA GGTAGCTCTC
GTTACCGGCG CGGCAAGCGG TATCGGGCTC GCTACGGCCA AGGCGTTCCG GCAGAGGGGG
GCGGAGCTGG TAATCGTCGA TCTGAACCCG GAGGCACTTG AACGTGCCTC CGCAGAGCTT
GGCGGAGGGG TACTCTCCAT AGCGTGCGAC GTTACCGACC GCAATGCGGT AAAGCGGGCC
TTTGACGCTG TCTGCCGCCG ATTCGGAGGT CTCGACATTC TTGTTTCGAA TGTAGGAGTC
GCACTGCAGG GAAGGATCGG CGACGTTGCG GACGAAGTGC TTCGCCGGAG TTTCGAATTG
AATTTCTTTT CGCATCAGTC CATTGCGCAG CAGGCGGTCA GAATCATGAA ACTGCAGGGC
ACCGGTGGCG TTCTGCTTTT CAACGTATCC AAGCAGGCGG TAAACCCCGG CCCCGATTTC
GGCCCTTACG GGTTGCCCAA AGCGGCAACG ATGTTTCTCG TGCGCCAGTA TGCGCTTGAT
CACGGACGCG ACGGCATTCG GGCCAACGGG ATCAATGCCG ACCGTATCCG TACCGGTCTT
CTGACCGATG AAATGATCAA AACCCGCTCG AAAGCCCGAG GGCTGAGCGA ACGGGAGTAC
ATGGCCGGCA ACCTGCTTCA GGTTGAGGTC ACGGCCGAAG ACGTTGCCGA GGCATTCGTG
CACCAGGCGC TTGAAACAAA AACGACCGGC TCTATCGTCA CGGTTGACGG TGGCAACATT
GCTGCCGCCC TTCGTTGA
 
Protein sequence
MQNLWNDTDL RCSVNGQCSA DDIPAELAEL VYASRLLGRE SSLVMHGGGN TSVKSELHDI 
IGNRVNVIFI KGSGVDLAAL DAHDFTPVRI EPLQKLQHLY ATGERRSEED MQRFSTREFK
NFLYLNLFYL TDHMVNNSLS PSIETLLHAF LPHRFIFHTH STALLTLSNQ PNGAELCREV
LGEEFGLVPY IKPGLGLARS AAEAYGNAPD IRGLVLQKHG LVTLADSAAA AYDCMIECVS
KLEERIARAG RNVFPSISLP ETVALLEDVA PVIRGAVVEE KSPGTFEYNQ FVLDFRSSPD
ILQYVNGIEL EEVSGRGAMT PDFIIRTKNR PLVVPAPDAL DPEGFKSAVH EAVERYKAEY
LAYFQRQQQA SGMQVTMLDP LPRVVLVPGL GLFGLGRTAH AASVNADIAE STASAILDAQ
SVGTFESITE RDVFNIEYWE MEQAKMKKVR HDVFAGKVAL VTGAASGIGL ATAKAFRQRG
AELVIVDLNP EALERASAEL GGGVLSIACD VTDRNAVKRA FDAVCRRFGG LDILVSNVGV
ALQGRIGDVA DEVLRRSFEL NFFSHQSIAQ QAVRIMKLQG TGGVLLFNVS KQAVNPGPDF
GPYGLPKAAT MFLVRQYALD HGRDGIRANG INADRIRTGL LTDEMIKTRS KARGLSEREY
MAGNLLQVEV TAEDVAEAFV HQALETKTTG SIVTVDGGNI AAALR