Gene Clim_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1109 
Symbol 
ID6355751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1210314 
End bp1211270 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content53% 
IMG OID642668726 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_001943157 
Protein GI189346628 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGGCG AAGCAAAAGC CATCGTTCTG CCGAAAGCCA GCAAACTGAG ACTGCAGAAC 
ACTCCATACC ATATCGGCAA TCCCGGCGAT CTGCTGGTAA AAACCATTGC AAGCACCATA
ACTCCCGGCC TTGACCGGCT GCTCCTGACC AATAAACCGG TATCGCACAA GGTGCTCGAA
TATCCGATCA TTCCCGGAAG TGAGTCCATA GGACAGGTCG TTGAAGCTGG TCCTGATACC
CGCGAGGTGG AGACGGGCGA TTTCGTCTAT GTCTTCAGGG GAGACAGGTG GAGCGGCGTG
GAAAGCTACT ACGGATGCCA TGCTGAGCTC ATCCCGACCT CTTCAGAAAA CGTACTGCCA
CTGGGCCGCC CACCCATTCA CCGCGATCTT CTTACAGGGC TTCTGGCCTA CGTTATCAGT
GCGCTTGACA AGGTCCCGAT CGATCCCTCG ATGCGGGTGC TCATCCTCGG ACTTGGTTCG
GTTGGACTGA TGATTTCCGA GTACCTGTTC CAGAAAGGGT GTCTGCATGT CGATGCAGTC
GAAACATTCG GCATCAGGGG ACAGCTATCC CGGGCAGAGC ACATTGCGCT GGAGATCGGA
GACTTTACCG CCGAATTCAA CGACCGGTAT GACCTGGTTA TCGAAACCAC CGGACGCATT
CTCATGATCG AGAAGGCGAT GCGCCTGATG AAGCATCAGG CAAAAGTGCT CCTCATGGGC
AACTATGAGG TGATGGCCTA CGATTACCGG TTGATTCAGC ACAAGGAACC GGCGATCATC
TGTTCCAATA TCTCAACTTT CCGGCATATA CAGAGAGCCT CCACGCTGCT CGATACCGGA
GAGCTCGACA CCGAAAAATT TTTCACCAAC GTTTTTCCGG TCAGCCAGTT TGAATTCGCA
TACCGTATCG CTCTTGACAG CAAGGAGGCT ATAAAGACCG TAATAAGCTG GGTATAA
 
Protein sequence
MKGEAKAIVL PKASKLRLQN TPYHIGNPGD LLVKTIASTI TPGLDRLLLT NKPVSHKVLE 
YPIIPGSESI GQVVEAGPDT REVETGDFVY VFRGDRWSGV ESYYGCHAEL IPTSSENVLP
LGRPPIHRDL LTGLLAYVIS ALDKVPIDPS MRVLILGLGS VGLMISEYLF QKGCLHVDAV
ETFGIRGQLS RAEHIALEIG DFTAEFNDRY DLVIETTGRI LMIEKAMRLM KHQAKVLLMG
NYEVMAYDYR LIQHKEPAII CSNISTFRHI QRASTLLDTG ELDTEKFFTN VFPVSQFEFA
YRIALDSKEA IKTVISWV