Gene Rcas_3747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3747 
Symbol 
ID5541249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4916366 
End bp4917337 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content63% 
IMG OID640895858 
Productalcohol dehydrogenase 
Protein accessionYP_001433805 
Protein GI156743676 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00165792 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000793402 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGACGA CACGCGCCAT CGTCATCACG GCGCCACAGC ATCTGGAGTT GCGCAACGTG 
GCGCTGACAG ACGCAGGACC TGAGGAGGTC GTGGTGCAAA CCGCGTTTAC CTCGGTCAGC
GCCGGAACAG AGCGCATGTT GCTGGCGGGA CGAATGCCAC ACCCAATGTT GCAGTTTCCG
GTGGTGCCGG GGTACGAAAC GGTCGGTCGG GTCGTTGGAC GGGGCGCAGC GGTTCCGGCA
GAATACGAGG GGCGCTGGGT GTATGTCGGC GGGGCGCGCT GCTTCCGTGA TGTCAACCCG
GCGTGGGGTG GCCAATCGGC GATCCTGCTG GTCGATTACC GCCGGGTGGT TCCGCTCGAT
GGCGTCTCGC CGGATCACGG CGTGCTATTG GCGCTGGCGG CAACTGCGCT GCACGGCGTC
GATCTGATTG CCGGCGCCAA CCATGATCTG ACCGGTCGAC GGATACTGGT GCTGGGGCAG
GGTCCGGTGG GACAGTTTGC GGCGCGGATT GCGCGGGCAC GTGGCGCATG GGTGGCTGTC
GGCGACCGGA TTGCCAGTCG CCTTGAACGG AGTGTAGCGG ACCTGCGCAT TGACGTTACG
ACCGACTCGC TCGTAGCGGC AGTGCCGCAA CCGGTCGATA CAATTATCGA GGCAACCGGA
TCGATGGCAG CGTTGAACGA TGCGCTGCCA CTTCTTGCGA ACGATGGCAC ACTGCTGTTA
CTCGGATACT ACGACGAACT GCGCCTGCCG TATATGCCGT TGTTCCTCAA ACAGGCGCGA
TTGCTGACTG CGAAGGAATG GGCGGCGGGC GATCTCCAAC GGAGCCGTGA TCTGCTGGCG
AGCGGTATGC TCGACGCTGC GGCGCTGATC ACGCATCGCA TGCCGGTAGC GCAGTTCGAG
GCAGCGTATG CCACAGCGCT GAACGACCCG GAGTGCTTGA AACTGGTGAT CGAGTGGGAT
GTTTCAGGCT GA
 
Protein sequence
METTRAIVIT APQHLELRNV ALTDAGPEEV VVQTAFTSVS AGTERMLLAG RMPHPMLQFP 
VVPGYETVGR VVGRGAAVPA EYEGRWVYVG GARCFRDVNP AWGGQSAILL VDYRRVVPLD
GVSPDHGVLL ALAATALHGV DLIAGANHDL TGRRILVLGQ GPVGQFAARI ARARGAWVAV
GDRIASRLER SVADLRIDVT TDSLVAAVPQ PVDTIIEATG SMAALNDALP LLANDGTLLL
LGYYDELRLP YMPLFLKQAR LLTAKEWAAG DLQRSRDLLA SGMLDAAALI THRMPVAQFE
AAYATALNDP ECLKLVIEWD VSG