Gene Rcas_3454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3454 
Symbol 
ID5540953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4512026 
End bp4514218 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content63% 
IMG OID640895572 
Productshort chain dehydrogenase 
Protein accessionYP_001433522 
Protein GI156743393 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.117557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCA ACCCATCGCG ATTCCGCCAC GTTCATGACG GCTGGGACGA CGCCTGTGCC 
GCGACGCTCG ATCCGGTCGG GCGGCTGGTC TATCGTTCTA ACCTGCTGGG CAGCGATCAG
CGCATCACCA ATACCGGCGG CGGCAACACA TCGGCGAAGA TCATGGAACG TGATCCGTTG
ACCGGCGAGT CGGTCGAGGT CTTGTGGGTC AAGGGGTCCG GCGGCGACCT GCGCACGAGT
ACGCGGGCGA ACTTTGCTTC GCTCGATCTG AAAAAACTCT TCACGCTGCG CTCGATCTAC
CTGCGCGATC CGTTGCGCGG GCCGAAGAGC GCCGCCGAAG ATGCAATGGT CAATCTCTAT
CCGCACTGCA CGTTCAACCT GAATCCGCGT GCTTCTTCGA TTGATACGCC GCTCCACGCC
TTCATTCCCT ACCGCCACGT CGATCACATG CATCCGAATG CGGTGATCGC CATCGCTGCT
GCGCGCAACG GCGAGCGCCT GACCCGCGCG ATCTATGGTG ATGAAGTGAT CTGGACGCCC
TGGCAGCGTC CCGGTTTCGA TCTCGGACTG ACGCTCGAAC GCATTTGCCG CGAGCATCCG
CAGGCGCATG GGGTGATCCT GGGCGGGCAT GGGTTGATCA ACTGGGCTGA TGATGACCGG
GAGTGCTACG AGCGTACCCT CGATCTGATC GAGCGCGCCG CCACGTACAT CGAGGAGCGC
GACCGCGGCG CTGCAACGTT CGGCGGACCG AAGTGCGAAG CGTTGCCGTT CGACCGCCGA
CGCGCGGCAT TTGCGACCAT TCTGCCCTGG CTGCGCGGTC AGATCAGTCG GGAGCGACGG
TTCATTGCGA CGATTCAGGA CGATGACGCC ACGCTGCGCT TTGTCAACAG CGTCGATGCG
CCGCGTCTGG CGGAACTGGG AACCAGTTGC CCCGACCATT TCCTGCGCAC AAAAATCAAG
CCGCTCTATG TCGATTGGCA TCCGCACAAC GAAACGCTCG ACGACCTGAA GCGCAAACTG
TCGGATGGGC TGGAACGGTA TCGCGCCGAT TACACGCGCT ACTACGAGAC CTTCCGCCAT
AGCGACTCGC CGCCGATGCG CGATCCCAAC CCGACGGTCG TTCTGATCCC TGGTTTGGGC
ATGATCGCCT GGGGCAAGGA CAAGAGCGAG TCGCGGGTCA CGGCGGAGTT CTACACCGCT
GCCATCGAGG TGATGCGCGG CGCCGAAGCG ATTGATGAGT ATGTCGCGCT ACCGCTGCAA
GAGGCGTTCG ATATCGAATA CTGGCAATTG GAAGAGGCGA AACTGCGCCG GATGCCGCCG
GAGAAAGAAC TGGCGCGGCA GGTGATCGCC GTCATTGGCA GCGGCAGCGG CATTGGGCGC
GAGGTCGCGC TACGCCTCGC CGATGAGGGC GCCCACGTGG TGTGCGTCGA TAAGGACGAA
GCCGCCGCCA GTGCAACCGC GCAGATGATT ATCGACCGGC ACGGCATGGG CATCGGCGTG
GCGGGTAGTG ACATTTCCGC CTGCGGACCG GCGATCAACC TGGCTGCCGA CATTACGGAT
CGCGACAGTG TGCGCCGTAT GGTGCAGCAC CTTCTCCTCG CCTACGGCGG ACTCGATGCG
GTTGCGATCA CGGCGGGCAT CTTCGTGGCG CCCGATGTGG AAGGACGCAT CCGTGATGAT
CAGTGGGCGC TCACCTTCGC AATCAATGTT ACCGGGCAGT ATATCGTCGC CGATGAAGCC
GCAGCGATCT GGCGCGCGCA GGGGTTGCCC GCCAGCCTGG TGCTGACTAC CTCGGTCAAT
GCGGTGGTGG CGAAAAAGGG TTCGCTGGCA TACGATACAA GCAAAGCTGC CGCCAACCAT
CTTGTGCGCG AACTGGCGAT CGATCTTGCC CCGCTGGTGC GGGTCAACGG TGTCGCGCCG
GCGACGGTTG TGCAAGGGAG CAGCATGTTC CCCCGCGAGC GGGTGATAGC ATCGCTCACC
AAGTACGGCA TTCCCTTCAA GCCGGATGAA CCGACCGGGG CGTTGACGGC GAAACTGGCG
CAGTTTTACG CCGACCGATC GCTGCTCAAG CGACCCGTGA CGCCGACCGA TCAGGCGGAA
GCCTTCTTCC TCTTGCTCAG CCGGCGACTG GGACAGACAA CCGGCCAGAT CATCACGGTG
GACGGCGGGT TGCACGAAGC ATTCCTGCGG TGA
 
Protein sequence
MSANPSRFRH VHDGWDDACA ATLDPVGRLV YRSNLLGSDQ RITNTGGGNT SAKIMERDPL 
TGESVEVLWV KGSGGDLRTS TRANFASLDL KKLFTLRSIY LRDPLRGPKS AAEDAMVNLY
PHCTFNLNPR ASSIDTPLHA FIPYRHVDHM HPNAVIAIAA ARNGERLTRA IYGDEVIWTP
WQRPGFDLGL TLERICREHP QAHGVILGGH GLINWADDDR ECYERTLDLI ERAATYIEER
DRGAATFGGP KCEALPFDRR RAAFATILPW LRGQISRERR FIATIQDDDA TLRFVNSVDA
PRLAELGTSC PDHFLRTKIK PLYVDWHPHN ETLDDLKRKL SDGLERYRAD YTRYYETFRH
SDSPPMRDPN PTVVLIPGLG MIAWGKDKSE SRVTAEFYTA AIEVMRGAEA IDEYVALPLQ
EAFDIEYWQL EEAKLRRMPP EKELARQVIA VIGSGSGIGR EVALRLADEG AHVVCVDKDE
AAASATAQMI IDRHGMGIGV AGSDISACGP AINLAADITD RDSVRRMVQH LLLAYGGLDA
VAITAGIFVA PDVEGRIRDD QWALTFAINV TGQYIVADEA AAIWRAQGLP ASLVLTTSVN
AVVAKKGSLA YDTSKAAANH LVRELAIDLA PLVRVNGVAP ATVVQGSSMF PRERVIASLT
KYGIPFKPDE PTGALTAKLA QFYADRSLLK RPVTPTDQAE AFFLLLSRRL GQTTGQIITV
DGGLHEAFLR