Gene Rcas_1990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1990 
Symbol 
ID5539468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2554644 
End bp2555816 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content60% 
IMG OID640894125 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_001432096 
Protein GI156741967 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.352675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0179199 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCGC CGCGAGAGTT TCTCATGCCT GCGCAGGTTC TGATCGGCTC AGGCGCTGCC 
GAGCAGGTAG GGGCGCAGTG TCAGAAGCGC GGTTGGAAGA AAGCCCTGAT CGTGTCCGAC
AAGATCATGC TCAGTCTGGG GTTAGTCGGC AAGGTGGAGC AGTATCTTGC CGACAGCGGC
ATTGCCAGCG CCGTATATGC TGGTGTCAAC ACCGAACCGG TGGTGGAATA TGTTCAGGAA
GGGTTGCAGG TATACAAAGA AGGCAACTGC GATTTTGTCG TAGCGGTTGG CGGCGGCAGT
CCCATCGATA CCGCAAAAGC CATCGCGGTG TTGGTCACGA ACCCCGGTTC AATTGAGCAG
TACAAAGGGA TCGGCAAAAT CGGCGCTCCC GGCGTGCCGG TCGTCGCCAT TCCCACCACT
GCCGGCACCG GCAGCGAGGC GACGGTCTAT ACTGTCATCA CCGACCAGAA GACCGATGTA
AAAATGTTGA TCGGAAGTCC TTACCTGATG CCGACGATCG CCATCGTTGA TCCGATGCTG
ACCGTCTCTT CGCCGCAGAG TGTCACAGCG GCAACCGGCG TTGATGCGCT GGTGCATGCC
ATCGAGGCGT ATGTGTCGGT GAAGCGCCAG CCGATGACCG ATATCTTTTG CCTGTCGGCG
ATCCGGTTGA TTGCGCAAAG CATTCGCCAG GCGTGGGCGA ACGGCAACAA TATGGAGGCG
CGTGAACAGA TGATGCTCGG CGCCTTTCAG GCAGGCGTGG CGTTCAGCAA CTCATCGGTG
GCGTTGGTGC ATGGGATGTC GCGCCCCATC GGCGCGCACT TCCATATCGC GCACGGCGTA
TCGAACGCGG CATTGCTCGC AGTCGTCACC GAGTTCAGTC TGATCGGCGA TCCGCTGCGA
TATGCGCAGA TTGCCGAAGC CATGGGGGAG CCGGTTCAGG GGTTGTCCAT GATGGAAGCC
GCCGACCGCG TGGTCCATGC CATCCGTCGT CTGGTCAGCG ACATCAAGAT TCCCTCGCTA
AGGCAACTGG GAGTCGAGCG TGAGCGCCTG ATCGAACTGG CGCCGTCGAT GGCGGATGCG
GCGATTGATA GCGGAAGCCC GGCAAATAAC CCGCGCAAGC CCACGCGTCA GGAAATCATC
GATCTCTACG TCAAGGCGTA TGACGAGGCG TAG
 
Protein sequence
MMPPREFLMP AQVLIGSGAA EQVGAQCQKR GWKKALIVSD KIMLSLGLVG KVEQYLADSG 
IASAVYAGVN TEPVVEYVQE GLQVYKEGNC DFVVAVGGGS PIDTAKAIAV LVTNPGSIEQ
YKGIGKIGAP GVPVVAIPTT AGTGSEATVY TVITDQKTDV KMLIGSPYLM PTIAIVDPML
TVSSPQSVTA ATGVDALVHA IEAYVSVKRQ PMTDIFCLSA IRLIAQSIRQ AWANGNNMEA
REQMMLGAFQ AGVAFSNSSV ALVHGMSRPI GAHFHIAHGV SNAALLAVVT EFSLIGDPLR
YAQIAEAMGE PVQGLSMMEA ADRVVHAIRR LVSDIKIPSL RQLGVERERL IELAPSMADA
AIDSGSPANN PRKPTRQEII DLYVKAYDEA