Gene Rcas_3193 details

Gene Information

Locus tag: Rcas_3193
Symbol:
ID: 5540691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4151383
End bp: 4152384
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 62%
IMG OID: 640895314
Product: aldo/keto reductase
Protein accession: YP_001433265
Protein GI: 156743136
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
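
The coordinates and lengths in this record are mutually consistent, which a short calculation can confirm. The Python sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record for Rcas_3193.
length_bp = gene_length_bp(4151383, 4152384)
length_aa = protein_length_aa(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values from this record, the computed lengths agree with the listed 1002 bp and 333 aa.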


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0849672
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.13692
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATACC GTTTCTTTGG GCGCACCGGC GTGCGCGTGG CGCCGTTGTG CATCGGCGCA 
ATGAATTTTG GCAACCCGAC CGATGACGCC GAATCGATGC GCATCATTGA CCGCGCCATT
GACGCCGGTA TCACCATGAT CGACACTGCT GACAGTTACA ACCATGGCGA AAGCGAACGC
ATCATCGGGC GCGCACTGGC GCGCAACGGC AGGCGCGACC AGGTGTTCCT GGCGACGAAG
GGGCATTTCC CAACGGGACC TGGTCCTAAT GACCGGGGCA ACTCGCGTCT GCACCTGATA
CGCGCCTGTG AAGATAGTCT GCGTCGGTTG CAAACCGATC ATATCGATCT GTACCAGATC
CATCGTCCTG ATCCAAACAC GCCGGTCGAA GAGACCCTCG CGGCACTGAC CGATCTGGTG
CGCCAGGGGA AGGTGCGCTA TATCGGTTGT TCAACGCACC CGGCGTGGCG CGTCATGGAA
GCGATCATGG TCAGCGAGTT GAAGGGGTAT GCGCGTTACG TCTCGGAGCA ACCGCCGTAC
AATCTGCTTG ATCGGCGCAT CGAGAACGAA CTGCTGCCGC TCTGCCAGGC GCACGGTCTG
GCAATCATCC CGTGGGCGCC GCTGGCGCAG GGGGTGCTGG CAGGGCGCTA TACCGACATT
ACCGCGCCGC CGCCGGATTC GCGCGTTGTG CTGCGTGGTG GCATCTATGC CGAGCGTGTC
ACTGCGCGCG GCATCGAGGT CGGTCGTGCA TTTGCCGCAC TCGCGCGCGA GCATGGTCTG
ACGCCAGCGC AACTCGCCAT TTTGTGGGTC AAAGATCAGC CTGGTGTCAC CGCGCCCATT
GTTGGTGTGC GCACCCTGGC GCAACTCGAG GAAATACTGC CGGTTTTGGA GATGGCGCTG
AGCGCTGATC TGCGCGCCGC GTGCGACGCA CTCGTGCCGC CGGGAAGCGC GGTCGTCGAT
TTCCACAACA CAGCAGGTTG GATGAAGATG CGTATTGCCT GA
 
Protein sequence
MEYRFFGRTG VRVAPLCIGA MNFGNPTDDA ESMRIIDRAI DAGITMIDTA DSYNHGESER 
IIGRALARNG RRDQVFLATK GHFPTGPGPN DRGNSRLHLI RACEDSLRRL QTDHIDLYQI
HRPDPNTPVE ETLAALTDLV RQGKVRYIGC STHPAWRVME AIMVSELKGY ARYVSEQPPY
NLLDRRIENE LLPLCQAHGL AIIPWAPLAQ GVLAGRYTDI TAPPPDSRVV LRGGIYAERV
TARGIEVGRA FAALAREHGL TPAQLAILWV KDQPGVTAPI VGVRTLAQLE EILPVLEMAL
SADLRAACDA LVPPGSAVVD FHNTAGWMKM RIA
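
As a sanity check on the two sequences, the first 60 bases of the gene sequence can be translated and compared with the first 20 residues of the protein sequence. The Python sketch below is illustrative only. It builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The helper name `translate` is an assumption and not a tool provided by the database.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored.

    Stop codons are rendered as '*'. Trailing bases that do not fill
    a complete codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record.
first_60_bases = (
    "ATGGAATACC GTTTCTTTGG GCGCACCGGC GTGCGCGTGG CGCCGTTGTG CATCGGCGCA"
)
# First 20 residues of the protein sequence, copied from the record.
first_20_residues = "MEYRFFGRTG VRVAPLCIGA".replace(" ", "")

peptide = translate(first_60_bases)
print(peptide)
print("matches record:", peptide == first_20_residues)
```

The same function can be applied to the full 1002 bp sequence; the final codon, TGA, is a stop codon and appears as `*` at the end of the output.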