Gene Rcas_2149 details

Gene Information

Locus tag: Rcas_2149
Symbol:
ID: 5539629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2759985
End bp: 2761223
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 62%
IMG OID: 640894283
Product: 2-alkenal reductase
Protein accession: YP_001432252
Protein GI: 156742123
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
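
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the annotated range includes the stop codon; both are assumptions that happen to reproduce the values in this record, not something the record states. The function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
gene_len = feature_length(2759985, 2761223)
print(gene_len)                  # 1239
print(protein_length(gene_len))  # 412
```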


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0134489
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00552688
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAGCGTG GAACGAAATA TGCGCTGATC TTCGGCATGG TGGTTACCCT GGTAATCGGT 
GCGCTTATCG GCGCTCTGGC CGGCGGCGGC GTTGCCTGGT ATGTAACGCA GCAGCAGATT
GAGCGGATTG CAGCACAACC GTCGACGCCC GCGCCAATTC CGGCGTCAAT GCCGGCGACG
ACGGTCGTTC CGCAAGCAAC TGACGTGCCG CTGCCAACTC CTGCCCAGGT CCCGACGCCG
GCGCCAGCGG CGCCGGCAAC GACTTCACCG GTTGTTGAGG CGGTTCAGAA GGTGTCGCCG
GCGGTTGTGA CGGTCGTGAA CACGCTGGCA TCAGGTGCGC AGGGATCGCC GCTGCTTGGC
GATCTACCGT TTCCGCTGCC GGATCAACCC GGCGGTTCAG TGCGCCGCGG CAGCGGTTCT
GGGGTCATTA TCAGCCCGGA TGGGTATATT CTTACCAACA ATCACGTGAT TGAAGGGTAT
CGCTCGCTCT CGGTCATTTT CTACGACGGT TCGCGCCGTG ATGCAACATT GGTCGGCGCC
GATCCACTGA TGGATCTTGC CGTGGTCAAG GTCGATGGTC CGGTTCCCGG CGTGGCGACG
CTGGGCGACT CCGACGCGCT CCAACCCGGT GAAACGGTCA TTGCGATTGG CAGCCCGCTT
GGCGACTTCC GCAACACGGT GACGGTTGGC GTGGTGAGCG CTCTCAACCG TTCGCTTGGC
GCCGACGCAC CCGAAGGATT GATCCAGACT GATGCGGCGA TCAACAGCGG CAACAGCGGC
GGTCCACTGA TCAATCTGCG CGGTGAAGTC GTCGGGATCA ATACGCTCGT CGTGCGGGGG
AGCGGTTTGG GAACGGCGCC CATCGAAGGG CTTGGGTTTG CAGTGCCAAG CTCGATTGCC
AGGCGGGTGA GCGAGCAGTT GATCGCCAAT GGCAAAATCG TTTACCCGTT CCTCGGTGTG
CGTTTTGGCA CAATCGATGC TATGCTGGCG CTCGATAACG ATCTGCCGGT CAATGCTGGC
GCACTGATCT CCGCTGTCGA GCCGGGTGGA CCGGCTGCCC GCGCCGGGTT GCGCAGCGGT
GACATTGTGA CCAAAGTTGA TGGAAAGACG ATTGGACCGG GGCAGTCGTT GCGTGCTCTG
TTGCTGGAGT ACAAACCGGG CGACACGGTT ACGCTCGAGG TGTTGCGTAA TGGTGAACGG
CTGTCGTTGG ACGTGACTCT GGGGACGCGC CCGGATTGA
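
A GC content figure like the one in this record can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; the record does not say how its value was derived or rounded. The function name `gc_content` is hypothetical. Whitespace is stripped so the space-separated 10-base blocks can be pasted in directly.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 6 of 10 bases are G or C.
print(gc_content("ATGGAGCGTG"))  # 60.0
```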
 
Protein sequence
MERGTKYALI FGMVVTLVIG ALIGALAGGG VAWYVTQQQI ERIAAQPSTP APIPASMPAT 
TVVPQATDVP LPTPAQVPTP APAAPATTSP VVEAVQKVSP AVVTVVNTLA SGAQGSPLLG
DLPFPLPDQP GGSVRRGSGS GVIISPDGYI LTNNHVIEGY RSLSVIFYDG SRRDATLVGA
DPLMDLAVVK VDGPVPGVAT LGDSDALQPG ETVIAIGSPL GDFRNTVTVG VVSALNRSLG
ADAPEGLIQT DAAINSGNSG GPLINLRGEV VGINTLVVRG SGLGTAPIEG LGFAVPSSIA
RRVSEQLIAN GKIVYPFLGV RFGTIDAMLA LDNDLPVNAG ALISAVEPGG PAARAGLRSG
DIVTKVDGKT IGPGQSLRAL LLEYKPGDTV TLEVLRNGER LSLDVTLGTR PD
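
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGAGCGTG GAACGAAATA TGCGCTGATC"))  # MERGTKYALI
```

Translating the final nine bases, `CCGGATTGA`, gives `PD` followed by a stop, which matches the end of the protein sequence.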