Gene RoseRS_3074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3074 
Symbol 
ID5210042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3862887 
End bp3863888 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content62% 
IMG OID640596665 
Productaldo/keto reductase 
Protein accessionYP_001277387 
Protein GI148657182 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00558733 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00303424 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGTACC GTCTGTTTGG TCGCACCGGC GTGCGTGTGG CGCCATTGTG CATTGGCGCG 
ATGAACTTCG GCAATCCAAC CGACGAAGCG GAAGCGCTGC GCATCATTGA TCGCGCCCTC
GACGCCGGGA TCAATATGTT CGACACCGCC AACAGTTACA ACAACGGGGA GAGTGAACGC
ATCATCGGGC GCGCCCTGGC GCGCGACGGA AAGCGTGACC GGGTGTTCCT CGCCACCAAG
GGACATTTTC CCGTCGGACC CGGACCCAAT GATCGGGGCA ATTCGCGCCT GCACCTGATG
CGCGCCTGTG AGGACAGCCT CCGCCGCTTG CAGACCGATC ATATCGATCT TTATCAGATC
CATCGTCCCG ATCCCGCCAC ACCGGTCGAA GAGACCCTGG CAGCGCTGAC CGATCTGGTG
CGTCAGGGAA AGGTGCGCTA TGTCGGGTGT TCGACCCACC CCGCCTGGCG AGTGATGGAA
GCGCTGATGG TAAGCGAGTT GAAGGGGTAT GTGCGCTACG TCTCGGAGCA ACCGCCCTAC
AACCTGCTTG ATCGACGCAT CGAAAATGAA CTGTTGCCGC TCTGTCAAAC GTATGGTCTG
GCAATTATTC CGTGGGCGCC GCTGGCTCAA GGGGTGCTGG CGGGGCGCTA TACCGATATT
GCTGCGCCTC CGCCCGACTC GCGCGTCGCC CTGCGCGGCG GCATCTATGC CGAACGAGTC
ACTGCGCGCG GCATCGAGGT CGGGCGCGCC TTTGCCGGGC TTGCGCGCGA GCATGGTCTC
ACACCTGCGC AGCTTGCTCT GCTGTGGGTC AAGGATCAAC CCGGCATTAC GGCGCCGATC
TTCGGTGTGC GCACCATTGC GCAACTGGAA GAAGCGCTGC CGGTGCTGGA GATGACGTTG
AGCGATGATC TGCGCGTCGC GTGTGATGCG CTCGTGCCGC CGGGCAGCGC GGTCGTCGAT
TTCCACAACA CATCGGGCTG GATGAAGATG CGACTTCCGT AA
 
Protein sequence
MEYRLFGRTG VRVAPLCIGA MNFGNPTDEA EALRIIDRAL DAGINMFDTA NSYNNGESER 
IIGRALARDG KRDRVFLATK GHFPVGPGPN DRGNSRLHLM RACEDSLRRL QTDHIDLYQI
HRPDPATPVE ETLAALTDLV RQGKVRYVGC STHPAWRVME ALMVSELKGY VRYVSEQPPY
NLLDRRIENE LLPLCQTYGL AIIPWAPLAQ GVLAGRYTDI AAPPPDSRVA LRGGIYAERV
TARGIEVGRA FAGLAREHGL TPAQLALLWV KDQPGITAPI FGVRTIAQLE EALPVLEMTL
SDDLRVACDA LVPPGSAVVD FHNTSGWMKM RLP