Gene RoseRS_1084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1084 
Symbol 
ID5208031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1349156 
End bp1350304 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content63% 
IMG OID640594698 
Productoxidoreductase domain-containing protein 
Protein accessionYP_001275442 
Protein GI148655237 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.413196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAGA TCGCCGCCGC TGTTGTCGGT ACAGGATTTA TCGGCGTTGT GCACGTCGAG 
GCGCTCCGCC GGCTCGGCAT TCCGGTGCTC GGCGTGGTCG GCTCGAGCGT CGAACGCGCC
CGCGCGAAAG CCGATGCAAT GGGCATTTCG GTCTATGCCA GTTTTGAGGA TATGCTGGCT
GATCCGCGGG TCACTGTGGT TCACATCACC ACCCCGAACT ACCTGCACTT CCCGCAGGTC
AAAGCGGCAA TCGCCGCCGG TAAGCACGTA GTCTGCGAAA AGCCCCTGGC GATGACGTCC
GCCGAGTCGG CGGAACTGCT GCACCTGGCA ACCGAAGCAG GCATCGTCCA CGCCGTCAAT
TTCAATATTC GCTTCTACCC GCTCTGCCAG CACGCCCGCG CCCTGGTGCA GGCAGGTGAC
ATCGGCGCAC CGCGCATTAT TCAGGGATCG TACCTCCAGG ACTGGCTCAT GCTGCCCACC
GACTGGAACT GGCGGCTGGA GCCGGAACTC GGCGGCGAAT TGCGCGCCGT GGCCGATATT
GGTTCGCACT GGCTCGACCT GACGACCTTC ATCACCGGCG AGAAGGTCAG TGCAGTGATG
GCCGACCTGG CAACATTCAT TCCGGTGCGC CGCAAACCCA CCCGACCTAT CGACACCTTC
ACCGGCAAAG AGGTAACCGT TACCGAGACG ATCGATCAAC CTATCCATAC TGAAGACTAT
GCGAGCATCC TGCTGCGCTT CGCAAGCGGT GCACGCGGCG TCCTGACAGT GTCACAGGTC
AGTCCGGGGC GCAAAAACCG GCTGAGTTTC GAGATCGATG GCGAGCGTTC GGCGCTGGCG
TGGGATTCGG AGCGTCCTGA GGAGTTGTGG TTGGGGCGTC GCGAGACCGC AAGCGGACTG
TTGCTGCGCG ATCCGGCGTT ACTGCTCCCG GCTGCCCGCA GCACAACCGA TTACCCCGGC
GGTCATGCCG AAGGCTTCCC CGATACCTTC AAACAACTCT ACAAAGCGGT CTACCGCGCG
GTTGCCGCAG GCGCGCCGCC AACGACGCCC GACTACCCGA CATTCGCCGA CGGTCATGAG
GAACTACTGC TGGGAGAAGC GATCCTGCGC AGCGCGCGCG AGGAGCGCTG GGTGGCAATC
GAACGCTAA
 
Protein sequence
MPEIAAAVVG TGFIGVVHVE ALRRLGIPVL GVVGSSVERA RAKADAMGIS VYASFEDMLA 
DPRVTVVHIT TPNYLHFPQV KAAIAAGKHV VCEKPLAMTS AESAELLHLA TEAGIVHAVN
FNIRFYPLCQ HARALVQAGD IGAPRIIQGS YLQDWLMLPT DWNWRLEPEL GGELRAVADI
GSHWLDLTTF ITGEKVSAVM ADLATFIPVR RKPTRPIDTF TGKEVTVTET IDQPIHTEDY
ASILLRFASG ARGVLTVSQV SPGRKNRLSF EIDGERSALA WDSERPEELW LGRRETASGL
LLRDPALLLP AARSTTDYPG GHAEGFPDTF KQLYKAVYRA VAAGAPPTTP DYPTFADGHE
ELLLGEAILR SAREERWVAI ER