Gene RoseRS_4046 details

Gene Information

Locus tag: RoseRS_4046
Symbol:
ID: 5211029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5068776
End bp: 5069822
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 61%
IMG OID: 640597634
Product: hypothetical protein
Protein accession: YP_001278340
Protein GI: 148658135
COG category: [C] Energy production and conversion
COG ID: [COG1592] Rubrerythrin
TIGRFAM ID:
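
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single stop codon at the end of the CDS; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information record above.
START_BP = 5068776
END_BP = 5069822


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1047 348
```

Both results agree with the Gene Length (1047 bp) and Protein Length (348 aa) fields of the record.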


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATTG TTGAGTTTTC ACCAGACATA CGGATTGGTG CTATGAGCGA CACTCCCAAT 
GATGCTGCCC GGATGTTCGC TCTTGCGAGC ATGGGACATG CCGCATACCA TCTCTGGGCG
GAACAGGCGC GTCGGGATCG CTGTTTCAAC ATTGCGCGTC TGTTCGAGGC GTTGAGCGCT
GCGCGTCTGG CGCGCGCCGG GAACGCCTTC CGCCGTCTGG GGCTTGTGCG TTCGACGGCG
GAGAATGTTG CCAGCGCTTT TTCCGGTGCA GGCATCGGCG ACATTCCTGC CGACCGGATC
ACCGGCGTGA CGCCGTTTGC GCGGGAACTG CTGGCGCGGG CGCAGCGCGC CGTGGCTGAA
GGGCGCGATC TGCGCGCCGG CGAACTGGGT GATCTCTTCG TCTGCACCAC GTGCGGCGAG
ATCCGCGAAG GTGCGCTCGA AGGCGCGTGT CCGCGCTGTG GCACAGTTCC TGAAGCGCAC
AAAGCGTTCC GCGCCATCGA AGCAATGGGA ACGCTTGGTC CGCACGCAAT TATGACCTTT
CTGGAACATA CGGAGGAGGC GATCCGAACG CTGGTGGCAG GGCTGGACGA GGAGATGCTC
TCCCGGCGCC TGAATGAAAC CACACCGTCG TTGAAAGAGG TGATCGGGCA TCTTGCCGAT
ATGGACGCAA TCTTTCGTCA GCGCGCCTGG TTGCTGCTCG AGACCGTGCG ACCGGTTCTT
CCGCCAGCGC ATCCTCCAAC CCTGGAATCG GCGGATGTGT ATCGTGACCA ACCGATTGAC
CGGGTGATGG AAGCCTATCA CGCAACGCGG GCGCAAACCC TGAACCTGCT GCGCGGATTG
ACCAGCGCGG CGTGGCATCG GGAAGGGGAC CACGAGGTGT ATGGAGTGAT CAATCTGTTG
CATCAGGCGA ACTGGCTTAT ATCGCACGAA CGTGCGTATC TCGTTGAAAT GGCGCAGATC
CGTCATAACC TGATCGCCGC CGATCGGCGC TATTGCGAAG CGGAAGTGAC CGATATCGTT
GTGACCGGCT CGCACGAAGG AGAGTGA
 
Protein sequence
MDIVEFSPDI RIGAMSDTPN DAARMFALAS MGHAAYHLWA EQARRDRCFN IARLFEALSA 
ARLARAGNAF RRLGLVRSTA ENVASAFSGA GIGDIPADRI TGVTPFAREL LARAQRAVAE
GRDLRAGELG DLFVCTTCGE IREGALEGAC PRCGTVPEAH KAFRAIEAMG TLGPHAIMTF
LEHTEEAIRT LVAGLDEEML SRRLNETTPS LKEVIGHLAD MDAIFRQRAW LLLETVRPVL
PPAHPPTLES ADVYRDQPID RVMEAYHATR AQTLNLLRGL TSAAWHREGD HEVYGVINLL
HQANWLISHE RAYLVEMAQI RHNLIAADRR YCEAEVTDIV VTGSHEGE
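
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the sequence is given 5' to 3' in the coding frame and uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons; this gene starts with ATG). The helper names are hypothetical, and only the first and last 10-base blocks of the gene are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 27 bases of the gene sequence listed above.
first_block = "ATGGATATTG TTGAGTTTTC ACCAGACATA"
last_block = "GTGACCGGCT CGCACGAAGG AGAGTGA"

print(translate(first_block))  # MDIVEFSPDI
print(translate(last_block))   # VTGSHEGE
```

The two outputs match the first ten and last eight residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.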