Gene RoseRS_4121 details

Gene Information

Locus tag: RoseRS_4121
Symbol:
ID: 5211104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5162282
End bp: 5163472
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 61%
IMG OID: 640597709
Product: ErfK/YbiS/YcfS/YnhG family protein
Protein accession: YP_001278415
Protein GI: 148658210
COG category: [S] Function unknown
COG ID: [COG1376] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
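
The length fields above follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (a common convention for such records, but not stated here) and that the unspliced CDS ends with a stop codon. The function names are illustrative only.

```python
# Values copied from the gene information fields above.
START_BP = 5162282
END_BP = 5163472


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Both results agree with the Gene Length and Protein Length fields.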


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGCA TCGCAATCCT GATCGCAACA GCCATTGCCG CGCCGATCGA ACTGCGCGCG 
CGGCAACGTC CCGCAGTCGC CGATTTCGCC CCGCCGCTCC CGGCTTCCGA GGCGCAGGCA
GTCGTCGCTG CCGCGCGCGC CGCCGAACGG ATGCAGCAGC AAAGCGCAGC GGAAGCAGTT
CCGCTGCCGG CAGTACAAAA TATGTACTTC GCGTCCTCCG GCTTTCACAT CAGTGATCGC
ACCGGTTTCC TGAGTTTCTG GCGCAGGAAC GGCGGTGAAC TGATCTTCGG CTACCCGATC
AGCGGCGAAC TGATTGAAGA CGGGCGGATC GTGCAGTACT TCGAGCGTGC ACGCTTCGAG
TACCATCCAG AAAACCTGGG CACCGAGTAT CAGGTGATGT TGTCGCTCCT CGGCAACGAG
ATCACCCAGG GGTACGATTT CCCCGACGGT CAACCGACGC AGGGGCGGCT CTACTTCCCC
GAAACCCGCC AGACGCTTGG CGGGAAGTTT CTCAGGTTCT GGCAGAAACG CGGCGGTCTG
CGCATCTTCG GCTACCCGAT CAGCGAGCCG TTCGAGGAAA TCAGCCCGAT CGACGGTCAG
GTGCGTATCA CCCAGTATTT TGAGCGCGCC CGTTTTGAAT ATCATCCCGA ACAGTTGCCC
GCGTTCTACC GTCAGATGGA ACGGGCGAAC GGGATCATAT TATCCGGGCT TCACGAAGTG
CAGCTGGGCG ATCTGGGACG CCAGGCGATG CAGCGCCGTG GCCATACCCC GCATGCGGTC
GGTCCCATGC CCGGTGCGCC CGCCTGGTCG CCCAAACTGT TCGAGCGCCG GATCGAAGTT
AATCTGACGC AACAGATGCT GACCGCTTTC GAGGGGGATG TGCCGGTCTA TCGCGCTCCC
GTCGCAACCG GTCGCGATGG CTTCAACACG CCAGCCGGAT CGTTTGCCAT TTACTACAAA
CTGCCGAGGC AAACGATGAC CGGTTCGGCT GGCGGCGAGT CGTGGTATGT CCCTGATGTG
CCATGGGTTC AGTATGTCGT TGGCGGCGTG GCGCTCCACG GCACCTACTG GCACGATGCC
TGGGGCACCG GAGTGCGCAT GTCGCATGGG TGTATCAATC TGAATATCGA TGATGCGGAA
TGGCTCTATC ACTGGGCGGA CATTGGGACG CGCGTGGATG TCGTGTATTG A
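
The GC content field can be recomputed from the gene sequence once the spaces that group bases into blocks of ten are removed. The sketch below is a minimal version: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first printed line of the sequence is used for brevity, so the value it prints describes that fragment rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases, and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (60 bases of 1191).
first_line = "ATGACCGGCA TCGCAATCCT GATCGCAACA GCCATTGCCG CGCCGATCGA ACTGCGCGCG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` gives the figure to compare with the GC content field.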
 
Protein sequence
MTGIAILIAT AIAAPIELRA RQRPAVADFA PPLPASEAQA VVAAARAAER MQQQSAAEAV 
PLPAVQNMYF ASSGFHISDR TGFLSFWRRN GGELIFGYPI SGELIEDGRI VQYFERARFE
YHPENLGTEY QVMLSLLGNE ITQGYDFPDG QPTQGRLYFP ETRQTLGGKF LRFWQKRGGL
RIFGYPISEP FEEISPIDGQ VRITQYFERA RFEYHPEQLP AFYRQMERAN GIILSGLHEV
QLGDLGRQAM QRRGHTPHAV GPMPGAPAWS PKLFERRIEV NLTQQMLTAF EGDVPVYRAP
VATGRDGFNT PAGSFAIYYK LPRQTMTGSA GGESWYVPDV PWVQYVVGGV ALHGTYWHDA
WGTGVRMSHG CINLNIDDAE WLYHWADIGT RVDVVY
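
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard compact amino-acid string. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which this sketch ignores because the gene starts with ATG. The `translate` helper is illustrative and not part of the record.

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGACCGGCA TCGCAATCCT GATCGCAACA"))  # MTGIAILIAT
```

The printed residues match the first block of the protein sequence. Translating the full gene sequence the same way allows a comparison with all 396 residues.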