Gene RoseRS_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2066 
Symbol 
ID5209028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2557245 
End bp2558681 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content57% 
IMG OID640595670 
Producthypothetical protein 
Protein accessionYP_001276399 
Protein GI148656194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.380444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00122076 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCAATG TTCAGTTCAT CCGACAGGCG CTCGTGGAGG CTCTCCCCAT AGCCACGTCG 
AGTTATGGCG AAGAACCCCA TCCCGGCCAC ATGTACGTTC CGTCGGCGCA TGTCAAGGCG
CTTCGATTGG AGTGCAACCT GGTCATCGGC GCCCGGGGCG TCAGAAAATC GTTCTGGACG
GCGGCGCTGC ATGCGCGAAC GTTACGCGCC TTACTCGGTC AATCGGTGCG CGAACTTGAA
AGCACCGATG TGCGCATTGG ATTCTCGGTT GCACCATTGC TTGACGCATA TCCCGATAGC
GATATTTTCA GGGCGCTTAT CAAACAGCAT GCCGCCTATG ATATCTGGCG TGCAGTGATC
GCACGCTGGC TGGCGGAGAT AACTCCAGCA ACTCTGCCTC GCACTACCTG GGAGGAAACA
GTCCGGTGGG CTTCCAGTCA TCCCGAGGCG ATTGCGAAAA TGGTGCAGGA TGCGAATATG
CGTCTGGAGG CGGAGCAACG CTACGGATTG ATTGTTTTCG ATGCGCTTGA CCGTACCAGT
AACGACTGGC GGACAATGGA TGCCATCGTG CGTGATCTGC TGCGTGTGGT TCTGTGGCTC
AAATCCTATG CGCGTCTTCA TGCGAAGGTC TTCCTGCGCG AAGATCAGTT TGATCGCACG
GTGACAGACT TTCCCGATGC ATCAAAATTG CGTGCGACAA TGTCCGAATT GACATGGGCG
CCGCACGATC TGCACGGTCT TCTCTGGCAA ATGCTTTGCA ATGCACCAGA TGAGTACGGA
AAGACGTTGC GAGCGGTGTA TAGCAGCGTT GTTGGAACTC CGCCTCTCTG GCACGATAAC
GTCTGGCGCC TCGCCGAAGA CGTAAAGCGC GAGGGAGAAA AACAACGTCT CTTGTTCGAG
AAACTGGCAG GGAAGTGGAT GGGCAGTGAT CACCGGCGTG GCGTTCCCTA CGTCTGGTCA
GTGAGCCACC TGGCCGATGG ACGTCGACGC ACATCGCCGC GTTCGTTTCT TGCAGCAATC
CGTGCTGCTG CGGAAGACTC TCGTGAACGC TACCCGGATC ATGCATATGC TCTTCACTAT
GAAAGCATCA AGCGCGGCGT GCAGCGTGCA TCCCAGATAC GCGTCGATGA ACTGGCTGAG
GACTACCCAT GGGTCACGAA ACTTATGGCG CCTTTGCGCG GCTTGACCGT ACCATGTTCG
TTTAGCGTCA TCGAAGGACG TTGGAACGAA TACTTTCCGC ATGGACCGGA CGAGATCAGG
AGTACACGCC TTCCGCCTCA GCACGCGGAG CAGGGATGGA GGGGGGTGTG CAACGATCTG
GAGCGACTGG GTATCTTTGA GCGGATGCAC GATATGCGTA TTAATATGCC CGACCTGTAC
CGCGTTGGCT TTGGATTGGG CCGTCGTGGC GGCGTGAAAC CTATTCGACA ACCCTGA
 
Protein sequence
MSNVQFIRQA LVEALPIATS SYGEEPHPGH MYVPSAHVKA LRLECNLVIG ARGVRKSFWT 
AALHARTLRA LLGQSVRELE STDVRIGFSV APLLDAYPDS DIFRALIKQH AAYDIWRAVI
ARWLAEITPA TLPRTTWEET VRWASSHPEA IAKMVQDANM RLEAEQRYGL IVFDALDRTS
NDWRTMDAIV RDLLRVVLWL KSYARLHAKV FLREDQFDRT VTDFPDASKL RATMSELTWA
PHDLHGLLWQ MLCNAPDEYG KTLRAVYSSV VGTPPLWHDN VWRLAEDVKR EGEKQRLLFE
KLAGKWMGSD HRRGVPYVWS VSHLADGRRR TSPRSFLAAI RAAAEDSRER YPDHAYALHY
ESIKRGVQRA SQIRVDELAE DYPWVTKLMA PLRGLTVPCS FSVIEGRWNE YFPHGPDEIR
STRLPPQHAE QGWRGVCNDL ERLGIFERMH DMRINMPDLY RVGFGLGRRG GVKPIRQP