Gene RoseRS_2196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2196 
Symbol 
ID5209159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2704774 
End bp2707674 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content69% 
IMG OID640595798 
Producthypothetical protein 
Protein accessionYP_001276526 
Protein GI148656321 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.852049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.245648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAC ATTCCAGTCG CTCCGCCGTT CCGCGCCGCC GCGCGGTCAT GCTCGCGCTG 
CTCCTGCTCG TCCTCGCCGC GCTCGGCGCC GCTGCGACCG GCATTGCGCG GGCGGACGGC
AGCACGACGC CCGGACCCGG ATGGGTCGCG GCGGACGCCA TTGTGTCATC GGAGTGCGTG
CCGTCCGGTC CCAACCGCAG CGACTGCTTT TCGCGCGTCG TCCGCGCCGA AGGACTGTAC
GAGTTCGTCC CGCGGCGCAT CCCGGTCGGC GCATCGACTG ACCCGCAGCG CCCATCATCC
CCGCCCGTCG TCATCGACCG CCCGCAGTGC CGCGTCAACG GCGAATTCAT CTTCTATCCG
CCGGGGAACT GGCATTACGG GCAGCGGTGG GTGAAAGCCG TCAGAGTCAC CTATGAGGTC
GACCCTGCCT ATGCCGATCT TCCAATCCAG TACCGTCGCT ATAAAGTAGT GATCACATCG
GAATGGCGCC CGACGGCAGG ACCGACCTAC GCCTTCTACG CCACAGAGGG GTGCGAGAAA
GTTCCGTCCT GGGTTCCCAC CGCGACCCCG GCGGGCGGCG GGGGAACCCC CACCCGCCCG
CCGGGCACGC CCCCGCCGGG CACGAGCATG ACGACCACGC CGACGAGACC CGCCGATCCG
GGGACGCAGA CGCCGGTTCC GACGGCGACG GCGACGCCGC CGCTGTGCAT TCCTGCGCCG
ATCGACCCCG TGCCGCTGAC GCTCTTTGCG GGGAACACGA ACCACTACAA CAACAACGGG
CAGCCGTTCA CGCCGTACAG CGGGCGCTAC TCCGGCGCCG ACGGGTCGAA ATACTATGAC
GGCGCCGCGT GGGGGCAGAT GCTCCACGGC GACCTGACTG ACCTCAACGG CATGGCGCAG
GAGCGCCGCC TCTGGACGCA GGTTCCGGCG AACGCCGTGG CGGAGGCGCG CTATCACTTC
GAGTTCGGGC GGCGCATGTT CGACGCCTCG GACGTAGCCC GTTCCCGCGG CGCGATGGTC
ATTCTCCAGG ACCTCGGCGC CGACATGCAA CCGGGCAGCG GCGACGACCG CCTGCTGGCG
TTCCTGTCGA TGGGCGACGC AAACTTGTTC GATAACCCGG CGAACCGCCG CTTGCGCGTG
CACGGGTCGC GCCTCGACGC CTACCCCCGC CTCAACCTGC CCGGACGCCC GTCCGGGTGG
CAGGCGGCGC TGGCGCCGGG CGTGCAGGTC TGGAGCAACG CCTTCTTCTC CTGGCAGGGG
AACCGGACGC CGCCGTACGC CTGGCGTTCG TGGCCGCAGC CGGAGCAGCG CGACGCCAAC
GGCAATCTGA CGCGATTCGC CGAAGACTGG CTCTATACGG ACGAGTACGG AACCAACCAC
GGACCGCTCA TGCTGCGCTT CATCACGGAA CGCGGACGGG CGTACCGGAT GGTGATGCTC
AACACCGTTC CAGGGTGCAA CACCGATATG CTGTCCACCA TCTCGCAACT CTACTTCTTC
ACCGACCCCG GCGCGAACGT GGCGGTGGAG AAGCAGGCGC CGGAGCGCGC GTCGCGCAAC
CAGATGGTCG GCTACAGTCT GACGGTGCGC AACACGTCGC AGACGACGGT GAACAACGTC
GTGCTGACCG ACACGCTGCC GCTCGGCATG CCGTTCGGCG GCGCCGATCT GACGCTGCGC
GACCCGAACA CGGGCGAAAC GACCACCGAG CGGGTGACGG CGACGGTCAG TCTGGCGCCG
TCGTGGACCG ACGGACGGAC GCTGGTGTGG AACCTCGGCA ATCTGGCGCC GGGCGAGGTG
CGGACGATTG ATCTGACCGT TCCGGCGACC GACGCGGCGC CGGATGCAGC GACGAACGTT
GCGGTCGTCA GCGCCGCCAA TGACGGCGAC CCGAACGACA ACCGCGCCGA AGCGACGACG
GTCTTCGTGC GCACGAACGT CAGGGTGCGC ATCACGACGC CGCGCATCGT GCGACCCGGG
CAGGAGTTCG AGACGGTGAT CTCCTACGAG AACACCTCGC CGGAGGACGC CACGAACGTG
CTGCTCGACT ATCAGGTCGT CCCCGACGTA ACGATCGTCG GCGCGACGCC GGGACACCAG
GAGATGAACA ACGAACGCCC GGTCTGGCGG TTGGGGACGG TCGCCGCCGG GCAACGCGGA
ACGGTGCGCG TGCGCGTGCA GGTTCCGCCG GCGGACAGTC CGGCGCTGCC GGTGGCGGTG
CTGCACCTGG CGCGCATCTC TGCGGACGCG GACGCAGACC CGATGGACAA TGCGGCGCAG
GCGACGACGA CGGTGCTGGT GGTTCCGCCG CCGCAGCGCG GCGAAGAGCG CCTGCGCATT
CACTCGGAGT TCGACCCGGA GCGCGGGGTG TACCTGAGCG AAGGGACGAC GGTGACGTGG
CCGGCGGGCG AAGTGATGGA CTTCACGCCG TTCATCGCGC CGGACAACCG TCAGGTCGGA
CTGCCGTACT ACCGCCTGGA TCGGAAGGTC GTGGCGTGGA GTTTCGTCGG AACAGGGAAC
CTGAACCTGA TCGGCGCGAC GTGCAAGGCG CGTGAGGAGC CGACTGCGGA AGACGTGCAG
CACGCCGACC TGTCGCGACT GAAGGGATGC ATCTATCGCT ATCACGTCAG CCCGTCGTCG
GCGCAGATGC GCTGGCAGGG GCATCTGTTC TGGGGGCAGT ACGCGCCGGA GCGCATGCGC
CAGGACGTGT ACGTGCGCAC GCCGTTGCCG ACGCGCGGAA CCGACCTGCG CATCCAGTAC
GCGGTGCTGA CGGAAGCCGT GGAAACAGGG TACGAGGACG TGGACGGCGA CGGGCGCACC
GACAGCGTGC TGGAGCGACG CACCGACGTG TTTGAGGCGA CCTATCGGGT GGAATTCGTC
GTACCACGCG ACGCGCGGTA G
 
Protein sequence
MPEHSSRSAV PRRRAVMLAL LLLVLAALGA AATGIARADG STTPGPGWVA ADAIVSSECV 
PSGPNRSDCF SRVVRAEGLY EFVPRRIPVG ASTDPQRPSS PPVVIDRPQC RVNGEFIFYP
PGNWHYGQRW VKAVRVTYEV DPAYADLPIQ YRRYKVVITS EWRPTAGPTY AFYATEGCEK
VPSWVPTATP AGGGGTPTRP PGTPPPGTSM TTTPTRPADP GTQTPVPTAT ATPPLCIPAP
IDPVPLTLFA GNTNHYNNNG QPFTPYSGRY SGADGSKYYD GAAWGQMLHG DLTDLNGMAQ
ERRLWTQVPA NAVAEARYHF EFGRRMFDAS DVARSRGAMV ILQDLGADMQ PGSGDDRLLA
FLSMGDANLF DNPANRRLRV HGSRLDAYPR LNLPGRPSGW QAALAPGVQV WSNAFFSWQG
NRTPPYAWRS WPQPEQRDAN GNLTRFAEDW LYTDEYGTNH GPLMLRFITE RGRAYRMVML
NTVPGCNTDM LSTISQLYFF TDPGANVAVE KQAPERASRN QMVGYSLTVR NTSQTTVNNV
VLTDTLPLGM PFGGADLTLR DPNTGETTTE RVTATVSLAP SWTDGRTLVW NLGNLAPGEV
RTIDLTVPAT DAAPDAATNV AVVSAANDGD PNDNRAEATT VFVRTNVRVR ITTPRIVRPG
QEFETVISYE NTSPEDATNV LLDYQVVPDV TIVGATPGHQ EMNNERPVWR LGTVAAGQRG
TVRVRVQVPP ADSPALPVAV LHLARISADA DADPMDNAAQ ATTTVLVVPP PQRGEERLRI
HSEFDPERGV YLSEGTTVTW PAGEVMDFTP FIAPDNRQVG LPYYRLDRKV VAWSFVGTGN
LNLIGATCKA REEPTAEDVQ HADLSRLKGC IYRYHVSPSS AQMRWQGHLF WGQYAPERMR
QDVYVRTPLP TRGTDLRIQY AVLTEAVETG YEDVDGDGRT DSVLERRTDV FEATYRVEFV
VPRDAR