Gene Information |
Locus tag | RoseRS_2196 |
Symbol | |
ID | 5209159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2704774 |
End bp | 2707674 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640595798 |
Product | hypothetical protein |
Protein accession | YP_001276526 |
Protein GI | 148656321 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
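
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon; neither assumption is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp / End bp fields of this record.
length_bp = gene_length(2704774, 2707674)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (2901 bp) and Protein Length (966 aa) fields.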
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.852049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.245648 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAC ATTCCAGTCG CTCCGCCGTT CCGCGCCGCC GCGCGGTCAT GCTCGCGCTG CTCCTGCTCG TCCTCGCCGC GCTCGGCGCC GCTGCGACCG GCATTGCGCG GGCGGACGGC AGCACGACGC CCGGACCCGG ATGGGTCGCG GCGGACGCCA TTGTGTCATC GGAGTGCGTG CCGTCCGGTC CCAACCGCAG CGACTGCTTT TCGCGCGTCG TCCGCGCCGA AGGACTGTAC GAGTTCGTCC CGCGGCGCAT CCCGGTCGGC GCATCGACTG ACCCGCAGCG CCCATCATCC CCGCCCGTCG TCATCGACCG CCCGCAGTGC CGCGTCAACG GCGAATTCAT CTTCTATCCG CCGGGGAACT GGCATTACGG GCAGCGGTGG GTGAAAGCCG TCAGAGTCAC CTATGAGGTC GACCCTGCCT ATGCCGATCT TCCAATCCAG TACCGTCGCT ATAAAGTAGT GATCACATCG GAATGGCGCC CGACGGCAGG ACCGACCTAC GCCTTCTACG CCACAGAGGG GTGCGAGAAA GTTCCGTCCT GGGTTCCCAC CGCGACCCCG GCGGGCGGCG GGGGAACCCC CACCCGCCCG CCGGGCACGC CCCCGCCGGG CACGAGCATG ACGACCACGC CGACGAGACC CGCCGATCCG GGGACGCAGA CGCCGGTTCC GACGGCGACG GCGACGCCGC CGCTGTGCAT TCCTGCGCCG ATCGACCCCG TGCCGCTGAC GCTCTTTGCG GGGAACACGA ACCACTACAA CAACAACGGG CAGCCGTTCA CGCCGTACAG CGGGCGCTAC TCCGGCGCCG ACGGGTCGAA ATACTATGAC GGCGCCGCGT GGGGGCAGAT GCTCCACGGC GACCTGACTG ACCTCAACGG CATGGCGCAG GAGCGCCGCC TCTGGACGCA GGTTCCGGCG AACGCCGTGG CGGAGGCGCG CTATCACTTC GAGTTCGGGC GGCGCATGTT CGACGCCTCG GACGTAGCCC GTTCCCGCGG CGCGATGGTC ATTCTCCAGG ACCTCGGCGC CGACATGCAA CCGGGCAGCG GCGACGACCG CCTGCTGGCG TTCCTGTCGA TGGGCGACGC AAACTTGTTC GATAACCCGG CGAACCGCCG CTTGCGCGTG CACGGGTCGC GCCTCGACGC CTACCCCCGC CTCAACCTGC CCGGACGCCC GTCCGGGTGG CAGGCGGCGC TGGCGCCGGG CGTGCAGGTC TGGAGCAACG CCTTCTTCTC CTGGCAGGGG AACCGGACGC CGCCGTACGC CTGGCGTTCG TGGCCGCAGC CGGAGCAGCG CGACGCCAAC GGCAATCTGA CGCGATTCGC CGAAGACTGG CTCTATACGG ACGAGTACGG AACCAACCAC GGACCGCTCA TGCTGCGCTT CATCACGGAA CGCGGACGGG CGTACCGGAT GGTGATGCTC AACACCGTTC CAGGGTGCAA CACCGATATG CTGTCCACCA TCTCGCAACT CTACTTCTTC ACCGACCCCG GCGCGAACGT GGCGGTGGAG AAGCAGGCGC CGGAGCGCGC GTCGCGCAAC CAGATGGTCG GCTACAGTCT GACGGTGCGC AACACGTCGC AGACGACGGT GAACAACGTC GTGCTGACCG ACACGCTGCC GCTCGGCATG CCGTTCGGCG GCGCCGATCT GACGCTGCGC GACCCGAACA CGGGCGAAAC GACCACCGAG CGGGTGACGG CGACGGTCAG TCTGGCGCCG TCGTGGACCG ACGGACGGAC GCTGGTGTGG AACCTCGGCA ATCTGGCGCC GGGCGAGGTG 
CGGACGATTG ATCTGACCGT TCCGGCGACC GACGCGGCGC CGGATGCAGC GACGAACGTT GCGGTCGTCA GCGCCGCCAA TGACGGCGAC CCGAACGACA ACCGCGCCGA AGCGACGACG GTCTTCGTGC GCACGAACGT CAGGGTGCGC ATCACGACGC CGCGCATCGT GCGACCCGGG CAGGAGTTCG AGACGGTGAT CTCCTACGAG AACACCTCGC CGGAGGACGC CACGAACGTG CTGCTCGACT ATCAGGTCGT CCCCGACGTA ACGATCGTCG GCGCGACGCC GGGACACCAG GAGATGAACA ACGAACGCCC GGTCTGGCGG TTGGGGACGG TCGCCGCCGG GCAACGCGGA ACGGTGCGCG TGCGCGTGCA GGTTCCGCCG GCGGACAGTC CGGCGCTGCC GGTGGCGGTG CTGCACCTGG CGCGCATCTC TGCGGACGCG GACGCAGACC CGATGGACAA TGCGGCGCAG GCGACGACGA CGGTGCTGGT GGTTCCGCCG CCGCAGCGCG GCGAAGAGCG CCTGCGCATT CACTCGGAGT TCGACCCGGA GCGCGGGGTG TACCTGAGCG AAGGGACGAC GGTGACGTGG CCGGCGGGCG AAGTGATGGA CTTCACGCCG TTCATCGCGC CGGACAACCG TCAGGTCGGA CTGCCGTACT ACCGCCTGGA TCGGAAGGTC GTGGCGTGGA GTTTCGTCGG AACAGGGAAC CTGAACCTGA TCGGCGCGAC GTGCAAGGCG CGTGAGGAGC CGACTGCGGA AGACGTGCAG CACGCCGACC TGTCGCGACT GAAGGGATGC ATCTATCGCT ATCACGTCAG CCCGTCGTCG GCGCAGATGC GCTGGCAGGG GCATCTGTTC TGGGGGCAGT ACGCGCCGGA GCGCATGCGC CAGGACGTGT ACGTGCGCAC GCCGTTGCCG ACGCGCGGAA CCGACCTGCG CATCCAGTAC GCGGTGCTGA CGGAAGCCGT GGAAACAGGG TACGAGGACG TGGACGGCGA CGGGCGCACC GACAGCGTGC TGGAGCGACG CACCGACGTG TTTGAGGCGA CCTATCGGGT GGAATTCGTC GTACCACGCG ACGCGCGGTA G
|
Protein sequence | MPEHSSRSAV PRRRAVMLAL LLLVLAALGA AATGIARADG STTPGPGWVA ADAIVSSECV PSGPNRSDCF SRVVRAEGLY EFVPRRIPVG ASTDPQRPSS PPVVIDRPQC RVNGEFIFYP PGNWHYGQRW VKAVRVTYEV DPAYADLPIQ YRRYKVVITS EWRPTAGPTY AFYATEGCEK VPSWVPTATP AGGGGTPTRP PGTPPPGTSM TTTPTRPADP GTQTPVPTAT ATPPLCIPAP IDPVPLTLFA GNTNHYNNNG QPFTPYSGRY SGADGSKYYD GAAWGQMLHG DLTDLNGMAQ ERRLWTQVPA NAVAEARYHF EFGRRMFDAS DVARSRGAMV ILQDLGADMQ PGSGDDRLLA FLSMGDANLF DNPANRRLRV HGSRLDAYPR LNLPGRPSGW QAALAPGVQV WSNAFFSWQG NRTPPYAWRS WPQPEQRDAN GNLTRFAEDW LYTDEYGTNH GPLMLRFITE RGRAYRMVML NTVPGCNTDM LSTISQLYFF TDPGANVAVE KQAPERASRN QMVGYSLTVR NTSQTTVNNV VLTDTLPLGM PFGGADLTLR DPNTGETTTE RVTATVSLAP SWTDGRTLVW NLGNLAPGEV RTIDLTVPAT DAAPDAATNV AVVSAANDGD PNDNRAEATT VFVRTNVRVR ITTPRIVRPG QEFETVISYE NTSPEDATNV LLDYQVVPDV TIVGATPGHQ EMNNERPVWR LGTVAAGQRG TVRVRVQVPP ADSPALPVAV LHLARISADA DADPMDNAAQ ATTTVLVVPP PQRGEERLRI HSEFDPERGV YLSEGTTVTW PAGEVMDFTP FIAPDNRQVG LPYYRLDRKV VAWSFVGTGN LNLIGATCKA REEPTAEDVQ HADLSRLKGC IYRYHVSPSS AQMRWQGHLF WGQYAPERMR QDVYVRTPLP TRGTDLRIQY AVLTEAVETG YEDVDGDGRT DSVLERRTDV FEATYRVEFV VPRDAR
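
The sequences above are printed in space-separated blocks of ten, so any check on them has to strip the whitespace first. The following is a minimal sketch of two such checks: a GC fraction and a start/stop codon test. The helper names are hypothetical. The start check looks only for ATG, although translation table 11 also permits alternative start codons, and the stop set is the standard TAA/TAG/TGA.

```python
def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def has_start_and_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the sequence begins with ATG and ends with a stop codon."""
    bases = clean(seq)
    return bases[:3] == "ATG" and bases[-3:] in stops


# First two blocks of the gene sequence only, so this is not the full-gene value.
fragment = "ATGCCCGAAC ATTCCAGTCG"
print(f"GC fraction of fragment: {gc_fraction(fragment):.2f}")
```

Passing the full gene sequence to `gc_fraction` would allow comparison with the GC content field, which is reported as a rounded percentage.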