Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2919 |
Symbol | |
ID | 5209888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3649363 |
End bp | 3652605 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640596515 |
Product | hypothetical protein |
Protein accession | YP_001277237 |
Protein GI | 148657032 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACATCA GACGACATCC TTCCAGTATT GCACTCACCC TCGCGCTGCC TGTCGCGCTG GTGTTCGGCG CGCTCAACCC GGTGGCGGCG CAGCAGCCCG CGCCGACGGC TACAGCACTC CCTGCAACCC CCGTGCCAAC TGTGGCGCCA CCCACGCCGA CCGCAGCGCC GCCCGCGCCG ACCGTGGCGC CGCCCGTGCC GACCGTGGCG CTGCCGACCC CTGAGCCGAC GCTCACCGTG CCGGAGGCCA TTGACCGGAT TCTCAGCGGC GAGAAGCCGG TTGAGACCCT CGTCACCCTG TTCGCCTGGC AGCCGGCGCT GCTTGTGGTC ATCATGGCGC TGGTTGTCGC TGTTTCCGTT GTGAAACCCT GGCGCGAGCG TCTCTTGCGC CGGGTTGATC GTCTCATCGG CGGGGTACGG AGCGATGTTG AAGATGAGGT GCAGCGTGAG GAAGAGAAGA AGAGGCTCGA AGCCGAACGG CAAGCTCGAC AATTCCAGGA GCAGTTGCAG GTCGGCATTA CGCACTATCT CGACTGGCTC CAGGCGGAGT ACGGTTTTAC GCAGCCGCTC GGTATCGCCA CCGAGCAGGT GCAACTCAGT CTCGAATCGG TTCACGTGCC GCTGCGGGTT GTCGAACGCG GCGCGATTGA GGCCCACCGC CGGCGCATGC GCGGCGAAGA GCGGCAAGGG CCGGAGCGGG AGTTGCCCGC CGGGGAGCGT CGCAGCAGGT ATGTGTTTGA ACTGCTGAGT GAGCCGGAGT TGCTGGCAGC GCGCCAGACG CCTCCGACCA GAGGGGTATC AGGCGATGAT GAATCTCCGT CGCCGGTGAC TACCACGCGC CTGTTGCTGC TTGGCGATGC GGGAAGCGGG AAAACAACGA CCCTGCGCTA TGCCGCGCTT CGCCTGGCTG AGGCGTATCG CCGGGGCGAT GCTGCGTTGC TGGCGAGCGA TGCCGCCGGT TTGCATCTCC ATCTGCAGCG GGCGCCGTTG CCCATCTATG TGCGCCTGAC GCTTTTTGCC GCGTCGATCC CGGCCGATCT GCGCGAACTG CCGCCGCAGG AGCGGGAGCG CTACGCTGGC GCGCCGGCCG ACCTGTTCCT TACGTGGCTG GATCGCGAGG CGGCAAGGCA TTGCGAGATT CAGGAGGGCG CGCTCTCGTC GCTGATCGGG AAGAACGACG GCAACGTGCT GCTCCTGCTC GACGGACTGG ACGAGGCGGG CGATGAACAG CGCCGCGCGT ACCTGGCGCA GGTGATTGAC AATCTCGCGC GCCGGTATGA TAAGCAGCGC TACGTCGTCG CAAGTCGCAC GGCAGGCTAC GGCGGGCTGG TCTACCTGCC CGACTTCCTG GAGCGGCACC TCAGCCCGCT CGATGAGCAG GAAGCGCAGG CGCTGCTGCG CAAGTGGTTC GATGCCGTGT ATGCGCGCCT GCACGCGATC GGGCGGCGGC GACAGGACGC CGCTGCCGAT CAGGCCGCGC AGCTCTGGGA AGTCATTGAG CGCAATGATC GCCTGCGCGA CATGGCGACG AATCCGCTGT TGCTGACGGT GATGGCGCTG CTCCAGTTCA ACAGCGTCCG GCTCCCCGAC CAGCGCGCGA AACTGTACGA GAAGTTGATC GAACTCCTCC TCGACCTCTG GCGCAGGCAG AATGTTGCCA GCGACACGCT GGTGACGAGC GTTGCGCAGC TTGCGTCCGA GCAACGCCGG CTGGAAGCGC TCGCCCTCGC GATGCAACAA CAGCCGCAGC AGGTGCGCGA GGTGACCCTC CGCCAGGCGC AGGAATGGCT CAGCCCGCTG TATGTCGAAC GATTGAAGAT TGACCGCGAA GAAGCCGACA GGCGGGTGCA TGATCTGCTG CGCCGCCTTG CCGTCGACAG CGGGATCATC CAGCAGCGCG AGGAGCGCTA TGCCTTTTCG CACTACACGT TTCAGGAGTA TCTGGCGGCG CGAGCGCTCG ACAGCCTCGA CAACCGCGAC GGCGCGCCGG ACAGCGTGGC GTTTCTTCTG GAGCGCAGCG CAGACGCGCG CTGGCGCGAG ACCCTGCTGC TCGCCGCCGG CTACTGGAGC AATGGTCAGC AGATCCGTAA GACGGAGCGG CTCCTGCGGG GATTGCTCGA CAGGCGCGAT CCCGAAAACC TGCTGCTCGC CGCCGCTGCT CTTGCCGATG TCGGCGTGGT CGAGGACCTC GCCGACCTGC GCGATGAAGC CACCGCCCGC CTGCGCGCCC TCGCCGCCCT CACGGAGGAC TGGCGCAGCG CCGCCCACCC CGACCCCGCG CTGCGCAACC GCGCCGCCAC CATGCTCGAC CGGCTGGATG CCGATACTGA GCGTCCGGGG CTTGACCTGA CGAAGCCCGA CTACTGGGCG AACCGCATCG AGCCGGGGAC GTTCAGCATG GGTGATACGA ACAGCACATA CGACCGCGAA GAGCCGCAGT TCGACTACAC CATCCGCCGG CCCTACGCCC TGGCGCGCTT CCCGGTGACT AACCGCCAGT ACCTGCTCTT CGTCGAGGCC CTGGCCGGGC GCGGCGCGCC CGAAGCCGTC GCGGCGGCGA ATCGGCTGAA GGATCTGATG AAGCAGCACG GAGAAACCCC GGAAACGTAT AACGGGTTCC GCCCGTACTT CTGGCCCGGC GCGCGCTACC GGGCCGGCGA GGGCAACCAC CCGGTGGTCG GCGTCACATG GTATGCGGCC ACGGCCTTCG CCTGGTGGGC CGACGCCTGG CTGCGCGCCC TGGGTGTACT GAAGGAGGGC GAGGAGGTGC GCCTGCCCAC CGAGGCCGAG TGGGAGCGGG CGGCGGCCTA CCCGCCGACC CTGCCGGGCA GCGACCCCCG TACCGGGCGG CGCGAGTACC CCTGGGGCGC GGAGTTGACA ACCGCGACCA GCGGGAGTAT GATTGCCAGC ATTCAGGCTA ACATCGACGA GAGCAAGATC AGCGGAACCT CGGTGGTGGG CATCTTCCCC CACGGCGCGG CAGCCTGCGG GGCGGAGGAA CTGGCGGGGA ATGTCTGGGA GTGGTGCAGC ACGCCACCTC TGAAGTATCC GTTCAAAGGC GAGGTGAGCG CAGAAAGTCT TTACACAAAA AACAAACGTG CTGGTGGAAC ATACGTGCTG CGCGGCGGCT CGTGGAACAG CCTTCGCGAC GGCGCCCGTT GCGCCTGCCG CAACGTCCTC AACCCTGGCC ACGTCCTCGT CATCATCGGG TTTCGTCTCG CCCGTTTGTT CTCCTCTTGC TAA
|
Protein sequence | MHIRRHPSSI ALTLALPVAL VFGALNPVAA QQPAPTATAL PATPVPTVAP PTPTAAPPAP TVAPPVPTVA LPTPEPTLTV PEAIDRILSG EKPVETLVTL FAWQPALLVV IMALVVAVSV VKPWRERLLR RVDRLIGGVR SDVEDEVQRE EEKKRLEAER QARQFQEQLQ VGITHYLDWL QAEYGFTQPL GIATEQVQLS LESVHVPLRV VERGAIEAHR RRMRGEERQG PERELPAGER RSRYVFELLS EPELLAARQT PPTRGVSGDD ESPSPVTTTR LLLLGDAGSG KTTTLRYAAL RLAEAYRRGD AALLASDAAG LHLHLQRAPL PIYVRLTLFA ASIPADLREL PPQERERYAG APADLFLTWL DREAARHCEI QEGALSSLIG KNDGNVLLLL DGLDEAGDEQ RRAYLAQVID NLARRYDKQR YVVASRTAGY GGLVYLPDFL ERHLSPLDEQ EAQALLRKWF DAVYARLHAI GRRRQDAAAD QAAQLWEVIE RNDRLRDMAT NPLLLTVMAL LQFNSVRLPD QRAKLYEKLI ELLLDLWRRQ NVASDTLVTS VAQLASEQRR LEALALAMQQ QPQQVREVTL RQAQEWLSPL YVERLKIDRE EADRRVHDLL RRLAVDSGII QQREERYAFS HYTFQEYLAA RALDSLDNRD GAPDSVAFLL ERSADARWRE TLLLAAGYWS NGQQIRKTER LLRGLLDRRD PENLLLAAAA LADVGVVEDL ADLRDEATAR LRALAALTED WRSAAHPDPA LRNRAATMLD RLDADTERPG LDLTKPDYWA NRIEPGTFSM GDTNSTYDRE EPQFDYTIRR PYALARFPVT NRQYLLFVEA LAGRGAPEAV AAANRLKDLM KQHGETPETY NGFRPYFWPG ARYRAGEGNH PVVGVTWYAA TAFAWWADAW LRALGVLKEG EEVRLPTEAE WERAAAYPPT LPGSDPRTGR REYPWGAELT TATSGSMIAS IQANIDESKI SGTSVVGIFP HGAAACGAEE LAGNVWEWCS TPPLKYPFKG EVSAESLYTK NKRAGGTYVL RGGSWNSLRD GARCACRNVL NPGHVLVIIG FRLARLFSSC
|
| |