Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3156 |
Symbol | |
ID | 5210126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3973269 |
End bp | 3976451 |
Gene Length | 3183 bp |
Protein Length | 1060 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640596747 |
Product | hypothetical protein |
Protein accession | YP_001277467 |
Protein GI | 148657262 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTCC CCACCGATTT TACCGGCTTA CTCGAATGGC GCTGCATCGG ACCGTTCCGC GGCGGGCGGG TCGTCGCGGT CGCGGGAGAT CCGCGCGACA TTGGAACGTT CTACTTCGGT GCGTGCGCCG GCGGAGTCTG GAAGAGCGTC GATGGCGGGA TGTACTGGGA GTGCGTCTCA GATGGGTTTT TCAACACGGC TGCAATCGGC GCGCTGGCGG TTTCCGATTC CGATCCGAAT GTTCTGTACG CTGGCACAGG TGAAACAACG ATCCGGATCG ATGTATCGCA TGGCGACGGT GTGTACAAAA GTGTCGATGC TGGCAAAACC TGGAAACACG TCGGACTGAC CGACACCCGC TTCATCGGCA AGATTCGCAT TCACCCGCAC AACCCGGATA TTGTCTGGGT TGCAGCGCTC GGGCACGCCT TCGGACCCAA CGATGAACGT GGCGTCTTCA AGAGCATCGA TGGCGGGGCG ACGTGGCGCA GGGTGTTGTT CAAAAGCGAC AAAGCTGGCG CGGTCGATCT GTCGCTCGAT CCGAACAACC CGCGTTTCCT GTATGCTGCC GTCTGGGAGG CGTACCGCTC GTTCTGGCAG ATCTCTTCCG GTGGACCAGA CAGTGGTTTG TGGATGAGCA GCGATGGCGG TGAGACCTGG ACGGATATCA CTGATCGACC GGGGTTGCCG AAAGGGTTGA AGGGCAAAAT GGCGGTGGCT GCATCACCGG CGCGATCCGG GCGCGTCTGG GTGTTGATCG AGCACGCAAA AGAAGGCGGG CTCTACCGAT CCGACAACTA CGGCGACACG TGGGAAAAAG TTTCCGATAA TCAGAACCTG ATCTCGCGCG CCTGGTACTA CATGCACCTC ACCCCTGATC CGCTCGATTC GGAGACGATC TGGGTGAACA ATCTCAGTCT ATGGAAATCG ACCGATGGCG GACGCACGTT CGTTGAAGTT GCAACGCCGC ATGGCGATAA CCACGATCTC TGGATCGACC CACGCAACAA CCGTCGCATG ATCCAGGGGA ATGATGGCGG CGCATGCGTA TCGTTCAACG GCGGCGAGTC CTGGTCCACG ATCTACAACC AGCCGACGGC GCAGTTCTAC CACCTGGCAG TGGACAATCG CAAACCCTAC GTGGTCTACG GCACTCAGCA GGATAACTCA AGCATCGCTG TACCGGCTCG ATCGCGGCAC GGCGGCATTT TGTGGGGCGA TTGCTGGATC GCCGGGACGG GAGAGAGCGG CTATATCGCA GTGCGCCCCG ACAACCCAGA TATCGTGTAC GTCGGCGCGA TCGGGTCATC GCCCGGCGGC GGCAACTGTT TGCAGCGGTA TGATCATCGC GTTCGCCAGA TTCGCCTGAT CACCACGTGG CCCGAATATA TGGGCGGCTA CGGTGCTATC GACCATACAT ACCGTTTCGC CTGGACCTAC CCCATCGTCA TTTCACCGCA CGATCCCAAT ACACTCTACA TCGGCGGCAA CATGATCTTC CGCACCACCG ACGAAGGACA AAACTGGGAG GCGATCAGCC CCGATCTGAC GCGCGCCGAC CCGGCAACGC TGCAACCGAC CGGCGGACCG ATCAACCGCG ACTCGATCGG CGCCGAGGTG TATGCGACGG TCTTCGCGTT CATCGAGTCG CCGCACGAAC GGGGCGTCTT CTGGGCTGGC TCCGACGACG GACTGATCCA TCTTTCGCGT GATGGCGGAC TGACCTGGCA GAATGTCACG CCGCCAGAAC TGCCCGAATG GACGCTGATC AGTTGCATCG AGCCGTCACC CTTCGATGCG GCGACGGTGT ACGTCGCGGC GACGCGCTAC AAACTCGACG ACTATCATCC GTACCTGTAC AAAACCACCG ATTACGGCGC AACCTGGCAA CGAATCGACG CCGGTATTCC GGCGCACGAT TTTACGCGCG TCATCCGCGC CGACCCTGTG CGTCGCGGGT TGCTCGTCGC CGGGACGGAA ACCGGTCTGT ATCTCTCGTT CGACGATGGC GCGTCGTGGA TGCGCTTTAT GTTGAACCTG CCGGTCGCGC CGGTCCACGA GATTCTGATC AAAGACAGCG ACCTGATCGT CGGCACGCAT GGTCGTTCGA TCTGGATCCT CGACGATATA ACGCCGTTAC GCGCAATGAC TGCGGACATC CTCGATGCGC CGGTGCATCT GTTTGCGCCA CGCACGTCTG AGCGCATTCT CCCCGGCATC GACTGGTCGG GCAACAATCC GGGCAAAGAA TACCTGAGCA GCACTGGCGG CGCATTCATC AACGAAAAAC AGCCTGATGG CGCAATCAAA CGTCGCTATC TCGATGTAGG ACAGAATCCG CCGAAAGGCG TCATCGTCAC ATACTATCTG AAAGAGACGC CTGCCGAAGC GATCACGCTC AGTTTCTTCG ACGCACGGGG CGAATTGGTG CGCCGTTTCC GCAGCAAACC GCCCGAAAGC GCCGATGGCG AGAAAAAGAA GGATGACAAA GAACCGAAGA TCCCGGCGAA GGCGGGATGG AACCGTTTCG TCTGGGACAT GCGTCATGCG CCTGCGCCCC GGATCGAAGG CAAAGACCCT CCTGCTGACA TGGTTATCGA AGGTCCGTTT GTGGCGCCCG GCGCCTTCCG TGTGACGCTG AAGGCTGGCG ATGCCGAAAC TGCACAGGAA TTCGTCATCG TGCAGGACCC GCAATCCACC GCCACGCAGG AAGACCTGGA GGCGCAGCAC GACCTTGCGA TGCGGATTCA TCAAACGCTC AGCACGGTCG TGCAGACGAT CAATCGCATG CGCGATCTGC GCGCCCAACT CGATGGTTGG GCGAAGCGCG CTGAGACGCT GCCCGACGGC GCACCGGTCG CAGCGCAGGC AAAGGCGTTG CGCGAGAGGG TGCTGGAGAT CGAGCAGCAT CTCCTCGTGC CCGATCTGCG CCCTGGATGG GCGGATAATC TCAACCACGG GGTGCGCTTG CTGGAGAAGT TGATGAACGT CGCCGAAGTG GTTCAACCTG GCGATTACCG TCCGACCAGC GCGGCTGAAG CGGCGTTTCA AGACCTCGCA GCGCGCATTG CCGTCCAGAC AGCGCACTTC GAGGCGTTGA TCGAAAGCGA TCTGCCTGCG TTGAACGCTG CGATTGCCGG TGCTGGCTTC GGGGCAATTA TGCTGCCGGT ACCATCAACG TAA
|
Protein sequence | MSFPTDFTGL LEWRCIGPFR GGRVVAVAGD PRDIGTFYFG ACAGGVWKSV DGGMYWECVS DGFFNTAAIG ALAVSDSDPN VLYAGTGETT IRIDVSHGDG VYKSVDAGKT WKHVGLTDTR FIGKIRIHPH NPDIVWVAAL GHAFGPNDER GVFKSIDGGA TWRRVLFKSD KAGAVDLSLD PNNPRFLYAA VWEAYRSFWQ ISSGGPDSGL WMSSDGGETW TDITDRPGLP KGLKGKMAVA ASPARSGRVW VLIEHAKEGG LYRSDNYGDT WEKVSDNQNL ISRAWYYMHL TPDPLDSETI WVNNLSLWKS TDGGRTFVEV ATPHGDNHDL WIDPRNNRRM IQGNDGGACV SFNGGESWST IYNQPTAQFY HLAVDNRKPY VVYGTQQDNS SIAVPARSRH GGILWGDCWI AGTGESGYIA VRPDNPDIVY VGAIGSSPGG GNCLQRYDHR VRQIRLITTW PEYMGGYGAI DHTYRFAWTY PIVISPHDPN TLYIGGNMIF RTTDEGQNWE AISPDLTRAD PATLQPTGGP INRDSIGAEV YATVFAFIES PHERGVFWAG SDDGLIHLSR DGGLTWQNVT PPELPEWTLI SCIEPSPFDA ATVYVAATRY KLDDYHPYLY KTTDYGATWQ RIDAGIPAHD FTRVIRADPV RRGLLVAGTE TGLYLSFDDG ASWMRFMLNL PVAPVHEILI KDSDLIVGTH GRSIWILDDI TPLRAMTADI LDAPVHLFAP RTSERILPGI DWSGNNPGKE YLSSTGGAFI NEKQPDGAIK RRYLDVGQNP PKGVIVTYYL KETPAEAITL SFFDARGELV RRFRSKPPES ADGEKKKDDK EPKIPAKAGW NRFVWDMRHA PAPRIEGKDP PADMVIEGPF VAPGAFRVTL KAGDAETAQE FVIVQDPQST ATQEDLEAQH DLAMRIHQTL STVVQTINRM RDLRAQLDGW AKRAETLPDG APVAAQAKAL RERVLEIEQH LLVPDLRPGW ADNLNHGVRL LEKLMNVAEV VQPGDYRPTS AAEAAFQDLA ARIAVQTAHF EALIESDLPA LNAAIAGAGF GAIMLPVPST
|
| |