Gene RoseRS_3156 details


Gene Information

Locus tag: RoseRS_3156
Symbol:
ID: 5210126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 3973269
End bp: 3976451
Gene Length: 3183 bp
Protein Length: 1060 aa
Translation table: 11
GC content: 60%
IMG OID: 640596747
Product: hypothetical protein
Protein accession: YP_001277467
Protein GI: 148657262
COG category: [R] General function prediction only
COG ID: [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor
TIGRFAM ID:
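
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(3973269, 3976451)
print(length)                     # 3183
print(protein_length_aa(length))  # 1060
```

Both results agree with the Gene Length and Protein Length fields of the record.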


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTTCC CCACCGATTT TACCGGCTTA CTCGAATGGC GCTGCATCGG ACCGTTCCGC 
GGCGGGCGGG TCGTCGCGGT CGCGGGAGAT CCGCGCGACA TTGGAACGTT CTACTTCGGT
GCGTGCGCCG GCGGAGTCTG GAAGAGCGTC GATGGCGGGA TGTACTGGGA GTGCGTCTCA
GATGGGTTTT TCAACACGGC TGCAATCGGC GCGCTGGCGG TTTCCGATTC CGATCCGAAT
GTTCTGTACG CTGGCACAGG TGAAACAACG ATCCGGATCG ATGTATCGCA TGGCGACGGT
GTGTACAAAA GTGTCGATGC TGGCAAAACC TGGAAACACG TCGGACTGAC CGACACCCGC
TTCATCGGCA AGATTCGCAT TCACCCGCAC AACCCGGATA TTGTCTGGGT TGCAGCGCTC
GGGCACGCCT TCGGACCCAA CGATGAACGT GGCGTCTTCA AGAGCATCGA TGGCGGGGCG
ACGTGGCGCA GGGTGTTGTT CAAAAGCGAC AAAGCTGGCG CGGTCGATCT GTCGCTCGAT
CCGAACAACC CGCGTTTCCT GTATGCTGCC GTCTGGGAGG CGTACCGCTC GTTCTGGCAG
ATCTCTTCCG GTGGACCAGA CAGTGGTTTG TGGATGAGCA GCGATGGCGG TGAGACCTGG
ACGGATATCA CTGATCGACC GGGGTTGCCG AAAGGGTTGA AGGGCAAAAT GGCGGTGGCT
GCATCACCGG CGCGATCCGG GCGCGTCTGG GTGTTGATCG AGCACGCAAA AGAAGGCGGG
CTCTACCGAT CCGACAACTA CGGCGACACG TGGGAAAAAG TTTCCGATAA TCAGAACCTG
ATCTCGCGCG CCTGGTACTA CATGCACCTC ACCCCTGATC CGCTCGATTC GGAGACGATC
TGGGTGAACA ATCTCAGTCT ATGGAAATCG ACCGATGGCG GACGCACGTT CGTTGAAGTT
GCAACGCCGC ATGGCGATAA CCACGATCTC TGGATCGACC CACGCAACAA CCGTCGCATG
ATCCAGGGGA ATGATGGCGG CGCATGCGTA TCGTTCAACG GCGGCGAGTC CTGGTCCACG
ATCTACAACC AGCCGACGGC GCAGTTCTAC CACCTGGCAG TGGACAATCG CAAACCCTAC
GTGGTCTACG GCACTCAGCA GGATAACTCA AGCATCGCTG TACCGGCTCG ATCGCGGCAC
GGCGGCATTT TGTGGGGCGA TTGCTGGATC GCCGGGACGG GAGAGAGCGG CTATATCGCA
GTGCGCCCCG ACAACCCAGA TATCGTGTAC GTCGGCGCGA TCGGGTCATC GCCCGGCGGC
GGCAACTGTT TGCAGCGGTA TGATCATCGC GTTCGCCAGA TTCGCCTGAT CACCACGTGG
CCCGAATATA TGGGCGGCTA CGGTGCTATC GACCATACAT ACCGTTTCGC CTGGACCTAC
CCCATCGTCA TTTCACCGCA CGATCCCAAT ACACTCTACA TCGGCGGCAA CATGATCTTC
CGCACCACCG ACGAAGGACA AAACTGGGAG GCGATCAGCC CCGATCTGAC GCGCGCCGAC
CCGGCAACGC TGCAACCGAC CGGCGGACCG ATCAACCGCG ACTCGATCGG CGCCGAGGTG
TATGCGACGG TCTTCGCGTT CATCGAGTCG CCGCACGAAC GGGGCGTCTT CTGGGCTGGC
TCCGACGACG GACTGATCCA TCTTTCGCGT GATGGCGGAC TGACCTGGCA GAATGTCACG
CCGCCAGAAC TGCCCGAATG GACGCTGATC AGTTGCATCG AGCCGTCACC CTTCGATGCG
GCGACGGTGT ACGTCGCGGC GACGCGCTAC AAACTCGACG ACTATCATCC GTACCTGTAC
AAAACCACCG ATTACGGCGC AACCTGGCAA CGAATCGACG CCGGTATTCC GGCGCACGAT
TTTACGCGCG TCATCCGCGC CGACCCTGTG CGTCGCGGGT TGCTCGTCGC CGGGACGGAA
ACCGGTCTGT ATCTCTCGTT CGACGATGGC GCGTCGTGGA TGCGCTTTAT GTTGAACCTG
CCGGTCGCGC CGGTCCACGA GATTCTGATC AAAGACAGCG ACCTGATCGT CGGCACGCAT
GGTCGTTCGA TCTGGATCCT CGACGATATA ACGCCGTTAC GCGCAATGAC TGCGGACATC
CTCGATGCGC CGGTGCATCT GTTTGCGCCA CGCACGTCTG AGCGCATTCT CCCCGGCATC
GACTGGTCGG GCAACAATCC GGGCAAAGAA TACCTGAGCA GCACTGGCGG CGCATTCATC
AACGAAAAAC AGCCTGATGG CGCAATCAAA CGTCGCTATC TCGATGTAGG ACAGAATCCG
CCGAAAGGCG TCATCGTCAC ATACTATCTG AAAGAGACGC CTGCCGAAGC GATCACGCTC
AGTTTCTTCG ACGCACGGGG CGAATTGGTG CGCCGTTTCC GCAGCAAACC GCCCGAAAGC
GCCGATGGCG AGAAAAAGAA GGATGACAAA GAACCGAAGA TCCCGGCGAA GGCGGGATGG
AACCGTTTCG TCTGGGACAT GCGTCATGCG CCTGCGCCCC GGATCGAAGG CAAAGACCCT
CCTGCTGACA TGGTTATCGA AGGTCCGTTT GTGGCGCCCG GCGCCTTCCG TGTGACGCTG
AAGGCTGGCG ATGCCGAAAC TGCACAGGAA TTCGTCATCG TGCAGGACCC GCAATCCACC
GCCACGCAGG AAGACCTGGA GGCGCAGCAC GACCTTGCGA TGCGGATTCA TCAAACGCTC
AGCACGGTCG TGCAGACGAT CAATCGCATG CGCGATCTGC GCGCCCAACT CGATGGTTGG
GCGAAGCGCG CTGAGACGCT GCCCGACGGC GCACCGGTCG CAGCGCAGGC AAAGGCGTTG
CGCGAGAGGG TGCTGGAGAT CGAGCAGCAT CTCCTCGTGC CCGATCTGCG CCCTGGATGG
GCGGATAATC TCAACCACGG GGTGCGCTTG CTGGAGAAGT TGATGAACGT CGCCGAAGTG
GTTCAACCTG GCGATTACCG TCCGACCAGC GCGGCTGAAG CGGCGTTTCA AGACCTCGCA
GCGCGCATTG CCGTCCAGAC AGCGCACTTC GAGGCGTTGA TCGAAAGCGA TCTGCCTGCG
TTGAACGCTG CGATTGCCGG TGCTGGCTTC GGGGCAATTA TGCTGCCGGT ACCATCAACG
TAA
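
The gene sequence above is printed in space-separated 10-base blocks, so it needs to be joined before any analysis. The sketch below shows one way to strip the layout whitespace and compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as printed in the record.
excerpt = "ATGTCCTTCC CCACCGATTT TACCGGCTTA CTCGAATGGC GCTGCATCGG ACCGTTCCGC"
seq = clean_sequence(excerpt)
print(len(seq))                  # 60
print(round(gc_fraction(seq), 2))
```

Pasting the full sequence into `clean_sequence` gives the input needed to check the record's Gene Length and GC content values.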
 
Protein sequence
MSFPTDFTGL LEWRCIGPFR GGRVVAVAGD PRDIGTFYFG ACAGGVWKSV DGGMYWECVS 
DGFFNTAAIG ALAVSDSDPN VLYAGTGETT IRIDVSHGDG VYKSVDAGKT WKHVGLTDTR
FIGKIRIHPH NPDIVWVAAL GHAFGPNDER GVFKSIDGGA TWRRVLFKSD KAGAVDLSLD
PNNPRFLYAA VWEAYRSFWQ ISSGGPDSGL WMSSDGGETW TDITDRPGLP KGLKGKMAVA
ASPARSGRVW VLIEHAKEGG LYRSDNYGDT WEKVSDNQNL ISRAWYYMHL TPDPLDSETI
WVNNLSLWKS TDGGRTFVEV ATPHGDNHDL WIDPRNNRRM IQGNDGGACV SFNGGESWST
IYNQPTAQFY HLAVDNRKPY VVYGTQQDNS SIAVPARSRH GGILWGDCWI AGTGESGYIA
VRPDNPDIVY VGAIGSSPGG GNCLQRYDHR VRQIRLITTW PEYMGGYGAI DHTYRFAWTY
PIVISPHDPN TLYIGGNMIF RTTDEGQNWE AISPDLTRAD PATLQPTGGP INRDSIGAEV
YATVFAFIES PHERGVFWAG SDDGLIHLSR DGGLTWQNVT PPELPEWTLI SCIEPSPFDA
ATVYVAATRY KLDDYHPYLY KTTDYGATWQ RIDAGIPAHD FTRVIRADPV RRGLLVAGTE
TGLYLSFDDG ASWMRFMLNL PVAPVHEILI KDSDLIVGTH GRSIWILDDI TPLRAMTADI
LDAPVHLFAP RTSERILPGI DWSGNNPGKE YLSSTGGAFI NEKQPDGAIK RRYLDVGQNP
PKGVIVTYYL KETPAEAITL SFFDARGELV RRFRSKPPES ADGEKKKDDK EPKIPAKAGW
NRFVWDMRHA PAPRIEGKDP PADMVIEGPF VAPGAFRVTL KAGDAETAQE FVIVQDPQST
ATQEDLEAQH DLAMRIHQTL STVVQTINRM RDLRAQLDGW AKRAETLPDG APVAAQAKAL
RERVLEIEQH LLVPDLRPGW ADNLNHGVRL LEKLMNVAEV VQPGDYRPTS AAEAAFQDLA
ARIAVQTAHF EALIESDLPA LNAAIAGAGF GAIMLPVPST
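
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below builds the codon-to-amino-acid mapping from the compact NCBI-style string for translation table 11, whose amino acid assignments are the same as in the standard code. The `translate` function is a hypothetical helper that stops at the first stop codon; it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with layout spaces removed.
print(translate("ATGTCCTTCCCCACCGATTTTACCGGCTTA"))  # MSFPTDFTGL
```

The output matches the first ten residues of the protein sequence listed above.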