Gene RoseRS_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1784 
Symbol 
ID5208743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2202532 
End bp2205528 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content60% 
IMG OID640595392 
Producthypothetical protein 
Protein accessionYP_001276124 
Protein GI148655919 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.78034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.114546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCCGA CGACATTACT GCCGTTTCGT CGGTTGCTTG CCCCGCTCGT CGGCTACGCA 
ATGATGGTTG TTGCCGGTTA TTGGAGTGTT CGCACAGTTG TCGGACTGAA TAGTGCGCTG
CCGGTGCTTC TTATCTTCTG CACTATGGTC AATATCCTGG CGTGGCGTCG CACCGGTTCT
CCGTGGTCTT GTCGGTTGCC GGTTCAGATG ACGCGCAGCG AAGGAGTGGC GCTGTTCCTT
CTGATTCTGG CGACGATTGG CGTGGGGGTG TTCCCGCTGA TCCACTACAA CCAGCCTGCT
GCGATCGGCG ATGGGTGGGA TATCGAGGTG GCACTGCCGA TAGCGCGCTA TCTGGAGCGC
GTCCCGGTCG CAGCGATTGC GACAATGCCC GATAACCCGC TGCGTGATCT GGTCGCCCAT
CCGCCTCGAA TTTCGCATAA TATCGGGTTC GCCATCTGGC AGGGGTATGT CGATCTCCTG
GGAGGGTTTG AGGCGTTCGT CTCGTTTACT CCGTTGATCG CCTGGTTGCG GGGATTGGGC
ATTCTGGCGA TCTATGCGCT GTTGCGCATC TGGCTTGGAT TGCCAGCCTG GACAGCGCTG
GCTGGCGCCG GTCTGGCAAG CCTGAATGCG CTCCTTCTCT GGGTGTCCTA TTTCAACTTC
GAGAAGCAAC TTGCCGGGTT TCCGTTGATC CCGCTGGGTC TGGTCATTGG GGCGGCTGCG
GTTGAAGAGA TCGCACGTTA CCGTCTGGCA GCGTGGCGAA GCGCATTCCT GGCGGCCGTA
GTCCTGTCCG CTCTGCCGGT CACCTACTAT CCGGCGATCA CGGTGTGGGC GGCGCTGGCA
GCGGGGATGG GCGTGGTGCG TCTGATCGAA GCGCGCAGGA AACCTGCCGA GGCGCCGTCG
CCGTGGATGC TGATACGTGC AGCCGCTGCA CTGCTGGTAC TGACCCTCAT GATCGCTGCG
CCAACGGTCG AAGACTATCT CAATGGTTTT GGATTTCGCT ACAGTCATCA GGTGACATCG
CTTGGCATAT TCGACTATAT TCCGGTCAGC GTTATTGTTG GATTGGAACC GTTTCTTCTG
AGTCGCAGCG GATCGGTTGC GCCGGACAGC GTGGTGTATG CTGGCGGTCT GGCGCTGGGC
CTGCTTGTGG CGGGTGCGCT CGCGTTCGGA CCACTTCGCC TGCGACTGGC GGGGTTGCTC
ATCGGAGGGA TCGTCTATCT GGCCTGGTTG CGGTGGTGGC AGGCATATCC ATACGGCTAC
ATGAAAGGCG CCGCGTATGT TGGTTTTGTT TTTTCCGCAC TGGCGGCAGC CGGAATCCAG
GGGCTCCGCA GGTGGATAGC AGAACGATGG AACAACAACA TTATTCAGCG GGTCGGCGCT
CACACCGCGT TGATCGTGAT TGCGACAGGT TTATGTGCGC TTATGGGGGT CAATCAAGCG
CAGGTGGTGG TTGCTCACCT CGATCAACCC GGTCTCTACC CTGACGACGC TCCAACATTG
CTCGCATTGC GTCAGATTAT CCCGCCCGGA AGCACGGTCA CGCTGACGTC TGATCAACGG
GTACAGGGCG TTATCAGTGG GTTTGCTGCG TATGCGCTCG ACCATACGGT GGTGTGGGGG
CATGTACGCA CGGGATACAC CAGGTCTCAA ACAGGCGATA TCGATGCTAT TGGTGAGTAT
GGGTTACTCT ATGCTTCTGA AGATCCTCTG CTATGGGGGT ACACGCAGCC TCCAATCTGG
CGCGGCGGTT CGTATGCGCT CTATCGTCGC CCGCCGGAGG TGCAGCGCCA TCTTCGCGTG
CTGAAACCGC TGGTGCCAGG TGAAACGCTG ACGCTGCGTA TGAATGTCGA GCAATGGGAG
GCGTCGTCCG TCACCGGTTC CGCTTTTCGC TCACTGCGAT TGATGGTCGC TTCTCTTGCG
CCCGCCGCCA TCGAGATCAA TGGCATTCCC ATAGCAATAC CTCCAGGACG CCATACGATG
ACCCTTGCTG TTCCGCCCTT CCAGGAAGTG AGGATTCGTC ACGTTGATGG CGCCCTGCCT
CTCATCGAGA CGATCACGCT GCTGGCAGAT CCTGATCCGA ACACGATCCA GGCTATGAGA
GTCGTTCACA GTACTCAACC GACAGGCGGC CCTCTGATGC GGGAGGTGAC AGGGATCGCT
CTGGTCCAGG CATCGGCGGT TGCTTCTGAT ACGCATATCC TGATGACTCT GGCAGCGCTT
CTGCCGGATG CTGGACCGCT GAACGTGGCG CTCGATATCT GGGATGTTGA GCGGGGCGTC
CAGTATGGAT GGTATGGGCT GCTTGTTATG CCTGAGCCGG AGGTGCAGCG TTTCTCACTG
TTCCTGTCGC TTGCTGATGG ACAGATGCGC GGAGTATCCG CTCAGGGCGG CGACGTGCCG
CTGGGCGCGT ATTTTGCCGG GTTGCAACCT GGACGATACA CCGCCCGCCT GTATCTTGCT
GCCAGTGCTC AGGTGGTGAG TGAGCCAATC GATCTGTTTG GATTTGATAT CACGTCTGAT
CGCGCAATGA CGAATGTATG GACACGGGAT CATCAGATGC AGGCGATCCG CGCGATCCAT
CCGACGACGT TCATCAATGT CCGGGTTGCC GATGACGTGG CGATGGTCGG ATACACTCTG
CTGCCAGCGC GCCCCAAACC GGGAGATACA GTCGACCTGA TCATCTGGTG GCGCTCACTG
CGCGATGGTC TGGATGAGCG CAGCGTGCTC GTGCATCTTG TTGATGCCGC CGGCACGAAA
CGTGCGCAAG CGGACGGTCC GCCTGCCGCA GGAACGATGC CAACCGGGAA ATGGCGCGCC
GGACTGACGA TTGTTGATGC GCGGCGCCTC ACCCTCCCGG TCGATCTGCC ATCCGGCGAC
TATACGCTTT TGGTGGGGAT GTACCGCTGG CCCTCGCTCG AACGCCTGCC GCTGGTGCAG
GGAAATGAGT TGCTTCCCGA AGCGGTCTTC CGGGTTCCTG TGGCAATTGG GGAGTGA
 
Protein sequence
MLPTTLLPFR RLLAPLVGYA MMVVAGYWSV RTVVGLNSAL PVLLIFCTMV NILAWRRTGS 
PWSCRLPVQM TRSEGVALFL LILATIGVGV FPLIHYNQPA AIGDGWDIEV ALPIARYLER
VPVAAIATMP DNPLRDLVAH PPRISHNIGF AIWQGYVDLL GGFEAFVSFT PLIAWLRGLG
ILAIYALLRI WLGLPAWTAL AGAGLASLNA LLLWVSYFNF EKQLAGFPLI PLGLVIGAAA
VEEIARYRLA AWRSAFLAAV VLSALPVTYY PAITVWAALA AGMGVVRLIE ARRKPAEAPS
PWMLIRAAAA LLVLTLMIAA PTVEDYLNGF GFRYSHQVTS LGIFDYIPVS VIVGLEPFLL
SRSGSVAPDS VVYAGGLALG LLVAGALAFG PLRLRLAGLL IGGIVYLAWL RWWQAYPYGY
MKGAAYVGFV FSALAAAGIQ GLRRWIAERW NNNIIQRVGA HTALIVIATG LCALMGVNQA
QVVVAHLDQP GLYPDDAPTL LALRQIIPPG STVTLTSDQR VQGVISGFAA YALDHTVVWG
HVRTGYTRSQ TGDIDAIGEY GLLYASEDPL LWGYTQPPIW RGGSYALYRR PPEVQRHLRV
LKPLVPGETL TLRMNVEQWE ASSVTGSAFR SLRLMVASLA PAAIEINGIP IAIPPGRHTM
TLAVPPFQEV RIRHVDGALP LIETITLLAD PDPNTIQAMR VVHSTQPTGG PLMREVTGIA
LVQASAVASD THILMTLAAL LPDAGPLNVA LDIWDVERGV QYGWYGLLVM PEPEVQRFSL
FLSLADGQMR GVSAQGGDVP LGAYFAGLQP GRYTARLYLA ASAQVVSEPI DLFGFDITSD
RAMTNVWTRD HQMQAIRAIH PTTFINVRVA DDVAMVGYTL LPARPKPGDT VDLIIWWRSL
RDGLDERSVL VHLVDAAGTK RAQADGPPAA GTMPTGKWRA GLTIVDARRL TLPVDLPSGD
YTLLVGMYRW PSLERLPLVQ GNELLPEAVF RVPVAIGE