Gene RoseRS_3008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3008 
Symbol 
ID5209976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3776783 
End bp3779731 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content58% 
IMG OID640596600 
Productpeptidase M23B 
Protein accessionYP_001277322 
Protein GI148657117 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.631007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTCGT ATCGATTGAT GCATGCTATC AGTGTCGTGA CTCTTCTCGT GATGACGTTT 
GGGGTATCCG GTGCGTTGAC GCAGGAAATT GCTCTCAGTG CAACTGCATA TCCGCCTCCT
GCAACACCGG TTTTCTCGAT GCCTGCGACG GGTTTTATGG GACTCAGTTG GAGCGGCGAT
GGTCCGGGTG CTCATCGGGG AATCGATATA TGGAGTTGGC GCTCTGCCGG TTGCTCTTGT
AGGGAACCAA CCGGTTGCGG CGTCATGCCA GGAGCAGAAG TCCGTGCGGT ATATGATGGG
GTCGTTGCAG GCATCTACTG GGGCGACCGC AGCGGTAGAT GGTATTCAGC CGGAACACCT
AATCCTGGCA ACTATCCATT GTCGGTCGTT ATTCTTGAGC ACCAGGGCGT TCCTGGCATC
TCAACTAATA AGGTGTACAC CGTGTACCAG CACATGGCCA ACGACGACAC TCGCGAAAGT
TACGTTGCGC AGGGTCTTGC CGTTGGTCAG ACTGTGCGCC AGAACGAGGT CATTGGCAGA
CAGGGCAACT GGCGCTATTA CGTGGCGAAC GATCCCAGGG CGAACGATCC CATAACGCAT
CTCCATTTCG AGGTGGCTTA TAAACCAGAT ACCTATACCC TGCCGGACCG GATCACGGTT
GATCCAATGC CGTACCTCAT CGGTGGAGGG CAGTCAAGCT GTTCGGGCGC ACCTATCAAT
AGCAGTGCAC CGCCAGCGCT CAAGATCCTC CGTCCATCAA AGGCAAATCC GGTTGATGTC
GGCACACCGG CAAGTCCTTC AAAGTTTACC ATCGAGATCG CATATGCAGG ATCTGCAAGT
CTCCAGGATG TGACGGTCAC AGTTGGTGGT AAACCTGCCA CGATTATCAA TATGACACCC
ACACGCTATA CGATGGAGGT ACAGGCGCCT CCACAGACTG GAAGCGGTGC ATACGACCTG
ACAGTGTCTA TCAAGGGCGC TACTGCGACG GAAGCCAGCG CCGTGAGTTA CAACGGTGCA
TCGAATGCCA ACGTCATTCT CACGCTCGAC CGCTCGGGCA GCATGTCGAC CGACAATAAG
ATGCCGGCGG CGCACAACGC AGCGCGTCAA TTTGTCGATC TTATGCAGGT CGGTGATGGC
GTTGGCGTGG TGGGATTCGA CGACCGCGTG ACGACAGCCT TCCCGCTGAC GGTGATTACC
GATCCGCCGC CGTTATCGTC GCTGATCTTT ACCGATACGA TGGAATCCGG TACGGGCAAA
TGGATACCCG ACCCGCCCTG GGGATTGACG TCGGTTGCGT ATCGCGGCAG CGCCGCCTGG
ACAGACAGCC CGGCGGGGAA TTATGCCAAC AATGCCAATA GTGTCCTCGC AATCGCCGAT
CCTATTGTTC TTCCTGCGTC ACTGACCACG CCCGCGCTCT CTTTCTGGCA CCGTTATGAT
ATCGAAAACT ATTTCGATTA CGGCCGGGTC GAAGTTTCGA CTGATAATGG CGCTACCTGG
CAATCGCTGG CAGCATATAC CGGCGTCAAT ACAACATGGA GTCGTGCGGT GATCGATCTC
AGCCCCTATC GCGGGCAGAC GATCCGGTTG CGCTTCCGCC TGACCACCAA TGCGTATCTG
ACGCGTGATG GCTGGTATAT TGACGATGTA ACGGTGGGTC CGAAATGGGT TGATGCGCGC
GCTGACGCTA TCGCAGCGAT TGGGACGCTG ACGCCGCGCG GCAGTACGTC AATTGGCGGC
GGTCTGCAAC GCAGCCAGCA GTTGCTCTCC GCCAGCGCAC CCGGTCGCAC GCGTGCGATT
GTCCTGCTCA GCGACGGTCA GGAGAACACA GCACCCTACG TCAGCGACGT TTTGCCGCAG
ATTCGCGCGT CGCAGATCAC GGTGCATACC ATTGGCCTGG GCACAGACGC CGATCAACAA
TTGATGCTCT CAATCGCCGC ACAGACTGGC GGAACCTACA ATTATGCGCC GCGTCCTGAC
CAGCTGGCCG GCATTTATAA CACCATCTCG GGTGCGGTCA GTAATCGCCA GACTCTGATC
ACAATTGATG GCACAATTGC GGCTGGCGCT ACTGTGACCC GCGACGTAAC GGTTGACTCT
TCTGTTTCGG AAGCCACCTT CTGGGTCAGC TGGACAAATG CTGGCGCCGG TGTGACGTTC
TCGCTCCAGA CGCCCGGCGG TGCGACCATT GACGCTGCAA GCGCCGCTGC GAATGACAGT
ATTGACTACG TCGGCGGATC AACCTATGCC TACTACCGTG TGCGCGCTCC AACGCTTACC
CCTGGCGTCT GGCGCATTCT CCTCAGTCGC GCGGCGGCTT CGTCTGCGAC CATCATCGAT
GAGCCGACTG TCGCTACGTC AGAGCCGTTG GTTGAGCGTC CAGCCGACGG GTTTGTGGAA
CCAGAGCATG ATGTCACTCC CGACCTGTCG CCTGATCCTG CTGAGCGTGT CATAGAGCCA
GAGCCTGTCG CTCAATCCAC TCCTGCTGAG CATGTCGTGG AGCCAGATTC TGTCGCTCAA
CCCGCCGCCA CTGGCGATGA ACCGTTTGTG GCGCGTGTCC TGGCGCGCGC CAGTCTCACC
ATGGGCTTCT ATCTGGGCAA GATGGCGTAC CTGACGACCG AGCCGATGAT ATGTATCGTT
ACGCTGGCCG ACAACGCGCC GATTACCGGA GCGACGGTCG TGCTGACGGT TACGTTGCCT
GGTCAACCGG TCGCGCTAAC CCTGCCGCTT TATGATGACG GCAGGCACGG CGATGGCGCG
GCTGGCGACG GTGTCTATGC GACAACGTTC ATCGGCCCGT TCACTCCCGG CACAGCAACC
TTCAGTGTTA TTGCATCAGG CCGGAGCAGC GCTGGCGAAC CGTTTACTCG TCAGGGCGAA
TTGTCGACCT ATGTTGCGAC AAATCCTGAC CCGTATACGT TCGTGCACCT GCCACTGGTT
GTTCGCTGA
 
Protein sequence
MFSYRLMHAI SVVTLLVMTF GVSGALTQEI ALSATAYPPP ATPVFSMPAT GFMGLSWSGD 
GPGAHRGIDI WSWRSAGCSC REPTGCGVMP GAEVRAVYDG VVAGIYWGDR SGRWYSAGTP
NPGNYPLSVV ILEHQGVPGI STNKVYTVYQ HMANDDTRES YVAQGLAVGQ TVRQNEVIGR
QGNWRYYVAN DPRANDPITH LHFEVAYKPD TYTLPDRITV DPMPYLIGGG QSSCSGAPIN
SSAPPALKIL RPSKANPVDV GTPASPSKFT IEIAYAGSAS LQDVTVTVGG KPATIINMTP
TRYTMEVQAP PQTGSGAYDL TVSIKGATAT EASAVSYNGA SNANVILTLD RSGSMSTDNK
MPAAHNAARQ FVDLMQVGDG VGVVGFDDRV TTAFPLTVIT DPPPLSSLIF TDTMESGTGK
WIPDPPWGLT SVAYRGSAAW TDSPAGNYAN NANSVLAIAD PIVLPASLTT PALSFWHRYD
IENYFDYGRV EVSTDNGATW QSLAAYTGVN TTWSRAVIDL SPYRGQTIRL RFRLTTNAYL
TRDGWYIDDV TVGPKWVDAR ADAIAAIGTL TPRGSTSIGG GLQRSQQLLS ASAPGRTRAI
VLLSDGQENT APYVSDVLPQ IRASQITVHT IGLGTDADQQ LMLSIAAQTG GTYNYAPRPD
QLAGIYNTIS GAVSNRQTLI TIDGTIAAGA TVTRDVTVDS SVSEATFWVS WTNAGAGVTF
SLQTPGGATI DAASAAANDS IDYVGGSTYA YYRVRAPTLT PGVWRILLSR AAASSATIID
EPTVATSEPL VERPADGFVE PEHDVTPDLS PDPAERVIEP EPVAQSTPAE HVVEPDSVAQ
PAATGDEPFV ARVLARASLT MGFYLGKMAY LTTEPMICIV TLADNAPITG ATVVLTVTLP
GQPVALTLPL YDDGRHGDGA AGDGVYATTF IGPFTPGTAT FSVIASGRSS AGEPFTRQGE
LSTYVATNPD PYTFVHLPLV VR