Gene RoseRS_1579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1579 
Symbol 
ID5208534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1930429 
End bp1932477 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content59% 
IMG OID640595185 
Productoligopeptidase B 
Protein accessionYP_001275921 
Protein GI148655716 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.767471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACAC CCCCTGTTCC GCCAAAGCAA CCGCACGTGG TCTCCATCCA CGGCAATCAG 
GTGATCGACA ACTACTTCTG GATGCGCGAA CGCGATAACC CGGAGGTCAT CGCCCATCTT
GAAGCCGAAA ACCGCTACAC GGAAGAGATG ACGGCGCACA TTGCCGGGCT GCGTGAGCGC
CTGTACAGCG AGATGCGCAG CAGATTGCGC GAGGAGGATG AAAGCGTTCC TGATCGCTAC
GGACCGTTCG TGTACTTCAC GCGCACTCAG GCAGGACGAC AATACCCGAT TGTGTACCGT
CGCCCCGTTC ACAATGCACA AGAAGAGATC CTCCTCGACA TCAACACCCT GGCGGAAGGA
CACGCCTTCA CCCGTATCGG GGTCTTTCGA CCGACGCACG ATGGACGCCT GCTCGCCTGG
TCGGTCGATG TGAATGGATC GGAAACGTAC ACGCTTTTCA TCAAAGATCT GACAACCGGC
GCGCTGCTGG ATCGTCCGAT TTCCAACACC TACTATGGCG TCGCCTGGAG CAACGATGGA
CAGTACCTGT TCTACACCAC CCTCGATGAT GCCAGACGCC CCTATCGCGT CTACCGCCAC
GCCATCGGAA GCGATCCAGA AGCGGACACA CTGGTGTACG AAGAGACGGA TCATCTCTTC
CACGTCGACG TTTCCCTGAC CCGAAGCCGG GCATACATCC TGGTGACATC GCACAGCAAC
ACAACCTCAG AAGTGTATGC GATTCCGGCT GATGAACCGA CGACGGCGCC GCGCCTTCTT
CTGCCGCGCC GCCATCGGGT TGAGTATACA GCGCATCACT GCGGTGACCA TTTTTACTTT
CTGACGAATG ACAACGCGCT GAATTTCCGC GTCCTGCGCA CGCCGGTCGA TGATGCGCGC
CTGGAACGGA TGGAAGAAAT CATCCCGCAC CGCAACGACG TGATGATCGA TGACATAGCG
CTCTTCGCCG ATCATCTGGT GGCATACGAA CGCGCCGATG CACGGGAGCG CGTCGAGATT
ATCGATCTGC GCACCGGCGA AGCGCACCTG CTGACATTCC CAGAGCAGGT CTACACCCTG
CAACCGTGGG ACAGAGACGC GCTGTGGGAA CCAAACCTGG AGTTTGACAC TGCCGTTTTG
CGGCTCCACG TCATGTCGCT CACTCAGCCG CGCACTATCT ATGACTACGA TATGACCTCG
CGTGTCCTGC AGTTGGTGAA GCGCGACGAC ATCCCCGGCT ACGATCCATC GCGCTACCGC
AGCGAACGCC TGTGGGCGAC GGCAGGCGAC GGCGTCCGCA TACCGATCTC CATTGTCTAT
CGCGCCGATG TGACACGTCC GGCGCCACTG CTGCTCTACG GCTACGGTTC GTATGGCGCC
ACCGCCGATC CGCGTTTCTC GCTCGAACGG ATCAGCCTGC TGGATCGCGG CGTTATCTTT
GCAATCGCCC ACGTTCGCGG CGGTGGAGAG TTGGGACGCG CGTGGTACGA AGCGGGCAAG
ATGTTGAACA AGCGCAACAC CTTCACCGAT TTCATTGCCT GCGCTGAACA CCTGATTGCC
GGGGGATACA CCACGCCAGA GCGACTCGCA ATCATGGGAC GCAGCGCCGG TGGCTTGCTG
GTAGGTGCAG TCACAACGAT GCGACCAGAT CTGATGCGGT GCGTGATCGC CGATGTTCCG
TTCGTCGATG TGATCAACAC TATGCTCGAT CCGTCGATCC CGCTGACGGC AATCGAGTTC
GAAGAATGGG GAAATCCGGC GATTGCAGAA CAGTACGCAT ATATGAAGTC CTATTCTCCC
TACGACAACA CCACGCCGCG TGCATATCCG GCAATCCTGG CGACCGCCGG CTTGCACGAT
CCGCGTGTGC AGTACTGGGA ACCCGCCAAA TGGGTGGCAA AACTGCGCGA GGTCAAAACG
AACGACACAC CAGTGTTGCT GAAGACCGAA ATGACCGCCG GGCATGCCGG TCCTTCCGGG
CGCTATGATC GCTTGCGCGA CACAGCCTTC GAGTATGCGT TTCTCCTCGA TCACCTGAGG
GCATCATAA
 
Protein sequence
MPTPPVPPKQ PHVVSIHGNQ VIDNYFWMRE RDNPEVIAHL EAENRYTEEM TAHIAGLRER 
LYSEMRSRLR EEDESVPDRY GPFVYFTRTQ AGRQYPIVYR RPVHNAQEEI LLDINTLAEG
HAFTRIGVFR PTHDGRLLAW SVDVNGSETY TLFIKDLTTG ALLDRPISNT YYGVAWSNDG
QYLFYTTLDD ARRPYRVYRH AIGSDPEADT LVYEETDHLF HVDVSLTRSR AYILVTSHSN
TTSEVYAIPA DEPTTAPRLL LPRRHRVEYT AHHCGDHFYF LTNDNALNFR VLRTPVDDAR
LERMEEIIPH RNDVMIDDIA LFADHLVAYE RADARERVEI IDLRTGEAHL LTFPEQVYTL
QPWDRDALWE PNLEFDTAVL RLHVMSLTQP RTIYDYDMTS RVLQLVKRDD IPGYDPSRYR
SERLWATAGD GVRIPISIVY RADVTRPAPL LLYGYGSYGA TADPRFSLER ISLLDRGVIF
AIAHVRGGGE LGRAWYEAGK MLNKRNTFTD FIACAEHLIA GGYTTPERLA IMGRSAGGLL
VGAVTTMRPD LMRCVIADVP FVDVINTMLD PSIPLTAIEF EEWGNPAIAE QYAYMKSYSP
YDNTTPRAYP AILATAGLHD PRVQYWEPAK WVAKLREVKT NDTPVLLKTE MTAGHAGPSG
RYDRLRDTAF EYAFLLDHLR AS