Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1579 |
Symbol | |
ID | 5208534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1930429 |
End bp | 1932477 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640595185 |
Product | oligopeptidase B |
Protein accession | YP_001275921 |
Protein GI | 148655716 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.767471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACAC CCCCTGTTCC GCCAAAGCAA CCGCACGTGG TCTCCATCCA CGGCAATCAG GTGATCGACA ACTACTTCTG GATGCGCGAA CGCGATAACC CGGAGGTCAT CGCCCATCTT GAAGCCGAAA ACCGCTACAC GGAAGAGATG ACGGCGCACA TTGCCGGGCT GCGTGAGCGC CTGTACAGCG AGATGCGCAG CAGATTGCGC GAGGAGGATG AAAGCGTTCC TGATCGCTAC GGACCGTTCG TGTACTTCAC GCGCACTCAG GCAGGACGAC AATACCCGAT TGTGTACCGT CGCCCCGTTC ACAATGCACA AGAAGAGATC CTCCTCGACA TCAACACCCT GGCGGAAGGA CACGCCTTCA CCCGTATCGG GGTCTTTCGA CCGACGCACG ATGGACGCCT GCTCGCCTGG TCGGTCGATG TGAATGGATC GGAAACGTAC ACGCTTTTCA TCAAAGATCT GACAACCGGC GCGCTGCTGG ATCGTCCGAT TTCCAACACC TACTATGGCG TCGCCTGGAG CAACGATGGA CAGTACCTGT TCTACACCAC CCTCGATGAT GCCAGACGCC CCTATCGCGT CTACCGCCAC GCCATCGGAA GCGATCCAGA AGCGGACACA CTGGTGTACG AAGAGACGGA TCATCTCTTC CACGTCGACG TTTCCCTGAC CCGAAGCCGG GCATACATCC TGGTGACATC GCACAGCAAC ACAACCTCAG AAGTGTATGC GATTCCGGCT GATGAACCGA CGACGGCGCC GCGCCTTCTT CTGCCGCGCC GCCATCGGGT TGAGTATACA GCGCATCACT GCGGTGACCA TTTTTACTTT CTGACGAATG ACAACGCGCT GAATTTCCGC GTCCTGCGCA CGCCGGTCGA TGATGCGCGC CTGGAACGGA TGGAAGAAAT CATCCCGCAC CGCAACGACG TGATGATCGA TGACATAGCG CTCTTCGCCG ATCATCTGGT GGCATACGAA CGCGCCGATG CACGGGAGCG CGTCGAGATT ATCGATCTGC GCACCGGCGA AGCGCACCTG CTGACATTCC CAGAGCAGGT CTACACCCTG CAACCGTGGG ACAGAGACGC GCTGTGGGAA CCAAACCTGG AGTTTGACAC TGCCGTTTTG CGGCTCCACG TCATGTCGCT CACTCAGCCG CGCACTATCT ATGACTACGA TATGACCTCG CGTGTCCTGC AGTTGGTGAA GCGCGACGAC ATCCCCGGCT ACGATCCATC GCGCTACCGC AGCGAACGCC TGTGGGCGAC GGCAGGCGAC GGCGTCCGCA TACCGATCTC CATTGTCTAT CGCGCCGATG TGACACGTCC GGCGCCACTG CTGCTCTACG GCTACGGTTC GTATGGCGCC ACCGCCGATC CGCGTTTCTC GCTCGAACGG ATCAGCCTGC TGGATCGCGG CGTTATCTTT GCAATCGCCC ACGTTCGCGG CGGTGGAGAG TTGGGACGCG CGTGGTACGA AGCGGGCAAG ATGTTGAACA AGCGCAACAC CTTCACCGAT TTCATTGCCT GCGCTGAACA CCTGATTGCC GGGGGATACA CCACGCCAGA GCGACTCGCA ATCATGGGAC GCAGCGCCGG TGGCTTGCTG GTAGGTGCAG TCACAACGAT GCGACCAGAT CTGATGCGGT GCGTGATCGC CGATGTTCCG TTCGTCGATG TGATCAACAC TATGCTCGAT CCGTCGATCC CGCTGACGGC AATCGAGTTC GAAGAATGGG GAAATCCGGC GATTGCAGAA CAGTACGCAT ATATGAAGTC CTATTCTCCC TACGACAACA CCACGCCGCG TGCATATCCG GCAATCCTGG CGACCGCCGG CTTGCACGAT CCGCGTGTGC AGTACTGGGA ACCCGCCAAA TGGGTGGCAA AACTGCGCGA GGTCAAAACG AACGACACAC CAGTGTTGCT GAAGACCGAA ATGACCGCCG GGCATGCCGG TCCTTCCGGG CGCTATGATC GCTTGCGCGA CACAGCCTTC GAGTATGCGT TTCTCCTCGA TCACCTGAGG GCATCATAA
|
Protein sequence | MPTPPVPPKQ PHVVSIHGNQ VIDNYFWMRE RDNPEVIAHL EAENRYTEEM TAHIAGLRER LYSEMRSRLR EEDESVPDRY GPFVYFTRTQ AGRQYPIVYR RPVHNAQEEI LLDINTLAEG HAFTRIGVFR PTHDGRLLAW SVDVNGSETY TLFIKDLTTG ALLDRPISNT YYGVAWSNDG QYLFYTTLDD ARRPYRVYRH AIGSDPEADT LVYEETDHLF HVDVSLTRSR AYILVTSHSN TTSEVYAIPA DEPTTAPRLL LPRRHRVEYT AHHCGDHFYF LTNDNALNFR VLRTPVDDAR LERMEEIIPH RNDVMIDDIA LFADHLVAYE RADARERVEI IDLRTGEAHL LTFPEQVYTL QPWDRDALWE PNLEFDTAVL RLHVMSLTQP RTIYDYDMTS RVLQLVKRDD IPGYDPSRYR SERLWATAGD GVRIPISIVY RADVTRPAPL LLYGYGSYGA TADPRFSLER ISLLDRGVIF AIAHVRGGGE LGRAWYEAGK MLNKRNTFTD FIACAEHLIA GGYTTPERLA IMGRSAGGLL VGAVTTMRPD LMRCVIADVP FVDVINTMLD PSIPLTAIEF EEWGNPAIAE QYAYMKSYSP YDNTTPRAYP AILATAGLHD PRVQYWEPAK WVAKLREVKT NDTPVLLKTE MTAGHAGPSG RYDRLRDTAF EYAFLLDHLR AS
|
| |