Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2946 |
Symbol | |
ID | 5209914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3683818 |
End bp | 3686874 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596539 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001277261 |
Protein GI | 148657056 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGT TTTTTGACCA ACCGATCCTG AACTCGCCCT ACGAATACCC TGCGCGCCAC TGGAAGCTCG AAAACGGTCA ACCGACAGGG GAGATTATCC ACGGCCGGCG TCGGGCGGAA TTTATCACGC CCATCCCCAG GCTGAAGAAG CGCCGCGCTG CGCAGCAGGC AGAAATGATC TTCGACGAAG GGCTGGGGCT TTCGACCGCA ACGCAGCAGT ACGACCCGAC CTCGATCATC AACGAAGTGC GTAGCCACGT GGATGCCTGG CGCGCACTGC CGCCGGGCCA GTGGCAGGTC ACCCCCGAAA CTGCGCGCTT GTTGCACCAC TGGCGGCATC ACCAGTTCAG CAGCGTGCGC CCGTTCTTCT GCCAGATCGA GGCGGTCGAG ACGGTGATCT GGCTCACTGA AGTTGCGCCG CACACGGCTG CCGGGAAGGG TCTCCTCGAT CATCTGGCGC GGGCGAACCG CGATGCCAAC CCCGAACTCA ATCGTCTTGC GCTCAAACTC GCCACCGGCG CGGGCAAGAC CACCGTCATG GCCATGCTGA TCGCCTGGCA GACGGTCAAC GCCGTGCGCC ATCCGCAGAG CAAACGCTTC ACCCGTGGGT TTCTCATTGT CACGCCGGGG ATCACCATCC GGGACCGCCT GCGGGTGTTG CTTCCGAACG ACACGGAGAA CTACTACACG ACCCGCGAGC TGGTTCCCAT TGATATGATC GAGGATATCC ACCGCGCCCG GATTGTCATC ACCAACTACC ACGCCTTCAA ACTGCGCGAG CGGATGGAAC TGTCCGCCGG CGGGCGGGCG CTGCTGCAGG GTCGCGGCGA ACCGATCCAG ACCACCGAGA CGGAAGGGCA GATGCTTGCC CGCGTGATGC CTGAGCTGAT GAGCATGAAA AACATCCTGG TGCTCAACGA CGAAGGTCAT CACTGCTACC GTGAAAAGCC GCGCGACCCG GAAGAGGAAG ACCTGACCAG CGAAGAGAAG AAGGAAGCCG AGAAAAACAA CGAAGCCGCG CGGTTGTGGA TCACCGGTAT CGAGACTGTC GCCCGCAAGA TCGGCGTGAG CCGGGTGATC GACCTTTCGG CCACGCCGTT CTTTCTGCGC GGCTCAGGCT ATGCCGAGGG AACCCTGTTC CCGTGGACGA TGAGCGATTT TTCGCTGATG GACGCCATCG AGTGTGGTAT CGTGAAGCTC CCGCGCGTGC CCGTGGCCGA GAACATCCCG GGTGACGAGA TGCCGATGTA CCGCAATCTG TGGGAGCACA TCCGCAAGGA TATGCCGAAG AAAGGGCGCG GCAAGGCTGG CGACCTGGAT CCCTTGAAGA TTCCTACCCG TTTGCAGACG GCGCTCCAGG CGCTGTACGG CCACTACGAG AAGACCTTCC GGATCTGGGA GCAGGCTGGC ATCCGTGTGC CTCCCTGCTT CATTATCGTC TGCCAGAACA CCGCGATCTC CAAACTCGTC TACGACTATG TCGCGGGCTT TGTCCGGCAG AACGACGATG GCACGAGCAC ACTGGTCAAC GGCCAGCTGC CGCTCTTCCG CAATTTCGAC GAGACCACCG GCAACCCGCT GCCCCGCCCC AACACCCTGC TCATCGACAG TGAGCAGCTT GAGTCCGGCA CGGCTCTCGA CGATAACTTC CGCGCGATGG CGGCGGACGA GATCGAGCGC TTCCGCCGCG CCATCATTGA GCGCACCGGC GATGCGCGAA AAGCTGAAAG CCTCACCGAC CAGGACCTGC TGCGCGAGGT TATGAACACG GTGGGCAAAC CCGGTCAGCT CGGCGAGCAG ATCCGCTGCG TGGTCTCGGT CTCCATGCTC ACCGAAGGGT GGGACGCCAA CAACGTCACC CACATTCTTG GCGTGCGCGC CTTTGGCACC CAGCTGTTGT GCGAGCAGGT CATTGGCCGC GCGCTGCGCC GCCAGTCGTA TGAAGTGAAC GCTGAAGGTC TCTTCAATCC TGAATATGCC GACATCTTCG GCATTCCCTT CGACTTCACC GCGAAGCCGG TCGTCGTCCG GCCCCAGCCG CCGCGCCAGA CCATCCAGGT CAGGGCTGTC CGTCCCGAGC GCGATCACCT GGAAATCCGT TTCCCGCGTG TTCAGGGCTA CCGCGTTGAG CTGCCCGACG AGCGCCTGGC TGCAAAATTC ACGGAGGACT CCATCCTCGA ACTCACCCCC GCCCTCGTCG GACCCACCAT CACCCGCAAC CAGGGGATCA TCGGCGAAGC GGTGGATCTG ACCCTCGCGC ACCTCGAGGA CATGCGTCCT TCGGCGCTGC TCTTCAACCT CACGAAGCAC CTGCTCTATA ACAAATGGCG CGATCCGGGC GAAGAACCGA AGCTGCACCT CTTTGGCCAG CTCAAGCGCA TCACCGGGGA ATGGCTGGAT CGTTGTCTCG TCTGCAAGGG CGACACGTAC CCCGCCTTGC TCATGTACCA GGAACTCGCC GACATGGCCT GCAATAAGAT CACCGCCGCT ATCACGCGCG AGTTTCAGGA CCGGCGCCCG ATCAAGGCGC TGCTCGATCC CTATAACCCC ACCGGATCAA CTGCGTATGT GCGTTTCTCT ACTACGCGGC AAACGCTATG GGACACCGCC GGACCGCCGC CGAAGTGCCA CGTCAACTGG ATCGTGCTCG ATAGCGATTG GGAAGCCGAG TTCTGCCGGG TGGCGGAAAG CCATCCCCGC GTGCTCGCCT ACGTGAAGAA CCACAACCTT GGCTTCGAAG TCCCCTACCG CTACGGCTCG GAAACCCGCG CCTATCGCCC CGACTTCATC GTGCTGGTGG ACGATGGCCG GGGTCCGCAC GACCCGCTGC ACCTCGTGAT CGAGATCAAG GGCTATCGCG GCGAGGATGC GAAGGAGAAG AAATCGACGA TGGAAACCTT CTGGATTCCC GGCGTGAACA ACCTCAAGAC CTATGGCCGC TGGGCGTTTG CCGAGTTCGG CGACATCTGG CAGATACAGA AGGCGTTCGA TCAGTTGCTT GAGCAGATGA TTCGCCCACA CGGAGCTGCG GAGCGCGCGG AGGCAGGAAC TGACTGA
|
Protein sequence | MSQFFDQPIL NSPYEYPARH WKLENGQPTG EIIHGRRRAE FITPIPRLKK RRAAQQAEMI FDEGLGLSTA TQQYDPTSII NEVRSHVDAW RALPPGQWQV TPETARLLHH WRHHQFSSVR PFFCQIEAVE TVIWLTEVAP HTAAGKGLLD HLARANRDAN PELNRLALKL ATGAGKTTVM AMLIAWQTVN AVRHPQSKRF TRGFLIVTPG ITIRDRLRVL LPNDTENYYT TRELVPIDMI EDIHRARIVI TNYHAFKLRE RMELSAGGRA LLQGRGEPIQ TTETEGQMLA RVMPELMSMK NILVLNDEGH HCYREKPRDP EEEDLTSEEK KEAEKNNEAA RLWITGIETV ARKIGVSRVI DLSATPFFLR GSGYAEGTLF PWTMSDFSLM DAIECGIVKL PRVPVAENIP GDEMPMYRNL WEHIRKDMPK KGRGKAGDLD PLKIPTRLQT ALQALYGHYE KTFRIWEQAG IRVPPCFIIV CQNTAISKLV YDYVAGFVRQ NDDGTSTLVN GQLPLFRNFD ETTGNPLPRP NTLLIDSEQL ESGTALDDNF RAMAADEIER FRRAIIERTG DARKAESLTD QDLLREVMNT VGKPGQLGEQ IRCVVSVSML TEGWDANNVT HILGVRAFGT QLLCEQVIGR ALRRQSYEVN AEGLFNPEYA DIFGIPFDFT AKPVVVRPQP PRQTIQVRAV RPERDHLEIR FPRVQGYRVE LPDERLAAKF TEDSILELTP ALVGPTITRN QGIIGEAVDL TLAHLEDMRP SALLFNLTKH LLYNKWRDPG EEPKLHLFGQ LKRITGEWLD RCLVCKGDTY PALLMYQELA DMACNKITAA ITREFQDRRP IKALLDPYNP TGSTAYVRFS TTRQTLWDTA GPPPKCHVNW IVLDSDWEAE FCRVAESHPR VLAYVKNHNL GFEVPYRYGS ETRAYRPDFI VLVDDGRGPH DPLHLVIEIK GYRGEDAKEK KSTMETFWIP GVNNLKTYGR WAFAEFGDIW QIQKAFDQLL EQMIRPHGAA ERAEAGTD
|
| |