Gene RoseRS_2946 details

Gene Information

Locus tag: RoseRS_2946
Symbol:
ID: 5209914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 3683818
End bp: 3686874
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 62%
IMG OID: 640596539
Product: type III restriction enzyme, res subunit
Protein accession: YP_001277261
Protein GI: 148657056
COG category:
COG ID:
TIGRFAM ID:
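
The coordinates and lengths in this record are related by simple arithmetic. Assuming the start and end positions are 1-based and inclusive (a common convention, but an assumption here), the gene length is end minus start plus one, and the protein length is the codon count minus the terminal stop codon. A minimal Python sketch, with hypothetical function names, that checks the figures above:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3683818, 3686874)
print(length)                  # 3057
print(protein_length(length))  # 1018
```

Both values agree with the Gene Length and Protein Length fields of the record.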


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGT TTTTTGACCA ACCGATCCTG AACTCGCCCT ACGAATACCC TGCGCGCCAC 
TGGAAGCTCG AAAACGGTCA ACCGACAGGG GAGATTATCC ACGGCCGGCG TCGGGCGGAA
TTTATCACGC CCATCCCCAG GCTGAAGAAG CGCCGCGCTG CGCAGCAGGC AGAAATGATC
TTCGACGAAG GGCTGGGGCT TTCGACCGCA ACGCAGCAGT ACGACCCGAC CTCGATCATC
AACGAAGTGC GTAGCCACGT GGATGCCTGG CGCGCACTGC CGCCGGGCCA GTGGCAGGTC
ACCCCCGAAA CTGCGCGCTT GTTGCACCAC TGGCGGCATC ACCAGTTCAG CAGCGTGCGC
CCGTTCTTCT GCCAGATCGA GGCGGTCGAG ACGGTGATCT GGCTCACTGA AGTTGCGCCG
CACACGGCTG CCGGGAAGGG TCTCCTCGAT CATCTGGCGC GGGCGAACCG CGATGCCAAC
CCCGAACTCA ATCGTCTTGC GCTCAAACTC GCCACCGGCG CGGGCAAGAC CACCGTCATG
GCCATGCTGA TCGCCTGGCA GACGGTCAAC GCCGTGCGCC ATCCGCAGAG CAAACGCTTC
ACCCGTGGGT TTCTCATTGT CACGCCGGGG ATCACCATCC GGGACCGCCT GCGGGTGTTG
CTTCCGAACG ACACGGAGAA CTACTACACG ACCCGCGAGC TGGTTCCCAT TGATATGATC
GAGGATATCC ACCGCGCCCG GATTGTCATC ACCAACTACC ACGCCTTCAA ACTGCGCGAG
CGGATGGAAC TGTCCGCCGG CGGGCGGGCG CTGCTGCAGG GTCGCGGCGA ACCGATCCAG
ACCACCGAGA CGGAAGGGCA GATGCTTGCC CGCGTGATGC CTGAGCTGAT GAGCATGAAA
AACATCCTGG TGCTCAACGA CGAAGGTCAT CACTGCTACC GTGAAAAGCC GCGCGACCCG
GAAGAGGAAG ACCTGACCAG CGAAGAGAAG AAGGAAGCCG AGAAAAACAA CGAAGCCGCG
CGGTTGTGGA TCACCGGTAT CGAGACTGTC GCCCGCAAGA TCGGCGTGAG CCGGGTGATC
GACCTTTCGG CCACGCCGTT CTTTCTGCGC GGCTCAGGCT ATGCCGAGGG AACCCTGTTC
CCGTGGACGA TGAGCGATTT TTCGCTGATG GACGCCATCG AGTGTGGTAT CGTGAAGCTC
CCGCGCGTGC CCGTGGCCGA GAACATCCCG GGTGACGAGA TGCCGATGTA CCGCAATCTG
TGGGAGCACA TCCGCAAGGA TATGCCGAAG AAAGGGCGCG GCAAGGCTGG CGACCTGGAT
CCCTTGAAGA TTCCTACCCG TTTGCAGACG GCGCTCCAGG CGCTGTACGG CCACTACGAG
AAGACCTTCC GGATCTGGGA GCAGGCTGGC ATCCGTGTGC CTCCCTGCTT CATTATCGTC
TGCCAGAACA CCGCGATCTC CAAACTCGTC TACGACTATG TCGCGGGCTT TGTCCGGCAG
AACGACGATG GCACGAGCAC ACTGGTCAAC GGCCAGCTGC CGCTCTTCCG CAATTTCGAC
GAGACCACCG GCAACCCGCT GCCCCGCCCC AACACCCTGC TCATCGACAG TGAGCAGCTT
GAGTCCGGCA CGGCTCTCGA CGATAACTTC CGCGCGATGG CGGCGGACGA GATCGAGCGC
TTCCGCCGCG CCATCATTGA GCGCACCGGC GATGCGCGAA AAGCTGAAAG CCTCACCGAC
CAGGACCTGC TGCGCGAGGT TATGAACACG GTGGGCAAAC CCGGTCAGCT CGGCGAGCAG
ATCCGCTGCG TGGTCTCGGT CTCCATGCTC ACCGAAGGGT GGGACGCCAA CAACGTCACC
CACATTCTTG GCGTGCGCGC CTTTGGCACC CAGCTGTTGT GCGAGCAGGT CATTGGCCGC
GCGCTGCGCC GCCAGTCGTA TGAAGTGAAC GCTGAAGGTC TCTTCAATCC TGAATATGCC
GACATCTTCG GCATTCCCTT CGACTTCACC GCGAAGCCGG TCGTCGTCCG GCCCCAGCCG
CCGCGCCAGA CCATCCAGGT CAGGGCTGTC CGTCCCGAGC GCGATCACCT GGAAATCCGT
TTCCCGCGTG TTCAGGGCTA CCGCGTTGAG CTGCCCGACG AGCGCCTGGC TGCAAAATTC
ACGGAGGACT CCATCCTCGA ACTCACCCCC GCCCTCGTCG GACCCACCAT CACCCGCAAC
CAGGGGATCA TCGGCGAAGC GGTGGATCTG ACCCTCGCGC ACCTCGAGGA CATGCGTCCT
TCGGCGCTGC TCTTCAACCT CACGAAGCAC CTGCTCTATA ACAAATGGCG CGATCCGGGC
GAAGAACCGA AGCTGCACCT CTTTGGCCAG CTCAAGCGCA TCACCGGGGA ATGGCTGGAT
CGTTGTCTCG TCTGCAAGGG CGACACGTAC CCCGCCTTGC TCATGTACCA GGAACTCGCC
GACATGGCCT GCAATAAGAT CACCGCCGCT ATCACGCGCG AGTTTCAGGA CCGGCGCCCG
ATCAAGGCGC TGCTCGATCC CTATAACCCC ACCGGATCAA CTGCGTATGT GCGTTTCTCT
ACTACGCGGC AAACGCTATG GGACACCGCC GGACCGCCGC CGAAGTGCCA CGTCAACTGG
ATCGTGCTCG ATAGCGATTG GGAAGCCGAG TTCTGCCGGG TGGCGGAAAG CCATCCCCGC
GTGCTCGCCT ACGTGAAGAA CCACAACCTT GGCTTCGAAG TCCCCTACCG CTACGGCTCG
GAAACCCGCG CCTATCGCCC CGACTTCATC GTGCTGGTGG ACGATGGCCG GGGTCCGCAC
GACCCGCTGC ACCTCGTGAT CGAGATCAAG GGCTATCGCG GCGAGGATGC GAAGGAGAAG
AAATCGACGA TGGAAACCTT CTGGATTCCC GGCGTGAACA ACCTCAAGAC CTATGGCCGC
TGGGCGTTTG CCGAGTTCGG CGACATCTGG CAGATACAGA AGGCGTTCGA TCAGTTGCTT
GAGCAGATGA TTCGCCCACA CGGAGCTGCG GAGCGCGCGG AGGCAGGAAC TGACTGA
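
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following Python sketch (function names are illustrative, not part of the source database) shows one way to strip the spacing and estimate GC content, which the record reports as 62% for the full gene. Only the first 30 bases are used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence as printed in this record.
fragment = clean_sequence("ATGAGCCAGT TTTTTGACCA ACCGATCCTG")
print(len(fragment))  # 30
print(f"GC fraction of fragment: {gc_fraction(fragment):.2f}")
```

Passing the complete sequence through `clean_sequence` should give a string of 3057 bases if the listing is intact.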
 
Protein sequence
MSQFFDQPIL NSPYEYPARH WKLENGQPTG EIIHGRRRAE FITPIPRLKK RRAAQQAEMI 
FDEGLGLSTA TQQYDPTSII NEVRSHVDAW RALPPGQWQV TPETARLLHH WRHHQFSSVR
PFFCQIEAVE TVIWLTEVAP HTAAGKGLLD HLARANRDAN PELNRLALKL ATGAGKTTVM
AMLIAWQTVN AVRHPQSKRF TRGFLIVTPG ITIRDRLRVL LPNDTENYYT TRELVPIDMI
EDIHRARIVI TNYHAFKLRE RMELSAGGRA LLQGRGEPIQ TTETEGQMLA RVMPELMSMK
NILVLNDEGH HCYREKPRDP EEEDLTSEEK KEAEKNNEAA RLWITGIETV ARKIGVSRVI
DLSATPFFLR GSGYAEGTLF PWTMSDFSLM DAIECGIVKL PRVPVAENIP GDEMPMYRNL
WEHIRKDMPK KGRGKAGDLD PLKIPTRLQT ALQALYGHYE KTFRIWEQAG IRVPPCFIIV
CQNTAISKLV YDYVAGFVRQ NDDGTSTLVN GQLPLFRNFD ETTGNPLPRP NTLLIDSEQL
ESGTALDDNF RAMAADEIER FRRAIIERTG DARKAESLTD QDLLREVMNT VGKPGQLGEQ
IRCVVSVSML TEGWDANNVT HILGVRAFGT QLLCEQVIGR ALRRQSYEVN AEGLFNPEYA
DIFGIPFDFT AKPVVVRPQP PRQTIQVRAV RPERDHLEIR FPRVQGYRVE LPDERLAAKF
TEDSILELTP ALVGPTITRN QGIIGEAVDL TLAHLEDMRP SALLFNLTKH LLYNKWRDPG
EEPKLHLFGQ LKRITGEWLD RCLVCKGDTY PALLMYQELA DMACNKITAA ITREFQDRRP
IKALLDPYNP TGSTAYVRFS TTRQTLWDTA GPPPKCHVNW IVLDSDWEAE FCRVAESHPR
VLAYVKNHNL GFEVPYRYGS ETRAYRPDFI VLVDDGRGPH DPLHLVIEIK GYRGEDAKEK
KSTMETFWIP GVNNLKTYGR WAFAEFGDIW QIQKAFDQLL EQMIRPHGAA ERAEAGTD
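
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration rather than the database's own method. It builds the codon table from the 64-letter amino acid string shared by NCBI translation tables 1 and 11, which differ in their permitted start codons rather than in codon assignments. Because this gene starts with ATG, no special start-codon handling is needed. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, with block spacing removed.
dna = "".join("ATGAGCCAGT TTTTTGACCA ACCGATCCTG".split())
print(translate(dna))  # MSQFFDQPIL
```

The output matches the first ten residues of the protein sequence listed above.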