Gene Oter_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOter_3004 
Symbol 
ID6204298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOpitutus terrae PB90-1 
KingdomBacteria 
Replicon accessionNC_010571 
Strand
Start bp3859141 
End bp3862530 
Gene Length3390 bp 
Protein Length1129 aa 
Translation table11 
GC content63% 
IMG OID641692669 
Producttype III restriction protein res subunit 
Protein accessionYP_001819885 
Protein GI182414819 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.385036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0400425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG GTTCGCAGTT TTCGTTTCTC CAGTCCGAGT GGCCAGAGGT TTTCGAGGCG 
GCGGCAAAGG CGGAAGCGCT GGCACTGGTC GATCCGCGGA CCGCGTGCTT CTACGCCCGG
CGGGCGCTGG AGATCGGGGT CACGTGGCTC TACACGAACG ACGACGCGCT GAAACTGCCG
TATCAACAGA ACCTGAGCGC GCTGATTCAC GAGCCCACGT TCAAGACCGC AGCCGGCCCG
GCGGTCTTCA ACAAGGCGGT GCTGATCACG CGCTACGGGA ACCAGGCGGT GCACAGCCAC
CGCGCGGTGA AGGCGTTCGA CGCGCTGACC ACGGTGCGCG AGCTGTTCCA CATCTGTTTC
TGGCTGACCC GAACCTACGC CCGCGGCGCC CGGCCCGCGG ACGACCTCGC GTTTGAACCG
GACACCTTGC CGAAGTCGGC GCCGGTGCCG ACGCAGACGC AGGAACAGTT GCAGAAACTC
GCGGCGCAAC TCGCCGAGAA GGACAAACGG CTCGCAGCGC TGTTTGCGGA CAAGGAAACG
CTGAGCAAAG AGCTGGCACA GGCGCGGCAG GAGATCGCCG CCATCAAAAA GCGCAACGCC
GCGACGCCGG ATCAGCACGA CTACTCCGAG GCACAAACCC GCGATGCGTT CATCGACCTG
CTGCTCAAGG AAGCCGGTTG GCCGCTGAAT CAGCCCCGCG ACCGGGAGGT CGAGGTCAGC
GGGATGCCGA ATCAGCAGAA GAAGGGCTAT GTCGACTACG TGCTGTGGGG CGACGACGGC
CGACCGCTCG CGGTCGTGGA GGCGAAGCGC ACGCGAAAAA GTGCCGCGAT CGGCCAACAG
CAGGCGAAGC TTTACGCCGA CTGTCTGGAG CAGCAGTTCG GACGGCGGCC GGTCATCTTC
TGCTCGAACG GCTATGAGCA CTGGCTGTGG GATGACGCGC TGTATCCGCC GCGGCCGGTC
CAGGGTTTTC GCAAGAAGGA CGAACTGGAG CTGATGATCC TGCGGCGGCA AACGCGCACC
TCCTTGGCTG GAGCCGCGAT CAATCCGGCG ATTGTCGAAC GCTACTATCA GACCCGCGCG
ATCCGCCGAA TCGGCGAGGC ATTCGAGGCC GACTGCGAAC GAAAGGCGCT CGTCGTCATG
GCCACCGGCG CCGGAAAGAC GCGGACGGTG ATCGCGCTGT GCGACCTGTT GATGCGATGC
AACTGGGTGA AACGGGTGTT GTTTCTCGCC GACCGCACCG CGCTCGTGAA CCAGGCGGTG
AACGCGTTCA AGCGTTTCCT GCCGGAAGCG TCACCAGTGA ATCTGGTGAC GGAACCGGAG
GCGGCGGGCC GCGTGTTTGT CTCCACCTAC CCGACGATGA TGGGGCTGAT CGATGAGACG
AAGGACGGCC AGCGGCGCTT CGGCACCGGG CATTTCGATC TGGTCATCAT CGACGAGGCC
CATCGGTCGG TTTACCAAAA ATACGGTGCG ATCTTCCGCT ATTTCGACTC GCTGCTCGTC
GGCCTGACGG CCACACCGCG GGAGGAAATC GATCGCGACA CCTACGGGCT GTTCGACCTG
GAGAAAGGCG TGCCAACGGA TGCCTACGAC CTGAAGAACG CGGTGGCGGA CAAGTTCCTC
GTGCCGGCCA AAGCCGTGTC GGTCCCGCTG AAGTTCCAGC GCGATGGGAT CAAATATGAG
GACCTGTCCG AGGAGGAGAA GGAACAGTGG GACGGGGTCG AGTGGGACGA GGAAGGCACC
ACTCCGCAGA GGGTCGAACC GGAGGCGGTC AACAAATGGC TCTTCAACAA GGACACGGTC
GACAAGGTGC TGGAGCACCT GATGACGCAT GGTCAAACCG TCGCCGGCGG TGACCGGTTG
GGAAAGACGA TCGTGTTCGC CAAGAACAAG GATCATGCCG AGTTCATCGC TCAGCGCTTT
GACGTGAACT ACCCGCACCT CAAGGGCGCG TTTGCCCGGG TGATTCACTG CGGGCTGCCC
TATGCGCAAT CGCTGATCGA TGACTTCTCC AATCCCGCGA AGATGCCGCA CATCGCGATC
TCGGTCGACA TGCTGGACAC CGGCATCGAT GTGCCGGAGG TCGTCAATCT CGTCTTCTTC
AAACTGGTCC GATCGAAAAC GAAGTTCTGG CAGATGATCG GTCGCGGCAC CCGGCTGTGT
CCGGATCTAT TCGGTCCGGG CAAACACAAG GCCTTCTTCT ACATCTTCGA TTATTGTCAG
AACCTCGAGT TCTTCAGCCA GCACCCCGAG ACGACCGCCG GTGCTTTGGG CGCGTCGCTA
AGCAAGCGCA TCTTCACGGC GCGGCTGGAA GTCATCGGCG AACTCGACCG AGCCTTCGCC
GGAATGTCGC ACGAGCCCGC GGAAGGCGAA GCCGAACTCG GCCACGAGCT GCGCGCGGTT
CTGCAGACCG AAGTGGCGGC GATGAACGTC GACAATTTCG TCGTCCGTCC GCAGCGACGG
CTCGTCGAGC GGTTCGCCAA GCCCGAGGCC TGGGTCGCAA TGGATTCCGC GGCTCGAGGC
GAACTGGCCT ATCATGTGGC CGGTTTGCCG ACGGAGCTGG ACCCGGAGGA GGAGGAAGCA
AAACGGTTCG ATCTGCTGAT CCTGAACCTT CAGCTCGCCG TGTTGCGGAA CGCGCGGGAG
TTCGAACGTC TCAAAAATCA GGTGATCGCG ATCGCCGGCC TGCTGGAGGA GAAGGCGGCC
ATCCCGATGA TCCAGGCGCA ACTAGCCCGT ATTCTGGAAG TGCAGACGGA GGGATGGTGG
ACCGACGTGA CGCTCCCGAT GCTCGAGCAG ACGCGCAAAC GCCTGCGCTC GCTGGTCAAG
CTCATCGACA AACGGCAGCG CAAACCGATC TACACGGACT TCGAGGACTC AATGGCGGCG
GCCAAGGAGG TCGTCCTCGC AGATTTCGTC ACCGCGGAAA ACTTCGAGAA GTTCCGCGAG
AAGGTGCGGG CGTTTTTGCG CGCGCATCAG AGCCACCTCA CGATCCAGAA GCTGCGGATG
AATGAACCAC TGACCGCCGT AGATCTCGCG GAGCTGGAGC GGGTGCTGGC CGAAAGCGGC
GTCGCCACTC CCGAACAGTG GGAAACGGCC AAGGCAGCCA GCGACGGTCT GGGGCTTTTC
GTGCGCTCGC TCGTCGGGCT GGATCGCGAA GCCGCGAAGC AGGCGTTGAA CGCCTTCACG
GCCGGGAAGA TCCTCACGGC CAACCAACTC GAGTTCGTGA ACATGGTGGT GGATCAGTTG
ACCGAACGAG GCGTGGTCGA GCCGAAATTG CTCTATGAAT CTCCATTCAC CGACGTAAAC
GCGCAGGGAC CGGACGGGGT ATTCGATTCG AGTCAGGTGG ACGAGTTGCT CGCGCTGCTC
GAGCAAGTTC GCGAACGGGC AGTCGTCTGA
 
Protein sequence
MSAGSQFSFL QSEWPEVFEA AAKAEALALV DPRTACFYAR RALEIGVTWL YTNDDALKLP 
YQQNLSALIH EPTFKTAAGP AVFNKAVLIT RYGNQAVHSH RAVKAFDALT TVRELFHICF
WLTRTYARGA RPADDLAFEP DTLPKSAPVP TQTQEQLQKL AAQLAEKDKR LAALFADKET
LSKELAQARQ EIAAIKKRNA ATPDQHDYSE AQTRDAFIDL LLKEAGWPLN QPRDREVEVS
GMPNQQKKGY VDYVLWGDDG RPLAVVEAKR TRKSAAIGQQ QAKLYADCLE QQFGRRPVIF
CSNGYEHWLW DDALYPPRPV QGFRKKDELE LMILRRQTRT SLAGAAINPA IVERYYQTRA
IRRIGEAFEA DCERKALVVM ATGAGKTRTV IALCDLLMRC NWVKRVLFLA DRTALVNQAV
NAFKRFLPEA SPVNLVTEPE AAGRVFVSTY PTMMGLIDET KDGQRRFGTG HFDLVIIDEA
HRSVYQKYGA IFRYFDSLLV GLTATPREEI DRDTYGLFDL EKGVPTDAYD LKNAVADKFL
VPAKAVSVPL KFQRDGIKYE DLSEEEKEQW DGVEWDEEGT TPQRVEPEAV NKWLFNKDTV
DKVLEHLMTH GQTVAGGDRL GKTIVFAKNK DHAEFIAQRF DVNYPHLKGA FARVIHCGLP
YAQSLIDDFS NPAKMPHIAI SVDMLDTGID VPEVVNLVFF KLVRSKTKFW QMIGRGTRLC
PDLFGPGKHK AFFYIFDYCQ NLEFFSQHPE TTAGALGASL SKRIFTARLE VIGELDRAFA
GMSHEPAEGE AELGHELRAV LQTEVAAMNV DNFVVRPQRR LVERFAKPEA WVAMDSAARG
ELAYHVAGLP TELDPEEEEA KRFDLLILNL QLAVLRNARE FERLKNQVIA IAGLLEEKAA
IPMIQAQLAR ILEVQTEGWW TDVTLPMLEQ TRKRLRSLVK LIDKRQRKPI YTDFEDSMAA
AKEVVLADFV TAENFEKFRE KVRAFLRAHQ SHLTIQKLRM NEPLTAVDLA ELERVLAESG
VATPEQWETA KAASDGLGLF VRSLVGLDRE AAKQALNAFT AGKILTANQL EFVNMVVDQL
TERGVVEPKL LYESPFTDVN AQGPDGVFDS SQVDELLALL EQVRERAVV