Gene Information |
Locus tag | Maqu_2392 |
Symbol | |
ID | 4656668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinobacter aquaeolei VT8 |
Kingdom | Bacteria |
Replicon accession | NC_008740 |
Strand | + |
Start bp | 2673109 |
End bp | 2676243 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639812367 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_959655 |
Protein GI | 120555304 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
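The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp rows for Maqu_2392.
length = gene_length_bp(2673109, 2676243)
print(length, protein_length_aa(length))  # 3135 1044
```

Under these assumptions the result agrees with the Gene Length (3135 bp) and Protein Length (1044 aa) rows.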
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCAGG ATGAAGTCGA AAAAGTCGAA CTGCCCGCCA TTGAGCAACT GACCCGCCTT GGCTGGACCT ACGTGCCCGG CACCGACCTT GCGCCAGTCC TACCCGTGGA TGGCGCACCG GTCGGAGAAC GAGGCTACTA CCGGGATGTA GTGTTGTCCA AACGACTGGA AGCTGCCGTG GTCCGGTTGA ACCCATGGAT CAGTGACGAA AACCTGCGCA AAGTGGTCCG GGAATTCACC CATCCCAACC ATGCGGGCTT GATGGAATAC AACCAGTCCA TCTATGACCT GCTGGTGAAC TACCTGTCCG TTGAACAGGA CCTGGGTGCC GGTCGCAAGG GGCAGACGGT CAAGGTCATC GACTTCGACA ACCCTGATAA CAACGAATTC CTATGCACTA ACCAATTCAA GGTTGAAGGC ATCAACCAGA ACATCATCCC GGATGTTGTC TGCTTCGTGA ACGGCTTACC GCTGGCGGTC ATTGAGTGCA AATCGCCGTA TGTTGCCGAC GCGGTAGCTG AAGGCATTAA CCAGCTGCGC CGATACGCCA ACCTGCGCAC CCCGGAAGAC GATGAAGGTG CCCAGAAGCT GTTCTGGTAC AACCAGCTGA TGGTATCCAC CTGCCGGGAT CAGGCGAAGG TGGGCACCAT CAGTTCCAGC GCCCAGCATT ACGGGGACTG GAAGGATGCT TACCCGTTTA CCGATGCCGA TATTCAGGCG ATGGTGCAAC AGCCTGCCTT TATCTCCACC GGCTCGACAG CATCCAGCCA CAAGACCGGT GACGTACCCT CACCAGAGTA TCTGGTTGCC GCGAAACAGA TCGCTTCAGG TGCTGGAACC AGTTCAGGCG AAGTCACCCC GCAACAGCGA TTACTTGCCG GTATGTTCAA CCCCGCCAGC TTCCTGGACC TGATGCAGAA CTTCACCCTG TTCGAACCGG TGGATGGTCG GCTGGTCAAG AAAGTGGCAC GGTATCAGCA ATACCGAGCG GTGAATAAGG TCATAGAACG CCTTAAGAAC GGCACCACGC GCAAGGAAAA GTCCGGGGTA GTCTGGCATA CCCAAGGCTC CGGAAAGTCC CTGACGATGG TTATGCTGGC GGTGAAGATG CGCCGTGACC CGGAGTTGAA GCAGTACAAG CTGGTGTTCG TGACTGACCG CACCCAACTG GACGGGCAAC TCTCGAAAAC CTTCCGGGAT GCCCAGAACG AAACTATCTA CAACGCTGGG TCAGTGGCGG AACTGAAGGA CCTGTTGTCC AAAGATTCTT CCGACATTGT GACCGCCATG GTGCAGAAGT TTCAGGAAGC GGAGCAGGAA GGCGACTTTA CGGACCTGAA CCCCAGCGAA AAGATCATTG TGCTGGCGGA TGAAGCCCAC CGTACCCAGT TTGGCGGGCT GGCAATGACC ATCAATGCCG CCCTGCCCAA TGCCCCGAAG ATCGGCTTTA CCGGCACTCC CCTGCTGAAG TCCCAGAAGA TGAACAAAGC CTTCGGTGGC TACATCGACA AGTACAAGAT CAACGAAGCC GTGGATGACG GTGCCACCGT GCGCATCATC TACGAAGGTC GTCAGGTTCA GAGCGATGTG GTGGGTGACT CGCTGGATGC GCTGTTTGAA GAATACTTCA AAGACAAAAG CGACGACGAA AAGCGGGAAA TCAAAAAGAA ATACGGTGTG GAACGCGCGG TACGGGAAGC CCCTGCCCGT ATTCGCTGGG TGTGTATCGA CCTGCTGAAG CATTACCGGG AAAAGATTCA GCCCAACGGC TTCAAAGCCA TGATTGTGGT GGGCAGCCGC 
CATGCCGCCA CGGTGTTCAA GAAGACCCTG GACGATCTGG AAGCGCCCCC TTCTGAAGTG ATCATTTCCG GCACCCACAA CGACCCGGCA TACCTCAGCA AGTACACCGA CAGCGCGCAC CAGAAGCAGG TGATCAAGAA CTTCACCAAG CCGCTGGGAA CGGATAAAGA GAAGGAAGAC CCCACCAGCT TCCTGATCGT GAAGGATATG CTGCTGACTG GCTTCGATGC GCCTATTGCA CAGGTGATGT ATATCGACCG GAGCCTGAAG GACCACACCT TGATGCAGGC AATCGCACGG GTGAACCGAA CCTATAAGGG CAAAGCCTGC GGTTACGTGG TGGATTATTA CGGGCTGTCA GACTACCTGA CCGACGCGCT GAAGCAGTTC AGTAGTGAGG ACGTGGAAGG CACCTACCAC ACCCTGAAGG ATGAAATACC GCGCCTGAAA GCGGCTCACA CCCGTGTGGC TGCCATCTTC GCCGGTGTTA AAGGCACGGA TGTAGATGAC TATGTGCTTC ACCTGAAGGA TGAAGACACC CGCCAGCAGT TCGAACTGGC GTTCAGGAAG TTCGCCAAGC AGATGGACGT GATTCTGCCG GATATCGCCG CCAAGCCGTT CATCCCGGAC CTGAAGTTGT GGGGCAAGGT TCAGAACGCC GCCCGTAACC GCTACCGGGA CCCCGGACTG AACATCAGCG ATGCTGGTGC GAAAGTCCGC CAGCTGGTGG ATGAACACAT CATCAGCACC GGGGTAGACC CGAAGATTCC ACCCGTGGAC CTGATGGCAG CGAACTTCAA GGAAACCGTG GAGCAGATCA AATCCCCGGA ATCCCGCGCA TCCGAAATCG AAAGCGCCAT CAAGCACCAC ATCACGGTGA ATCTGGACGA AGACCCGGAA TACTACAAAT CCCTGAGCCT TCGCCTGCGG GACATAATCG AAAAGACGGC GGGCAAGTGG GACCAGCAGT TGGAACTGCT GCTTCAGATG GTCGATAGCA TGGAGTCTGA ACACAAACAG GCTGCTGACG ATCTGGGGCT GAGTGAAACC GAATTCGCCT TCTACAACAT CCTGATGGCA GAAGTGACCA GCCATGCCGG TGAAGATGCC ATCAGCGAAG CCGTACACAA CGAAATCAAG GCAACCAGCC AGGACCTGGT GGGCATGTTC GATGAAGCCA CCCAGATTGT GGACTTCTTC AGCAAGCCGG ACGAAGTGAA GCGAATGAAG AAGGAAATCA AACGGGCGGT TCTTGATTGT TCGTTTGGCG AGAAAGCCCT GGTGAACGTG GTTCAGGATC GTTTCATGGA CCTTGCGAAG AACAAGTTTA AATGA
|
Protein sequence | MFQDEVEKVE LPAIEQLTRL GWTYVPGTDL APVLPVDGAP VGERGYYRDV VLSKRLEAAV VRLNPWISDE NLRKVVREFT HPNHAGLMEY NQSIYDLLVN YLSVEQDLGA GRKGQTVKVI DFDNPDNNEF LCTNQFKVEG INQNIIPDVV CFVNGLPLAV IECKSPYVAD AVAEGINQLR RYANLRTPED DEGAQKLFWY NQLMVSTCRD QAKVGTISSS AQHYGDWKDA YPFTDADIQA MVQQPAFIST GSTASSHKTG DVPSPEYLVA AKQIASGAGT SSGEVTPQQR LLAGMFNPAS FLDLMQNFTL FEPVDGRLVK KVARYQQYRA VNKVIERLKN GTTRKEKSGV VWHTQGSGKS LTMVMLAVKM RRDPELKQYK LVFVTDRTQL DGQLSKTFRD AQNETIYNAG SVAELKDLLS KDSSDIVTAM VQKFQEAEQE GDFTDLNPSE KIIVLADEAH RTQFGGLAMT INAALPNAPK IGFTGTPLLK SQKMNKAFGG YIDKYKINEA VDDGATVRII YEGRQVQSDV VGDSLDALFE EYFKDKSDDE KREIKKKYGV ERAVREAPAR IRWVCIDLLK HYREKIQPNG FKAMIVVGSR HAATVFKKTL DDLEAPPSEV IISGTHNDPA YLSKYTDSAH QKQVIKNFTK PLGTDKEKED PTSFLIVKDM LLTGFDAPIA QVMYIDRSLK DHTLMQAIAR VNRTYKGKAC GYVVDYYGLS DYLTDALKQF SSEDVEGTYH TLKDEIPRLK AAHTRVAAIF AGVKGTDVDD YVLHLKDEDT RQQFELAFRK FAKQMDVILP DIAAKPFIPD LKLWGKVQNA ARNRYRDPGL NISDAGAKVR QLVDEHIIST GVDPKIPPVD LMAANFKETV EQIKSPESRA SEIESAIKHH ITVNLDEDPE YYKSLSLRLR DIIEKTAGKW DQQLELLLQM VDSMESEHKQ AADDLGLSET EFAFYNILMA EVTSHAGEDA ISEAVHNEIK ATSQDLVGMF DEATQIVDFF SKPDEVKRMK KEIKRAVLDC SFGEKALVNV VQDRFMDLAK NKFK
|
| |
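The gene and protein sequences above can be compared by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon assignments from the compact NCBI-style string (bases ordered T, C, A, G), which translation table 11 shares with the standard code. Table 11 differs mainly in its permitted start codons, and this sketch does not handle alternative starts; that does not matter here because the gene begins with ATG. The name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; stop codons are returned as '*'."""
    # Drop the spaces and line breaks used to group the bases in tens.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 15 bases of the gene sequence shown above.
print(translate_cds("ATGTTTCAGG ATGAAGTCGA AAAAGTCGAA"))  # MFQDEVEKVE
print(translate_cds("AACAAGTTTA AATGA"))                  # NKFK*
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence, followed by the stop codon. Passing the full gene sequence and stripping the trailing `*` would allow the same comparison over all 1044 residues.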