Gene Information |
Locus tag | Glov_0843 |
Symbol | |
ID | 6367302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010814 |
Strand | - |
Start bp | 866713 |
End bp | 869694 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642676238 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_001951087 |
Protein GI | 189423910 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
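The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it agrees with the reported gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 866713
end_bp = 869694
reported_gene_length_bp = 2982
reported_protein_length_aa = 993


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length_bp)        # True
print(computed_protein_length == reported_protein_length_aa)  # True
```

Both checks pass for this record: 869694 - 866713 + 1 = 2982 bp, and 2982 / 3 - 1 = 993 aa.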
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTCACCG AATCCAACAC CGTCGAAGCC TACCTCCGCG ACCTGCTGAC TGGTTCTGTC ATAGATACTA CACAAGGTGT AGTCAGTGAA CCGGAGCCGG TATACATCGT CGGCCGGGCA ACCAAAGGTG TCGGCTGGCG GTATGTCTCC CCCCAGAAGC TCACCCGCCA GAGTAATGAA GTTTTCGTTG AATCCTACCT CCGTGACGCC CTCATCCATC TAAATCCCGA AATCAAAGCT CAACCCGACC GCGCTGATGA AGTGCTCTAC AAACTGCGGG CCATCGTGCT TTCCGTCCGC TCCGACGGCC TGATCCGCGC CAACGAGGAG ATGACCGCCT GGCTGCGGGG CGAGCGCACT ATGCCGTTCG GCCAGAACAA TGAGCATGTG CCGGTGCGGT TGATCGATTT CGAGAATCTG GACAACAACC AGTATGTAGT GACGCAGCAA TATACATACC GGGCCGGGCA GATCGAAAAA CGGGCCGACT TGATGCTGCT GGTCAACGGT CTGCCGTTGG TGCTGATCGA AGCCAAGACC CCGGTGCGGA AGGCAACCAG TTGGGTGGAC GGGGCGCTGC AGGTCCATGA GGATTATGAG AAGTTCGTGC CGGAGCTGTT CGTCTGCAAT GTGTTCTCGG TCGCCAGCGA AGGGAAAGAA TTCCGTTACG GTGCCATTGG TGTGCCGGTC AAGGATTGGG GGCCGTGGAA CCTGGATGAC ACTACAACCA ACACCCAGCA TCATCCGCTG CACTCCCTCA AGTCGGCTGT CGAAAGCATG CTCCGTCCCC ATATCGTGCT CGATATTCTG GCCAACTTCA CCCTGTTCGC AACCCACAAG AAAAAGCAGC GCAGCAAGAT CATCTGCCGT TACCAGCAGT ACGAGGCCGC CAATCGCATT GTCGAGCGTG TGCTGGCTGG GGCTCCCCGC AAAGGGCTGA TCTGGCATTT CCAGGGGTCC GGCAAGTCGC TGCTGATGGT CTTTGCCGCC CAGAAACTGC GCCTGCATCC CAAGCTGAAG AATCCGACCG TGCTGATCGT GGTTGACCGC ATCGACCTCG ATACCCAGAT CACCGGCACC TTCACCGCCG CCGACATCCC GAACCTGGAA AAGGCCGAGT CCCGCGAGAA GCTGCAAAAG CTGCTGGGGC AGGATGTCCG CAAGATCATC ATTACCACCA TCTTCAAGTT CGGCGAGGCG GATGGCGTCC TGAACGAACG GTCAAACATC ATTGCCCTGG TGGATGAGGC CCACCGCACC CAGGAAGGTG ACCTGGGGCG CAAGATGCGC CAGGCTCTTC CCAATGCGTT TCTCTTCGGC CTCACCGGCA CCCCGATCAA CCGGAGCGAC CGCAATACCT TCTATGCCTT TGGCGCCGAG GAAGACACCG GCGGCTACAT GAGCCGCTAT GGTTTCGAGG AGTCGATCCG GGACGGCGCC ACCCTGCCGC TGCATTTCGA GCCCCGGCTG GTGGAGCTGC ATATCGACAA AGAGGCCATC GACGAAGCCT ATGCCGAACT GACCGGCGAC CTGTCCGACC TGGACCGGGA CAACCTCGCC CGGGCAGCAT CGAAGATGGC CGTGCTGGTC AAGACGCCGG AGCGGGTCCA GCGGATCTGC GCCGATATCG TGCAGCACTT CCAGTCCAAA ATCGAACCGA ACGGTTTCAA GGGGCAGATC GTTACCTTTG ACCGGGAGTG CTGCCTGCTC TACAAACAGG AAATCGACAA GCTCCTGCCA ACCGAGGTGA GCGAGATTGT CATGACGGTC AACAGCAATG AGCCGCAGTA CAAAGCCTAT 
GCCCGCACCC GTGACGAAGA GGAGCGGCTG CTGGAGCGGT TTCGTGATCC CAATGACCCG CTGCAGCTCA TCATCGTCAC TTCAAAACTG CTGACCGGCT TTGATGCCCC CATTCTCCAG GCGATGTATC TGGACAAGCC GATGAAGGAC CACACGCTGC TGCAGGCCAT CTGCCGGGTC AACCGCACCT ACGCCGACAC CAAGACCCAC GGCCTGATTG TCGATTATCT GGGCGTGTTC GACGATGTGG CCAAGGCGTT GGAGTTTGAC GACAAGAACA TTCTCAAGGT GGTCTCCAAC ATCCAGGAGT TGAAGAATCA GCTCCCGGAT GCCATGCAGC GCTGCCTGAC TTTCTTCATC GGCGTGGATC GCACCCTGCA GGGATATGAA GGTCTGATCG CCGCGCAGCA GTGCCTGCCG AACAACACCG TGCGCGATAA CTTCGCTGCC GAATTCAGCG TCCTCAACAA GCTGTGGGAG GCGATTTCCC CGGACCCGAT GCTGAATCAG TACGAGACTG ATTACCGCTG GCTCTCCCAG GTGTACCAAT CAGTGCAGCC GTCCAGCGGT CACGGAAAGC TGATCTGGCA TTCCCTTGGC GCCAAGACCA TCGAACTGAT CCACCAGAAC GTCCACGTGG AAGCTCTGCG GGATGATCTG GACACGCTGA TCCTCGATGC CGACCTGCTG GAAGCGGTGC TGTCCAACCC GGACCCAAAA AAGGTGAAGG AGATTGAGAT CAAGATCTCT CGCCGACTGC GCAAGCACAT GGGTAATCCG AAATTCAAGG CGCTGTCGGA ACGGCTGGAG GAATTGCGGA ACCGGCAGGA ACAGGGGCTC ATTACCAGTG TCGAATTCCT GAAACAGCTG CTGCAACTGG CGAAAGACCT GCTGCAGGCG GAAAAGGAGA CGCCGCCGGA AGAGGATGAG GATCGCGGCA AGGCGGCGTT GACCGAACTG TTTCAGGATG TGAAGACCGA GGAAACCCCG GTCATGGTGG AACGGATCGT TGCCGATATT GATGAGATTG TCCGGCTGGT ACGGTTCCCC GGCTGGCAGC ATACTCTGGC GGGTGAACGC GAGGTCAAGA AGGCCTTGCG GAAATCACTC TTCAAATATC GGTTGCACCA GGACGAGGAA CTTTTCGAAA AATCGTATGG GTATATCCGG CAGTATTACT GA
|
Protein sequence | MFTESNTVEA YLRDLLTGSV IDTTQGVVSE PEPVYIVGRA TKGVGWRYVS PQKLTRQSNE VFVESYLRDA LIHLNPEIKA QPDRADEVLY KLRAIVLSVR SDGLIRANEE MTAWLRGERT MPFGQNNEHV PVRLIDFENL DNNQYVVTQQ YTYRAGQIEK RADLMLLVNG LPLVLIEAKT PVRKATSWVD GALQVHEDYE KFVPELFVCN VFSVASEGKE FRYGAIGVPV KDWGPWNLDD TTTNTQHHPL HSLKSAVESM LRPHIVLDIL ANFTLFATHK KKQRSKIICR YQQYEAANRI VERVLAGAPR KGLIWHFQGS GKSLLMVFAA QKLRLHPKLK NPTVLIVVDR IDLDTQITGT FTAADIPNLE KAESREKLQK LLGQDVRKII ITTIFKFGEA DGVLNERSNI IALVDEAHRT QEGDLGRKMR QALPNAFLFG LTGTPINRSD RNTFYAFGAE EDTGGYMSRY GFEESIRDGA TLPLHFEPRL VELHIDKEAI DEAYAELTGD LSDLDRDNLA RAASKMAVLV KTPERVQRIC ADIVQHFQSK IEPNGFKGQI VTFDRECCLL YKQEIDKLLP TEVSEIVMTV NSNEPQYKAY ARTRDEEERL LERFRDPNDP LQLIIVTSKL LTGFDAPILQ AMYLDKPMKD HTLLQAICRV NRTYADTKTH GLIVDYLGVF DDVAKALEFD DKNILKVVSN IQELKNQLPD AMQRCLTFFI GVDRTLQGYE GLIAAQQCLP NNTVRDNFAA EFSVLNKLWE AISPDPMLNQ YETDYRWLSQ VYQSVQPSSG HGKLIWHSLG AKTIELIHQN VHVEALRDDL DTLILDADLL EAVLSNPDPK KVKEIEIKIS RRLRKHMGNP KFKALSERLE ELRNRQEQGL ITSVEFLKQL LQLAKDLLQA EKETPPEEDE DRGKAALTEL FQDVKTEETP VMVERIVADI DEIVRLVRFP GWQHTLAGER EVKKALRKSL FKYRLHQDEE LFEKSYGYIR QYY
|
| |
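The gene sequence starts with TTG while the protein sequence starts with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The following is a minimal sketch of that behaviour, not the database's own translation pipeline: `translate_cds` is a hypothetical helper, and it expects the sequence on the coding strand with the spaces between 10-base blocks removed.

```python
from itertools import product

# Codon table shared by translation tables 1 and 11, in the usual TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    A table 11 start codon in the first position is emitted as 'M'.
    Hypothetical helper for illustration only.
    """
    seq = seq.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in TABLE_11_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTTCACCG AATCCAACAC CGTCGAAGCC"))  # MFTESNTVEA
```

The output matches the first ten residues of the protein sequence listed above.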