Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Glov_3512 |
Symbol | |
ID | 6366442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010814 |
Strand | + |
Start bp | 3771066 |
End bp | 3773840 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642678929 |
Product | type III restriction protein res subunit |
Protein accession | YP_001953735 |
Protein GI | 189426558 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCTG AAGCGAAAGC ACGACAGGAA ATTGATCAAA AACTCGAACA AGCTGGCTGG GTTGTCCAGG ATCTGAAAAA GATCAACCTG GGTGCCGGTC CGGGTGTTGT AGTGCGTGAA TACCCAACTG ATACCGGACC AGCGGATTAT CTGCTGTTTG TGGATCGCCA GCCGGTCGGG GTAATCGAGG CCAAACCGGA TAACACCATA CTCACCTTTG TGGAAGACCA GACCGAACGC TACGCCCGCA GTACCCTTAA GTGGCGGATA ACCTCCACAC CACTGCGCTT CCTGTTTGAA AGCACCGGTC AGGTCATCCA TTTTACCGAC GGCGCTGACC CATCAGCCCG TGCCCGCGAG ATCTTCTATT TTTTCAAGCC GGAACAACTT GCCGAATGGC TGCAGCAGCC TGAATCATTC CGTCGTCGCC TCAAGGCACA GATGCCGGAA CTGCCGACCC GCAACCTGCG TGATTGCCAG ATCAGCGCGG TGACTGGGCT GGAGAAATCT CTTGCCCAGA ACAGGCCCCG CGCCTTGGTG CATATGGCAA CCGGCGCTGG CAAAACCTTC ACCGCCATTA CCTCGGTCTA CCGACTGCTC AAATATGGCG ATGCCAAGCG GATTCTTTTC CTGGTTGATA CCCGCAACCT GGGGAAACAG GCACATCAGG AGTTCATGGC CTACACCCCG CCCGACGATG CCCGCAAGTT TACGGAACTA TACAATGTAC AGCGCCTGAA TGGCCCCACC ATTGACCCGG CAGCCAAGGT CTGCATCAGC ACCATTCAGC GGATGTACTC CCTCCTTTCT GGTGAACCGA TTGATGAGTC TGCCGAGGAT GTGCCGCTGG ACCAGATTGT CCAAACCGAC AAGCAGGCAA AGGTTGTGAG GTACAACCCG GCTGTACCGG TGGAGACCTT TGATGTCATC ATTATTGATG AATGCCACCG CAGCATCTAC AACCTCTGGA AACAGGTGCT CGACTACTTC GATGCCTTCC TGGTGGGCCT CACCGCCACC CCTGACAAAC GCACCTTTGG CTTCTTCAAT GAAAATATCG TTGCCGAATA CACCTACGAG CAATCGGTGG CCGATGGCGT GAACGTGGGG TATGACGTCT TTGAGATTGA AACCGAGATC ACCAAAAAGG GGGCTGAACT GAAGGCCAAA GAATGGGTGG ATCACCGCGA CCGGCAGACC CGCAAGAAAC GCTGGGCCGA GGCAGAGGAA GATCTGCTCT ACACCGGCAA GGAACTGGAC AGGTCGGTGG TAAACATCAG CCAGATCCGC AAGGTGATCC AGGCCATGAA AACAGCGGTG GAGACGCAGA TATTCCCCAG CCGTAACGAG ACCCCCAAGA CCCTGATCTT TGCCAAGACC GACAGCCACG CCGACGACAT AATCCAGATC GTGCGCGAGG TCTACAACCA GGGAAACGCC TTCTGTAAAA AAGTCACCTA CAAGGCAGAA GAAGACCCGG ACAGCATCCT TGCCAGCTTC CGCAACGACT ATAACCCGCG CATTGCCGTT ACCGTGGACA TGATTGCCAC CGGCACTGAT GTAAAGCCTT TGGAAGTCTT GCTCTTCATG CGTGATGTCC GCTCTAAGGG GTATTATGAG CAGATGAAGG GACGCGGTGT CCGCAGTCTT GACCATGATG CCCTGAAACG GGTATCCAAC AGCGCTGACA GCGCCAAGAG CCGCTTTGTC CTGATTGATG CTGTGGGGGT GGAAAAATCC CTCAAGACCG AAAGCCGCCC GCTGGAGAAA AAGCCGTCAG TACCGCTGAA AGACCTGATG CTGGGTGTGG CCATGGGGCA CCGGGATGAA GATACCGTAC TCAGTCTGGC CAATCGTCTG GTGCGGCTGG CCAAGCAGTT GGATGACAAG GCCCTGGCGC GGATCGAAAA GAGATCCGGC GGGCTGACGG TTGGCGCTTT GGGCAAGACG CTGATTACCG CCATTGACCC TGATCGGGTT GTTGCAGCTG CCATTGCCAC TGCCAAGGAA AAAGGGATCA CCCGCACCGA AGAGACCCTG ACCGAAGAGG AGCTGACCGC TGCCCGCGCC CAATGCGTGG CTGCTGCCTG CGCCCCCTTT GACAGCCCGG AACTACGGGA TGAAATCGAA GCAGCCCGCC GGGAACAGGA GCAGGTCATT GACCATATCA ATCTGGATCA GGTCACCTTC TCCGGGTACA GCGCCCAGGC TGAAGAGCAG GCACAGAAGG TCATCCAGAG CTTTGCCGAC TACATTGCCC AACATAAGGA TGAAATCCAG GCGCTCAGTT TCTTCTACCA GCAACCGTAC CAACGCCGTA CCCTCACCTT TGAGATGATC GAAGAGCTGC ACGATGCCCT TTCCCGCCCG CCATTGATGC TGACCACCGA ACGGCTCTGG AACGCATACG CCCGGGTGAA AGGTTCGCAG GTAACCGGCG CTGATACAAA ACGGCAACTG ACTGATCTGG TGGCCCTGGT TCGTTTTGCC ATTGGGCTGG ATACGGAATT GAAGCCGTTC CGTGATCAGG TGGACAGGCG TTTTCAGGAA TGGATCTTCC GCCACAACGC CAAGCGCACC ACCGCCTTCA GCACAGAGCA GACCGAATGG CTGCGGCTGA TGAAGGATCA TATCGCTTCC AGTTGCTCCA TAGCCCGTGA CGATTTTGAG TATGCCGAGT TTGCCGCCAA AGGTGGCCTG CAGAGGGTGT GGGGGCTGTT TGGTACGGAG CTGGATGGGG TGATGACGGA GATGAATGAG GAGCTGGTGG CGTAA
|
Protein sequence | MTPEAKARQE IDQKLEQAGW VVQDLKKINL GAGPGVVVRE YPTDTGPADY LLFVDRQPVG VIEAKPDNTI LTFVEDQTER YARSTLKWRI TSTPLRFLFE STGQVIHFTD GADPSARARE IFYFFKPEQL AEWLQQPESF RRRLKAQMPE LPTRNLRDCQ ISAVTGLEKS LAQNRPRALV HMATGAGKTF TAITSVYRLL KYGDAKRILF LVDTRNLGKQ AHQEFMAYTP PDDARKFTEL YNVQRLNGPT IDPAAKVCIS TIQRMYSLLS GEPIDESAED VPLDQIVQTD KQAKVVRYNP AVPVETFDVI IIDECHRSIY NLWKQVLDYF DAFLVGLTAT PDKRTFGFFN ENIVAEYTYE QSVADGVNVG YDVFEIETEI TKKGAELKAK EWVDHRDRQT RKKRWAEAEE DLLYTGKELD RSVVNISQIR KVIQAMKTAV ETQIFPSRNE TPKTLIFAKT DSHADDIIQI VREVYNQGNA FCKKVTYKAE EDPDSILASF RNDYNPRIAV TVDMIATGTD VKPLEVLLFM RDVRSKGYYE QMKGRGVRSL DHDALKRVSN SADSAKSRFV LIDAVGVEKS LKTESRPLEK KPSVPLKDLM LGVAMGHRDE DTVLSLANRL VRLAKQLDDK ALARIEKRSG GLTVGALGKT LITAIDPDRV VAAAIATAKE KGITRTEETL TEEELTAARA QCVAAACAPF DSPELRDEIE AARREQEQVI DHINLDQVTF SGYSAQAEEQ AQKVIQSFAD YIAQHKDEIQ ALSFFYQQPY QRRTLTFEMI EELHDALSRP PLMLTTERLW NAYARVKGSQ VTGADTKRQL TDLVALVRFA IGLDTELKPF RDQVDRRFQE WIFRHNAKRT TAFSTEQTEW LRLMKDHIAS SCSIARDDFE YAEFAAKGGL QRVWGLFGTE LDGVMTEMNE ELVA
|
| |