Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1779 |
Symbol | |
ID | 7317589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1892757 |
End bp | 1895156 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643616671 |
Product | type I restriction-modification system specificity subunit |
Protein accession | YP_002513848 |
Protein GI | 220934949 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTCA AGAAATCCGA ACTCTATTCC TCCCTCTGGT CCGGCTGCGA CGAGCTGCGC GGTGGTATGG ACGCCAGCCA GTACAAGGAC TACGTCCTCG TCCTGCTGTT CATTAAGTAC GTCAGCGACA AGTACGCCGG GCAGCCCTAC GCGCCCATCA CCATCCCCCC AGGGGCGAGC TTCAAGGACA TGGCCGCCCT CAAGGGCAAG CCCGACATCG GCGACCAGAT CAACAAGAAG ATCATCGCCC CCCTGGCCAA TGCCAACCAG CTGTCCGAGA TGCCGGACTT CAACGACCCC AACAAGCTGG GCAGCGGCAA GGAGATGGTG GACCGGCTCA CCAACCTGAT CGCCATCTTC GAAGACAAGC GCCTGGACTT CTCCAAAAAC CGCGCCGACG GCGACGACAT CCTGGGCGAT GCCTACGAAT ACCTCATGCG CCACTTCGCC ACCGAGAGTG GCAAGAGCAA GGGACAGTTC TACACTCCCG CCGAGGTGAG CCGGGTGATG GCCCAGATCC TGGGCATCCG CAACGCCAGC ACCAGCGCCG ACACCACGGT CTACGACCCC ACCTGCGGCT CCGGATCCCT GCTGCTCAAG GTGGCGGACG AGGCGGGCAC CGATGTCACC CTCTACGGGC AGGAGAAGGA CGCCGCCACC AGCGGCCTGG CCCGCATGAA CATGATCCTG CACAACAACC CCACCGCGCT GATCATGCAG GGCAACACCC TGGCCGACCC CAAGTTCCTG GACGGCCAGA GCCTCAAGAC CTTCGACTAC GTGGTGGCCA ACCCGCCGTT CTCCGACAAG CGCTGGAGCA CCGGCCTCGA CCCGGCCAGC GACCCCCACG AGCGCTTCAA GCACTACGGC ATCCCCCCGG ACAAGCAGGG TGACTACGCC TACCTGCTGC ACATCCTGCG CAGCCTCAAG AGCACCGGCC GGGGCGCCTG CATCCTGCCC CACGGCGTGC TGTTCCGGGG CAACGCCGAG GCCGACATCC GCCGCAACCT GGTGCGCAAG GGCTACATCA AGGGCATCAT CGGCCTGCCG CCCAATCTCT TCTACGGCAC CGGCATCCCC GCCTGCATCG TGGTGGTGGA CAAGGCCGAG GCCCACGGGC GCGACGGCAT CTTCATGATC GACGCCAGCG GCGGCTTCAT GAAGGACGGC CCCAAGAACC GCCTGCGCAG CCAGGACATC CACAAGATCG TGGACGTGTT CACCAAACAG GCCGAGCTGC CCGGCTACGC GCGCCGGGTG CCTCTCGCCG AGATCGAGAA GAACGACTAC AACCTCAACC TGCCGCGCTA CATCGACAGC CAGCAGGCGG AAGACCGTCA GGATATCGAG GGCCACCTGC GTGGCGGCAT CCCCGAGGCT GACGTGGACG CCCTGCAAAG CTACTGGGAC GTCTGCCCCG GCCTGCGCCA GGCTCTGTTC CGCCCCAACC GCCCCGGCTA TCTCGATCTG GCCGTGGACA AGGCCGACAT CCGTGCCACC ATCTACGGAC ACCCGGAATT CACCGCCTTC ATCGACGCCA TGAACGCCCA CTTCGAGGCC TGGCGCGCGC GTGCCGTGCC CGGCCTCAAG GCCCTGGCCG CGGGCTGCCA TCCCAAGGAG GTGATCCACA CCCTCTCCGA GGACCTGCTG GCCCACTACA AGGGCCGCCC CCTCATCAAC CCCTACGACG TCTACCAGCA CCTCATGGAC TACTGGGCCG GGACCATGCA GGACGACGCC TACCAGATCG CCGCCGAGGG CTGGAAGGCC GAGACCGCGC GCATCATCGA GAAGGACAAG AAAGGCAAGG AGAAGGACAA GGGCTGGACC TGCGACTTCC TGCCCAAGGC CCTCGTGGTG GCCCGCTACT TCCCCGACCA GCAGGCGGCC ATCGACCGGC TCGCCGCCGA CCTGGAGGGC GTGAGCGCCC GCCTGGCCGA GCTGGAGGAG GAACACGGTG GCGAGGAGGG CGCTTTCGCG GAGCTGGACA AGGTCAACAA GGGCAACGTC AACGCCCGCC TTAGGGAGAT CCGGGATGAC CCGGAGAGCA AGGACGAAGC CAAGGTGCTC AAGGAATGGC TCGCCCTGAG CAAACAGGAA TCGGACCTCA AGAAAGACCT CAAGGACGCC GAGGCCGAGC TGGATGCCGC CGCCTATGCC CAGTACCCCA AACTGACCGA GGACGAGATC AAGACCCTGG TGGTGGACGA CAAGTGGATT GCCGCCCTGG ACAGGGACGT CCACGGTGAA ATGGACCGCA TCAGCCAGTC ACTCACCAAA CGGGTGCGCG AACTGGCCGA GCGTTACGAC ACGCCTCTGC CGGAGCTCAC CGCCCGGGTG GCCGGGCTGG AGGCGCGGGT CAACGGACAC CTGGAAAGGA TGGGCTTCGC ATGGAAGTGA
|
Protein sequence | MALKKSELYS SLWSGCDELR GGMDASQYKD YVLVLLFIKY VSDKYAGQPY APITIPPGAS FKDMAALKGK PDIGDQINKK IIAPLANANQ LSEMPDFNDP NKLGSGKEMV DRLTNLIAIF EDKRLDFSKN RADGDDILGD AYEYLMRHFA TESGKSKGQF YTPAEVSRVM AQILGIRNAS TSADTTVYDP TCGSGSLLLK VADEAGTDVT LYGQEKDAAT SGLARMNMIL HNNPTALIMQ GNTLADPKFL DGQSLKTFDY VVANPPFSDK RWSTGLDPAS DPHERFKHYG IPPDKQGDYA YLLHILRSLK STGRGACILP HGVLFRGNAE ADIRRNLVRK GYIKGIIGLP PNLFYGTGIP ACIVVVDKAE AHGRDGIFMI DASGGFMKDG PKNRLRSQDI HKIVDVFTKQ AELPGYARRV PLAEIEKNDY NLNLPRYIDS QQAEDRQDIE GHLRGGIPEA DVDALQSYWD VCPGLRQALF RPNRPGYLDL AVDKADIRAT IYGHPEFTAF IDAMNAHFEA WRARAVPGLK ALAAGCHPKE VIHTLSEDLL AHYKGRPLIN PYDVYQHLMD YWAGTMQDDA YQIAAEGWKA ETARIIEKDK KGKEKDKGWT CDFLPKALVV ARYFPDQQAA IDRLAADLEG VSARLAELEE EHGGEEGAFA ELDKVNKGNV NARLREIRDD PESKDEAKVL KEWLALSKQE SDLKKDLKDA EAELDAAAYA QYPKLTEDEI KTLVVDDKWI AALDRDVHGE MDRISQSLTK RVRELAERYD TPLPELTARV AGLEARVNGH LERMGFAWK
|
| |