Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1775 |
Symbol | |
ID | 7317585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1886648 |
End bp | 1889773 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643616667 |
Product | type I restriction-modification system restriction subunit |
Protein accession | YP_002513844 |
Protein GI | 220934945 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCGA TCGGACAGCC TGAACGCGTG ACCCAGAACC GGGTCGTCCA GCTCTTTCGC GATGAGCTGG GCTACGACTA CCTGGGCGAC TGGGGCGAGC GCGAGGGCAA CAGCAACATC GAGGAGGGGC TGCTGACGCA GTGGCTCGCC GGGCGGGGTT ACAGCCAGGC CCAGATCTCC CGTGCCATCC ACAAGCTGCG CGCCGAGGCG AACGATCACG GGCGCGAGCT CTACGAGAAC AACAAGCACA TCTACGGCCT GCTGCGCTAC GGCGTGGCGG TGAAGACCGA GGCGGGGCAG CACACGCACA CGGTGCACCT GATCGACTGG CAGTCCCCGC AGAATAACCA CTTCGCCATT GCCGAGGAGG TTACCCTCAA GGGCAAGCTG GAACGGCGTC CCGATCTGGT GTTGTACGTG AACGGCATCG CAGTGGGTGT GCTGGAACTC AAGAACAGCC GGGTGTCCCT CGGCGACGGC ATCCGCCAGA GCCTGTCCAA CCAGCAGCCC GAATTCAACG CCTGGTTCTT CAGCACCGTG CAGTTCATCT TCGCGGGCAA CGATTCCGAG GGGCTGCAAT ACGGCACCAT CGGCACGCCG GAGAAGTTCT TCCTCAAGTG GAAGGAGGAC GAGGCGGACA ACAGCCGCTA CAAGCTGGAC AAGTACCTGC TCAAGATGTG CGACAAGCAG CGCCTGATCG AGCTGATGCA CGATTTCGTG CTGTTCGACG GGGGCGTGAA GAAACTGCCC CGGGTGCATC AGTACTTCGG CATCAAGGCG GCGCAGACAT ACGTGCGCCG GGGCGAGGGC GGCATCATCT GGAACACCCA GGGCAGCGGC AAGAGCATCG TCATGGTGCT GCTGGCCAAG TGGATCCTGG AGAACAACCC CGATGCCCGG GTGGTGGTGA TCACCGACCG GGACGAGCTG GACGCCCAGA TCGAGCGGGT CTTCCGGGAC GCGGGGGAGA GCATCTACCG CACCAGCAGC GGGCGTGACC TGGTGAACCA ACTGGGCCAG GCCAGGCCCA GGCTGTTGTG CTCTCTGGTG CACAAGTTCG GCGCCCGGGG CGTGGAGGAC TTCGACGCCT TCATCCGCGA GCTGAAAGCG CAGCCCAGCC CCACGGTGGG CGAGGTATTC GTGTTCGTGG ACGAATGCCA CCGCACCCAG GGCGGCAAGC TGAACCAGGT GATGAAGGCC ATGATGCCCG ACGCGGTGTT CATCGGCTTC ACCGGCACGC CGCTGCTGAA GCGGGACAAG GCCACCAGTC TGGAGGTCTT TGGCGGCTAC ATCCACACCT ACAAGTTCAG CGAGGCGGTG GAGGACCAGG TGGTCCTGGA TCTGGTCTAC GAGGCCCGGG ATATCGACCA GAAGCTTGGC TCCCAGGCCA AGATCGACGA ATGGTTTGCC GCCAAGACCA AGGGTCTGAA CCCGTGGCAG CGGGATGAGC TGATGAAGAA GTGGGGCACC ATGCAGCAGG TGCTTAGCTC CAAGTCCCGT ATGAGCCGGG TGGTGGCCGA CATCCTGTTC GATTTCGGCG TGAAGCCACG GCTGTCCAGC GAGCGGGGCA ACGCCATCCT GGTGGCCTCC AGCATCTACG AGGCCTGCAA GTACTTCTCC CTGTTTCAGC AGACCAGCCT CAAGGGCAAG TGCGCGGTGG TGACCTCCTA CAACCCCATG GCCAATCACG TGACCCTGGA GGAGACCGGC GCCAACACCG AGTCCGACAA GCAGTTCATC TACAGCATGT ACACGGAACT GCTCAAGGAC GTGGACGCCC AGCCGGGCAA GACGAAGACC GAGACCTACG AGGCCTGGGC CAAGGACCGG TTCACCCGGG AACCGGCCAA CATGAAGCTG CTGGTGGTGG TGGACAAGCT GCTGACCGGC TTCGACGCGC CCCCCTGCAC CTATCTCTAC ATCGACAAGT CCATGCAGGA CCACGGCCTG TTCCAGGCCA TCTGCCGCAC CAACCGCCTG GACGGCGAGG ACAAGGACTT CGGCTACATC GTCGATTACA AAGACCTGTT CAAGAAGGTG GAGAACGCCA TCGCCGTGTA CAGCTCCGAG CTGGACCACA GCGCCCCCGG CGCCGATCCC GAGGTGATGA TCCAGGACCG CCTCACCAGG GGCCGGGAAA AGCTGGACGA GGCCCTGGAG GCCCTGGTCC TGCTGTGCGA GCCGGTGGAG CCCCCCAGGG GCGAGCTGGA GCACATCCAT TACTTCTGCG GCAACACCGA GATCCCCGAA GACCTGGCCG AGCGCGTGCC GCTGAGAACG GCCCTCTACA AGGGTGTCGC CACCCTGATG CGCACCTATG CCGGGCTTGC CGACGAGATG GAGCCCGCGG GTTACTCGGC CAAGGAGGCA GCAGAGATCA AGCGCAGGCT GGAGGATTAT CTCAAGCTAC GGGACATCAT CCGCAACGCC AGCGGCGAGA CCCTGGACCT GAAGGCCTAC GAGGCCGACA TGCGGCACCT GATCGACACC TACATCGAGG CGGACGAACC CCGCAAGATC TCCGCCTTCG AGAACATCGG CCTGCTGGAC CTGATCGTGA AGACCGGCGT CGCCGACGCC ATCAATCAGC AGCTCGGCAA CCTGAAGGGC AGCAGGGACG CCATCGCCGA GACCATCGAG AACAACGTCC GCAGCAAGAT CCTCAAGGAA CACCTGACCG ACCCGGCCTA CTTCGAGAAG ATGTCCGCCC TGCTCGCCGA GATCATCGAG GCCCGCAAGG CCAAGGCCCT GGAGTACGAG GAATACCTCA GGCAGATCGC CGAGCTGGTG AAGAAGGTGG CCGAAGGCAA GGATGAGGAC GTCCCGCCCG CGCTGGATAC ACCGGGCAAG CGGGCGCTGT ACAACAACCT GATGCTGCGG GTGCCTGCGG CTCAGCCAGC GGGGCAGGTG TTGGAGGTCA GGCCGAGCTG CCCGGAACTG ACCCAGGAGG AGGCCCTGGA GCTGGCCCTG AAGCTGGATG AGGTGGTGCG ACAAGTCAGG CCGGACGGCT GGCGCGGTTT ACAGCCACGG GAGAACACCA TCAAGCGCGC GCTCTATGAG GTGCTTAAGG ATCCCGATGA GGTGGATCGG CTGTTCAGCA TCATCAAGGC CCAGGCGGAG TATTGA
|
Protein sequence | MSSIGQPERV TQNRVVQLFR DELGYDYLGD WGEREGNSNI EEGLLTQWLA GRGYSQAQIS RAIHKLRAEA NDHGRELYEN NKHIYGLLRY GVAVKTEAGQ HTHTVHLIDW QSPQNNHFAI AEEVTLKGKL ERRPDLVLYV NGIAVGVLEL KNSRVSLGDG IRQSLSNQQP EFNAWFFSTV QFIFAGNDSE GLQYGTIGTP EKFFLKWKED EADNSRYKLD KYLLKMCDKQ RLIELMHDFV LFDGGVKKLP RVHQYFGIKA AQTYVRRGEG GIIWNTQGSG KSIVMVLLAK WILENNPDAR VVVITDRDEL DAQIERVFRD AGESIYRTSS GRDLVNQLGQ ARPRLLCSLV HKFGARGVED FDAFIRELKA QPSPTVGEVF VFVDECHRTQ GGKLNQVMKA MMPDAVFIGF TGTPLLKRDK ATSLEVFGGY IHTYKFSEAV EDQVVLDLVY EARDIDQKLG SQAKIDEWFA AKTKGLNPWQ RDELMKKWGT MQQVLSSKSR MSRVVADILF DFGVKPRLSS ERGNAILVAS SIYEACKYFS LFQQTSLKGK CAVVTSYNPM ANHVTLEETG ANTESDKQFI YSMYTELLKD VDAQPGKTKT ETYEAWAKDR FTREPANMKL LVVVDKLLTG FDAPPCTYLY IDKSMQDHGL FQAICRTNRL DGEDKDFGYI VDYKDLFKKV ENAIAVYSSE LDHSAPGADP EVMIQDRLTR GREKLDEALE ALVLLCEPVE PPRGELEHIH YFCGNTEIPE DLAERVPLRT ALYKGVATLM RTYAGLADEM EPAGYSAKEA AEIKRRLEDY LKLRDIIRNA SGETLDLKAY EADMRHLIDT YIEADEPRKI SAFENIGLLD LIVKTGVADA INQQLGNLKG SRDAIAETIE NNVRSKILKE HLTDPAYFEK MSALLAEIIE ARKAKALEYE EYLRQIAELV KKVAEGKDED VPPALDTPGK RALYNNLMLR VPAAQPAGQV LEVRPSCPEL TQEEALELAL KLDEVVRQVR PDGWRGLQPR ENTIKRALYE VLKDPDEVDR LFSIIKAQAE Y
|
| |