Gene Tgr7_1775 details

Gene Information

Locus tag: Tgr7_1775
Symbol:
ID: 7317585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. HL-EbGR7
Kingdom: Bacteria
Replicon accession: NC_011901
Strand:
Start bp: 1886648
End bp: 1889773
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table: 11
GC content: 63%
IMG OID: 643616667
Product: type I restriction-modification system restriction subunit
Protein accession: YP_002513844
Protein GI: 220934945
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
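
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1886648
END_BP = 1889773


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 3126 1041
```

Both results match the Gene Length (3126 bp) and Protein Length (1041 aa) fields.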


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCGA TCGGACAGCC TGAACGCGTG ACCCAGAACC GGGTCGTCCA GCTCTTTCGC 
GATGAGCTGG GCTACGACTA CCTGGGCGAC TGGGGCGAGC GCGAGGGCAA CAGCAACATC
GAGGAGGGGC TGCTGACGCA GTGGCTCGCC GGGCGGGGTT ACAGCCAGGC CCAGATCTCC
CGTGCCATCC ACAAGCTGCG CGCCGAGGCG AACGATCACG GGCGCGAGCT CTACGAGAAC
AACAAGCACA TCTACGGCCT GCTGCGCTAC GGCGTGGCGG TGAAGACCGA GGCGGGGCAG
CACACGCACA CGGTGCACCT GATCGACTGG CAGTCCCCGC AGAATAACCA CTTCGCCATT
GCCGAGGAGG TTACCCTCAA GGGCAAGCTG GAACGGCGTC CCGATCTGGT GTTGTACGTG
AACGGCATCG CAGTGGGTGT GCTGGAACTC AAGAACAGCC GGGTGTCCCT CGGCGACGGC
ATCCGCCAGA GCCTGTCCAA CCAGCAGCCC GAATTCAACG CCTGGTTCTT CAGCACCGTG
CAGTTCATCT TCGCGGGCAA CGATTCCGAG GGGCTGCAAT ACGGCACCAT CGGCACGCCG
GAGAAGTTCT TCCTCAAGTG GAAGGAGGAC GAGGCGGACA ACAGCCGCTA CAAGCTGGAC
AAGTACCTGC TCAAGATGTG CGACAAGCAG CGCCTGATCG AGCTGATGCA CGATTTCGTG
CTGTTCGACG GGGGCGTGAA GAAACTGCCC CGGGTGCATC AGTACTTCGG CATCAAGGCG
GCGCAGACAT ACGTGCGCCG GGGCGAGGGC GGCATCATCT GGAACACCCA GGGCAGCGGC
AAGAGCATCG TCATGGTGCT GCTGGCCAAG TGGATCCTGG AGAACAACCC CGATGCCCGG
GTGGTGGTGA TCACCGACCG GGACGAGCTG GACGCCCAGA TCGAGCGGGT CTTCCGGGAC
GCGGGGGAGA GCATCTACCG CACCAGCAGC GGGCGTGACC TGGTGAACCA ACTGGGCCAG
GCCAGGCCCA GGCTGTTGTG CTCTCTGGTG CACAAGTTCG GCGCCCGGGG CGTGGAGGAC
TTCGACGCCT TCATCCGCGA GCTGAAAGCG CAGCCCAGCC CCACGGTGGG CGAGGTATTC
GTGTTCGTGG ACGAATGCCA CCGCACCCAG GGCGGCAAGC TGAACCAGGT GATGAAGGCC
ATGATGCCCG ACGCGGTGTT CATCGGCTTC ACCGGCACGC CGCTGCTGAA GCGGGACAAG
GCCACCAGTC TGGAGGTCTT TGGCGGCTAC ATCCACACCT ACAAGTTCAG CGAGGCGGTG
GAGGACCAGG TGGTCCTGGA TCTGGTCTAC GAGGCCCGGG ATATCGACCA GAAGCTTGGC
TCCCAGGCCA AGATCGACGA ATGGTTTGCC GCCAAGACCA AGGGTCTGAA CCCGTGGCAG
CGGGATGAGC TGATGAAGAA GTGGGGCACC ATGCAGCAGG TGCTTAGCTC CAAGTCCCGT
ATGAGCCGGG TGGTGGCCGA CATCCTGTTC GATTTCGGCG TGAAGCCACG GCTGTCCAGC
GAGCGGGGCA ACGCCATCCT GGTGGCCTCC AGCATCTACG AGGCCTGCAA GTACTTCTCC
CTGTTTCAGC AGACCAGCCT CAAGGGCAAG TGCGCGGTGG TGACCTCCTA CAACCCCATG
GCCAATCACG TGACCCTGGA GGAGACCGGC GCCAACACCG AGTCCGACAA GCAGTTCATC
TACAGCATGT ACACGGAACT GCTCAAGGAC GTGGACGCCC AGCCGGGCAA GACGAAGACC
GAGACCTACG AGGCCTGGGC CAAGGACCGG TTCACCCGGG AACCGGCCAA CATGAAGCTG
CTGGTGGTGG TGGACAAGCT GCTGACCGGC TTCGACGCGC CCCCCTGCAC CTATCTCTAC
ATCGACAAGT CCATGCAGGA CCACGGCCTG TTCCAGGCCA TCTGCCGCAC CAACCGCCTG
GACGGCGAGG ACAAGGACTT CGGCTACATC GTCGATTACA AAGACCTGTT CAAGAAGGTG
GAGAACGCCA TCGCCGTGTA CAGCTCCGAG CTGGACCACA GCGCCCCCGG CGCCGATCCC
GAGGTGATGA TCCAGGACCG CCTCACCAGG GGCCGGGAAA AGCTGGACGA GGCCCTGGAG
GCCCTGGTCC TGCTGTGCGA GCCGGTGGAG CCCCCCAGGG GCGAGCTGGA GCACATCCAT
TACTTCTGCG GCAACACCGA GATCCCCGAA GACCTGGCCG AGCGCGTGCC GCTGAGAACG
GCCCTCTACA AGGGTGTCGC CACCCTGATG CGCACCTATG CCGGGCTTGC CGACGAGATG
GAGCCCGCGG GTTACTCGGC CAAGGAGGCA GCAGAGATCA AGCGCAGGCT GGAGGATTAT
CTCAAGCTAC GGGACATCAT CCGCAACGCC AGCGGCGAGA CCCTGGACCT GAAGGCCTAC
GAGGCCGACA TGCGGCACCT GATCGACACC TACATCGAGG CGGACGAACC CCGCAAGATC
TCCGCCTTCG AGAACATCGG CCTGCTGGAC CTGATCGTGA AGACCGGCGT CGCCGACGCC
ATCAATCAGC AGCTCGGCAA CCTGAAGGGC AGCAGGGACG CCATCGCCGA GACCATCGAG
AACAACGTCC GCAGCAAGAT CCTCAAGGAA CACCTGACCG ACCCGGCCTA CTTCGAGAAG
ATGTCCGCCC TGCTCGCCGA GATCATCGAG GCCCGCAAGG CCAAGGCCCT GGAGTACGAG
GAATACCTCA GGCAGATCGC CGAGCTGGTG AAGAAGGTGG CCGAAGGCAA GGATGAGGAC
GTCCCGCCCG CGCTGGATAC ACCGGGCAAG CGGGCGCTGT ACAACAACCT GATGCTGCGG
GTGCCTGCGG CTCAGCCAGC GGGGCAGGTG TTGGAGGTCA GGCCGAGCTG CCCGGAACTG
ACCCAGGAGG AGGCCCTGGA GCTGGCCCTG AAGCTGGATG AGGTGGTGCG ACAAGTCAGG
CCGGACGGCT GGCGCGGTTT ACAGCCACGG GAGAACACCA TCAAGCGCGC GCTCTATGAG
GTGCTTAAGG ATCCCGATGA GGTGGATCGG CTGTTCAGCA TCATCAAGGC CCAGGCGGAG
TATTGA
 
Protein sequence
MSSIGQPERV TQNRVVQLFR DELGYDYLGD WGEREGNSNI EEGLLTQWLA GRGYSQAQIS 
RAIHKLRAEA NDHGRELYEN NKHIYGLLRY GVAVKTEAGQ HTHTVHLIDW QSPQNNHFAI
AEEVTLKGKL ERRPDLVLYV NGIAVGVLEL KNSRVSLGDG IRQSLSNQQP EFNAWFFSTV
QFIFAGNDSE GLQYGTIGTP EKFFLKWKED EADNSRYKLD KYLLKMCDKQ RLIELMHDFV
LFDGGVKKLP RVHQYFGIKA AQTYVRRGEG GIIWNTQGSG KSIVMVLLAK WILENNPDAR
VVVITDRDEL DAQIERVFRD AGESIYRTSS GRDLVNQLGQ ARPRLLCSLV HKFGARGVED
FDAFIRELKA QPSPTVGEVF VFVDECHRTQ GGKLNQVMKA MMPDAVFIGF TGTPLLKRDK
ATSLEVFGGY IHTYKFSEAV EDQVVLDLVY EARDIDQKLG SQAKIDEWFA AKTKGLNPWQ
RDELMKKWGT MQQVLSSKSR MSRVVADILF DFGVKPRLSS ERGNAILVAS SIYEACKYFS
LFQQTSLKGK CAVVTSYNPM ANHVTLEETG ANTESDKQFI YSMYTELLKD VDAQPGKTKT
ETYEAWAKDR FTREPANMKL LVVVDKLLTG FDAPPCTYLY IDKSMQDHGL FQAICRTNRL
DGEDKDFGYI VDYKDLFKKV ENAIAVYSSE LDHSAPGADP EVMIQDRLTR GREKLDEALE
ALVLLCEPVE PPRGELEHIH YFCGNTEIPE DLAERVPLRT ALYKGVATLM RTYAGLADEM
EPAGYSAKEA AEIKRRLEDY LKLRDIIRNA SGETLDLKAY EADMRHLIDT YIEADEPRKI
SAFENIGLLD LIVKTGVADA INQQLGNLKG SRDAIAETIE NNVRSKILKE HLTDPAYFEK
MSALLAEIIE ARKAKALEYE EYLRQIAELV KKVAEGKDED VPPALDTPGK RALYNNLMLR
VPAAQPAGQV LEVRPSCPEL TQEEALELAL KLDEVVRQVR PDGWRGLQPR ENTIKRALYE
VLKDPDEVDR LFSIIKAQAE Y
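
The protein sequence can be re-derived from the gene sequence by codon translation. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs only in which codons may act as starts), strips the spacing used in the listing, and stops at the first stop codon. The names `CODON_TABLE`, `translate`, and `first_row` are illustrative.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = "ATGAGTTCGA TCGGACAGCC TGAACGCGTG ACCCAGAACC GGGTCGTCCA GCTCTTTCGC"
print(translate(first_row))  # MSSIGQPERVTQNRVVQLFR
```

The output corresponds to the first 20 residues of the protein sequence (MSSIGQPERV TQNRVVQLFR). Passing the full gene sequence to `translate` should, under the same assumptions, give a product of 1041 residues.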