Gene Tgr7_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTgr7_1778 
Symbol 
ID7317588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. HL-EbGR7 
KingdomBacteria 
Replicon accessionNC_011901 
Strand
Start bp1891507 
End bp1892766 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content52% 
IMG OID643616670 
Producttype I restriction-modification system specificity subunit 
Protein accessionYP_002513847 
Protein GI220934948 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.926504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGTGA GGGAGGCAAG TGCGAAGTAT CTGCCGCCGG AGGCCGAGGG GTGTCCGGCG 
GGGTATAAGC AGACTGAGGT GGGTCTGGTG CCGTTGGATT GGGAGGTCAT ATCTCTTGAT
AAGTTCGCAG ACGTCACGAG CGGCAAGCGT CTGCCTTTGG GGCGTTCACT GACAGAGCAT
GAAACGCCAC ACCCGTACAT CCGCGTCTCG GATATGCGCC CTGGATATGT CTGCGTTGAT
GAGATTCGGT ACGTTCCAGT GGATGTGTTC CCGAAGATTA AGCGGTACCG GATCTATACA
GACGATATTT TTATATCCGT GGCGGGAACG CTCGGGATTG TCGGTAAGAT ACCGAAGCGA
CTCAATGGCG CGAACTTGAC TGAGAACGCT GATCGCATAA CGAATATAAA GTGCTCACAA
AATTATCTTC TGCATGTTCT GATGTCGCCG TTGATTCAGA GCAAGATTGA ATCTATTCAA
ACAGTCGGCG CACAGCCAAA ACTGGCTTTG ACGAGGATTC GGAAGTTCGA GATTCCGCTA
CCCCCAACAG ATAGAGAGCA GCAAGCCATC GCCTCCGCCT TGAGCGATGC GGACGCCCTC
ATCGAATCCC TCTCGCAGCT CCTCGCCAAG AAACGCCAGA TCAAACAAGG CGCCATGCAG
GAACTGCTCA CCGGCAAGCG GCGCCTGCCG GGGTTTAGTG GGGAGTGGGA TGTGAAGCGG
TTGGGTAGTG TTTTGAAATT CCAAGTGGGA TTTCCATTTA GTTCAATTTA TTTCAACGAT
GAATTTCAAG GGATCCGACT GATCAAGAAT CGTGATCTTA AAGCTAGTGA CCAGATCATT
AGCTACACCG GAGATTATCG GCATGAATTT CTCGTCAAAG ATGGAGATTT GCTGATTGGA
ATGGATGGTG ATTTCATCCC ATGCTTGTGG GGTGAAGGGG TTGCTCTTCT GAATCAGCGG
GTTGGGCGGG TTATTCCGCT TTCTGGATTA GATGCAAAAT TTGCCTACTA CTATCTAATT
GCGCCGTTGA AGAAAATCGA GGATTCAACG TCAAGCACAA CTGTTAAGCA CTTGTCTCAT
GGGGATGTGG AAGGTATCGA AGAGCCTCTT CCGGAAGTTG AGGAACAAAT CGCTATCGCT
ACCACCCTCT CCGACATGGA CGCCGAAATT GCCACACTGG AGGCGAAGCT CGCCAAGGCC
CGCCAGCTCA AGCAGGGCAT GATGCAGGCG CTGCTCACCG GTCGGATCCG GCTGGTATGA
 
Protein sequence
MEVREASAKY LPPEAEGCPA GYKQTEVGLV PLDWEVISLD KFADVTSGKR LPLGRSLTEH 
ETPHPYIRVS DMRPGYVCVD EIRYVPVDVF PKIKRYRIYT DDIFISVAGT LGIVGKIPKR
LNGANLTENA DRITNIKCSQ NYLLHVLMSP LIQSKIESIQ TVGAQPKLAL TRIRKFEIPL
PPTDREQQAI ASALSDADAL IESLSQLLAK KRQIKQGAMQ ELLTGKRRLP GFSGEWDVKR
LGSVLKFQVG FPFSSIYFND EFQGIRLIKN RDLKASDQII SYTGDYRHEF LVKDGDLLIG
MDGDFIPCLW GEGVALLNQR VGRVIPLSGL DAKFAYYYLI APLKKIEDST SSTTVKHLSH
GDVEGIEEPL PEVEEQIAIA TTLSDMDAEI ATLEAKLAKA RQLKQGMMQA LLTGRIRLV