Gene Information |
Locus tag | Tgr7_0599 |
Symbol | |
ID | 7317947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 633846 |
End bp | 635222 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643615484 |
Product | type I restriction-modification system, S subunit |
Protein accession | YP_002512683 |
Protein GI | 220933784 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
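The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(633846, 635222)
print(length, protein_length_aa(length))  # 1377 458
```

Under these assumptions the result agrees with the Gene Length (1377 bp) and Protein Length (458 aa) fields.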
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAGT ACCAAGCCTA CCCAGAGTAT CGTGAAACAC GGCATGACCT GCTACCGCCG ATACCTGTGC ACTGGATGAC GGGGCAGATC AAAAATGCCC ATGACGTTGT GCTTGGCAAG ATGCTGCAAA GTGACGCTAA GACACCGGCA GACAGATTGT TGCCGTATCT CCGAGCTGCA AATGTGAACT GGGGTGGTGT CGACCTCAGC ACCGTAAAAG AAATGTGGTT CTCGCCTGCC GAGCGTAAGG CGTTGAGGCT GATGGTTGGC GACGTAGTGA TTAGCGAGGG CGGGGATGTC GGCCGTTCCG CTGTTTGGCA GGGCGAATTG CCAGAATGCT ATTTTCAGAA TGCGATTAAT AGGGCGCGCC CGAAGGGCGA ACACAGTAGC CGCTATCTCT ACTACTGGAT GAGCTTCATC AAGAGTGCTG GCTATATCGA CATAATCTGC AACAAGTCTA CGATTCCTCA CTACACAGCA GAGAAGGTCC AGGGGACGCC ATTTTTATTC CCCCCAGCCG GCGAGCAGGC AGGTATCGCC GCCTTCCTCG ACCACGAAAC CGCCAAGATC GACCGGCTGA TCGCCAAGCA GCAGCGGCTG ATCGAGCTGC TCAAGGAAAA GCGGCAGGCG GTGATCTCCC ATGCCGTCAC CAAGGGCCTA AACCCCGACG CGCCGATGAA GGACTCCGGT GTGGAGTGGC TAGGTGAAGT GCCGGCGCAT TGGAGGCTTG AGAAATTGAA GTACACAGCC ATCTTCAAGG GTGGTGGCAC ACCATCCAAA GATTCACCCG AATATTGGGG TGGTGATATT CCGTGGGTTT CCCCAAAGGA CATGAAGTCT CGATATGTTG CCGATTCGCA GGACAAGATC ACGGTTGAGG CTATTGCAGC GAGTTCAACT AGCCTGATTG GGCCTGGACA GGTTCTGGTT GTCGTCCGGT CAGGAATTCT TCAAAGAACT ATTCCGGTTG CCGTGAATCT TGTTGAGGTC ACACTTAACC AGGACATGAA GGCAATAGAT TTTAGGGATG AAACCCGTTC TGAGTTTTTC TCATATTTCG TTGAAGGGCA TGAGGATAAC CTGCTGCTTG AATGGCGAAA GCAAGGTGCG ACCGTAGAAA GCATAGAGCA GGAGTATTTG GGGAATACTA TGGTGCCGAT GCCGCCGCCC TCGGAAATGA TGGAGATTCT TCAGTTTCTA AATGGGCAAT TGGAGAAGTA CCGGCTTCTT ACGGAAAAGG CAACGCGCGC AATTGAGTTA CTTAGGGAGC ACCGAACCGC GCTTATATCA GCTGCCGTCA CCGGAAAGAT CGACGTCCGT GGCTGGCAAA AACCGAACAC CGAACCACAA GAAGCGGCAG AAGCCGCCAG CGCCTAA
|
Protein sequence | MGKYQAYPEY RETRHDLLPP IPVHWMTGQI KNAHDVVLGK MLQSDAKTPA DRLLPYLRAA NVNWGGVDLS TVKEMWFSPA ERKALRLMVG DVVISEGGDV GRSAVWQGEL PECYFQNAIN RARPKGEHSS RYLYYWMSFI KSAGYIDIIC NKSTIPHYTA EKVQGTPFLF PPAGEQAGIA AFLDHETAKI DRLIAKQQRL IELLKEKRQA VISHAVTKGL NPDAPMKDSG VEWLGEVPAH WRLEKLKYTA IFKGGGTPSK DSPEYWGGDI PWVSPKDMKS RYVADSQDKI TVEAIAASST SLIGPGQVLV VVRSGILQRT IPVAVNLVEV TLNQDMKAID FRDETRSEFF SYFVEGHEDN LLLEWRKQGA TVESIEQEYL GNTMVPMPPP SEMMEILQFL NGQLEKYRLL TEKATRAIEL LREHRTALIS AAVTGKIDVR GWQKPNTEPQ EAAEAASA
|
| |
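The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
prefix = "ATGGGGAAGT ACCAAGCCTA CCCAGAGTAT"
print(translate(prefix))  # MGKYQAYPEY
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after `clean`) tests the whole record the same way, and `gc_fraction` on the full sequence can be compared with the GC content field.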