Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1087 |
Symbol | |
ID | 7315796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1170869 |
End bp | 1172302 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643615974 |
Product | protease Do |
Protein accession | YP_002513160 |
Protein GI | 220934261 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.386741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGTTT TCTCGCGTAG TACCCTGCCT TTCGCCCTGC TGTTCCTCAC CCTGGTGTTT TCCGCGGCTG CCCAAGGGCG CGCCGCGCTG CCGGATTTCG TCCCCCTGGT GGAGGACAAC AGCCCGGCGG TGGTCAACAT CAGCACCACC CGCAATATCG CCCGCGGCGG CCGTGAACCC CCCCAGTTGC GCATCCCCGA TATGCCCGAT GACGGCGTGT TGGGTGACCT GCTACGGCGT TTCTTTGGCG AGGGCGGCCA GATGCCCGAG CAGTTCGACA CCAGTTCCCT GGGCTCGGGC TTCATCATCT CGCGCGACGG CTACGTGGTG ACCAATCATC ACGTCATCGA GGATGCCGAC GAGATCATCG TGCGCCTGAG CGACCGGCGC AGCTTCCCCG CCACCGTGGT CGGTTCCGAT CCCAAGAGCG ACGTGGCCCT GCTCAAGATC GAGGCCAGCG ACCTGCCGAC CCTCAAGCTG GGTAACTCCG AGCAGCTCAA GGTGGGCGAG TGGGTGCTGG CCATCGGCTC TCCCTTCGGC TTCGACCATT CCGTGACCGC GGGTATCGTA AGCGCCAAGG GGCGCAGCCT GCCCACGGAG AACTACGTGC CCTTCATCCA GACCGACGTG GCCATCAACC CGGGCAACTC CGGCGGGCCG CTGTTCAACA TGAAGGGCGA GGTGGTCGGC ATCAATTCCC AGATCTACAG CCGCACCGGC GGTTTCATGG GCCTGTCCTT CGCCATCCCC ATCGAGATGG CCATGGAAGT GGTCGAGCAG CTCAAGACCC AGGGCTACGT GAGCCGGGGC TGGCTGGGTG TGCTGATCCA GGAGGTCACC CGGGAACTGG CCGACTCCTT CGGCATGAGC CGTCCCACCG GAGCCCTGGT GGCCCGGGTG CTGCCCGACA GCCCTGCGGA GAAGGCCGGT GTGCGGGTGG GCGACGTGAT CCTGACCTTC AACGGCGAGG AGGTCACCCG CTCCAGCGCC CTGCCGCCCC TGGTGGGGCG TGCCCCGGTG GGCAAGGACG CCCGGGTCGA GATCCTGCGC GACGGTCGCA AGCAGACCCT GCGGATACGC ATCGCGGAAC TGCCGCCGGA CGACGAACTG GCCAGCCGCG CGCCGGCCGA ATCAGTGGCC CCTTCGGCAG CTGAAAATCG CTTCGGCATG ACCCTGGAGC CGGTGCCCGC CGAGTTGCGC GAGGCCCTGG ACCTGGAGCA GGGCGGTGTC CTGGTGGCCG GGGTCGGCGA GGGCGCGGCA CGGGATGCGC GCATCCAGCG CGGCGACGTG CTGGTGATGA TCAACAACCA GCGCATCGAA TCTCCCGCGC ATTTCGCCGA GCTGGCGGGT AAACTCACCC CCGGCAGCAG GGTCCCCGTG CTGGTGCAGC GTCCCCAGGG TCCGGTCTTT CTGGCACTGA AGGTGCCGGA TTGA
|
Protein sequence | MMVFSRSTLP FALLFLTLVF SAAAQGRAAL PDFVPLVEDN SPAVVNISTT RNIARGGREP PQLRIPDMPD DGVLGDLLRR FFGEGGQMPE QFDTSSLGSG FIISRDGYVV TNHHVIEDAD EIIVRLSDRR SFPATVVGSD PKSDVALLKI EASDLPTLKL GNSEQLKVGE WVLAIGSPFG FDHSVTAGIV SAKGRSLPTE NYVPFIQTDV AINPGNSGGP LFNMKGEVVG INSQIYSRTG GFMGLSFAIP IEMAMEVVEQ LKTQGYVSRG WLGVLIQEVT RELADSFGMS RPTGALVARV LPDSPAEKAG VRVGDVILTF NGEEVTRSSA LPPLVGRAPV GKDARVEILR DGRKQTLRIR IAELPPDDEL ASRAPAESVA PSAAENRFGM TLEPVPAELR EALDLEQGGV LVAGVGEGAA RDARIQRGDV LVMINNQRIE SPAHFAELAG KLTPGSRVPV LVQRPQGPVF LALKVPD
|
| |