Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2598 |
Symbol | |
ID | 7315695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2740457 |
End bp | 2741491 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643617499 |
Product | trypsin family protein |
Protein accession | YP_002514660 |
Protein GI | 220935761 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.7222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGGT TTCTTGCCTG TCTTCTTTCA CTCCTGATCC TTGCATCCGC CTCGGCGCAG GCAGCGGATC TCGCCGATCT GTTCGAGCGG GTCAATCCCA GCGTGGTGGT GATCCAGTCC CAGGAGCAGG TGGCGCGGGG CACCGACCAG CTGGCGTGGC GTACCATCCA GCGCGTCGGG TCGGGGGTGC TGGTGTCCGG GGACGGCCGC ATCGTGACCG CCGCCCACGT GGTGCAGTCC GCGGACCAGA TCCAGGTGAG CTTTGCCGAT GGCAGCACGG TGGATGCCCG GGTGATCGGT TCGGAGCCCG CGGCGGATAT CGCCCTGCTG AAGGTGGAGG CGGTGCCTGC GAGTGCCGCG GTGGCACGTC TGGGGGACTC GGACGGCGTG CGCATCGGCC AGCCGGTGTT CGCCATCGGC GCGCCCCACG GCCTGGCCCA TGCCCTGAGC GTGGGACACA TCAGCGCCCG CCATCGGGGC AGCGACATCA CCGGCGACTT CGGCCTGGGC GAGCTGCTGC AGACCGATGC CTCCCTCAAC CGCGGCAACT CCGGTGGACC GTTGTTCAAC CATTCAGGTG AAGTGGTGGG CATCGTCAGC CGCATCCTCA CCAGTTCCGG CGGCTCCCAG GGCCTGGGGT TTGCGGTGAC CTCCAACCAG GTGCGCGAGC TGCTGCTGGA GCAGCGGGCC TTCTGGAACG GCCTGAACGG TTACTGGGTG CAGGGCAACC TGGCCCGGCT CCTAAACATC CCGCAGGCGC GGGGCCTGCT GGTGCAGGGC ATGGCCTCGG GTTCGCCCGC GGCGCGCATC GGCCTGCGCG CGGGCACGGT GGAGGTGGAC ATCGCCGGCG AGACCCTGAT CCTCGGCGGC GACGTGATCC TGAGCGTCGC GGGCATCTCC TTTGCGGACG ATGATGGGTA TCAGCGTGTC CGTCAGCATC TGGGCGGGCT CAATCCTGGC GACAGCGTCA CCATCAGCGT CCTGCGTGCG GGCGAGCGGA TCTCCCTGAC CACCACCCTG GGCGAATCGG ATTAG
|
Protein sequence | MPRFLACLLS LLILASASAQ AADLADLFER VNPSVVVIQS QEQVARGTDQ LAWRTIQRVG SGVLVSGDGR IVTAAHVVQS ADQIQVSFAD GSTVDARVIG SEPAADIALL KVEAVPASAA VARLGDSDGV RIGQPVFAIG APHGLAHALS VGHISARHRG SDITGDFGLG ELLQTDASLN RGNSGGPLFN HSGEVVGIVS RILTSSGGSQ GLGFAVTSNQ VRELLLEQRA FWNGLNGYWV QGNLARLLNI PQARGLLVQG MASGSPAARI GLRAGTVEVD IAGETLILGG DVILSVAGIS FADDDGYQRV RQHLGGLNPG DSVTISVLRA GERISLTTTL GESD
|
| |