Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0208 |
Symbol | |
ID | 7316786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 228558 |
End bp | 229508 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643615093 |
Product | proline iminopeptidase |
Protein accession | YP_002512294 |
Protein GI | 220933395 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTGC TTTACCCGCC CATCGACCCC TATCACATGG AAACCCTGGC CGTGGACCAG ACCCACCGGC TGCACCTGGA GACCTGCGGC ACTGCCCAGG GTCTGCCGGT GGTGTTCCTG CATGGCGGCC CCGGTTCCGG CTGCGAGCCC TGGCATCGGC GTTTCTTTGA TCCCGCCGCC TACCGCATCG TGCTCTTCGA CCAGCGGGGC TGCGGCCGAT CCCGCCCCCA CGCCTCCCTG GAGGACAACA CCACCGCCCA CCTGGTGTCC GATATGGAGC GCATCCGGGA ACACCTGGGC ATCGAGCGCT GGGTGGTGTT CGGCGGCTCC TGGGGCTCGA CCCTGGCGCT GGCCTATGCC GAGGCCCACC CGGAGCGGGT GCTGGGACTG GTGTTGCGCG GCATCTTCCT GTGCCGGCCC CGGGACATCC ACTGGTTCTA CCAGGAGGGC GCCGGGCGCC TGTTCCCCGA CTACTGGGAG GACTACCTGG CGCCGATCCC CGAGTCTGAA CGGGATGAGA TGGTCTCCGC CTACCATCGC CGGCTCACCG GTGAGGACGA GGTGGCGCGC ATGGCGGCCG CCAAGGCTTG GTCCGAGTGG GAGGGGCGCA CCGCGACGCT GCTGCCCAAT CCGGGGGTGG TGGACCATTT CCGGGATCCC CACGTAGCGC TCAGCCTGGC GCGCATCGAG TGTCATTACT TCATGAACCA GTCCTTCCTG GAACCGAACC GGCTGCTGCG CGACGCCCAC CGCCTGGCGG ACATCCCCGG CACCATCGTG CACGGCCGCT ACGACGTGGT CTGCCCGCTG GACCAGGCCC ATGCCCTGCA CCGGGCCTGG CCCCGGGCGA AGCTCGAGAT CATCCCGGAT GCCGGCCATT CCGCCGGCGA ACCGGGCATC GTGGATGCCC TGGTGCGGGC CACGGACGAA CTGGCCGTGA TGCTGCGATG A
|
Protein sequence | MRVLYPPIDP YHMETLAVDQ THRLHLETCG TAQGLPVVFL HGGPGSGCEP WHRRFFDPAA YRIVLFDQRG CGRSRPHASL EDNTTAHLVS DMERIREHLG IERWVVFGGS WGSTLALAYA EAHPERVLGL VLRGIFLCRP RDIHWFYQEG AGRLFPDYWE DYLAPIPESE RDEMVSAYHR RLTGEDEVAR MAAAKAWSEW EGRTATLLPN PGVVDHFRDP HVALSLARIE CHYFMNQSFL EPNRLLRDAH RLADIPGTIV HGRYDVVCPL DQAHALHRAW PRAKLEIIPD AGHSAGEPGI VDALVRATDE LAVMLR
|
| |