Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_3016 |
Symbol | |
ID | 7318313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 3156406 |
End bp | 3157416 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643617914 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_002515073 |
Protein GI | 220936174 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.612396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATCC TGGGTATCGA GACCTCCTGT GATGAGACCG GCGTGGCGGT GATCGATGCC GAGCGCGGGC TGCTGTCCCA TGCCCTCTAC AGCCAGGTGG CCCTGCACGC GGAATACGGC GGCGTCGTGC CGGAGCTGGC CTCCCGGGAC CATATCCGCA AGCTGCTGCC CCTGGTGCGC CAGGCCCTGG ACGGGGCGGA CACCGCCGCC TCGGATCTCA CCGGCATCGC CTATACCTCG GGCCCCGGCC TGCTGGGTGC GCTGCTGGTG GGTGCCGGGG TGGCGCGCAG TCTCGCCTGG GCCTGGGGCC TGCCCGCCGT GGGCGTGCAC CACATGGAGG GGCACCTGCT CGCGCCCCTG CTGGGCGATG AACCGCCCGA GTTTCCCTTC CTGGCCCTGC TGGTCTCCGG CGGGCATACC ATGCTGGTGG ACGTGCAGGG GGTGGGGCGT TACCGGATCC TGGGCGAGAG CCTGGACGAC GCCGCCGGCG AGGCCTTCGA CAAGACCGCC AAGCTGCTCG GGCTGAGCTA CCCCGGCGGC CCGGCCCTGG CGGCCCTGGC GGAACGGGGT GATCCGGCGC GTTTCAGCTT TCCCCGCCCC ATGACCGACC GCCCGGGCCT GGACTTCAGT TTCTCCGGGC TCAAGACCCG GGCCCTGACC ACCCTGCGCG AGACGCGCAC CGAGCAGGAC CGGGCCGACG TGGCGCGGGC CTTCGAGGAG GCGGTGGTGG ATACCCTGGT GATCAAGTGT CTGCGGGCGG TGCAGGAGAC CGGGGCTGAG CGCCTGGTGG TGGCCGGTGG CGTGGGCGCC AACCGCCGCC TGCGGCAGCG GCTCAAGGAA GCGGTGGGCG CCGAGGGCGC CAGCGTGCAC TACCCGCCAT TCGAGTTCTG CACCGACAAC GGCGCCATGA TTGCCCTGGC GGGGTTGATG CGTTTCCAGG CGGGTGCGGG CGAGGACCTG ACCATCCGGG CGCGCGCCCG GTGGAACCTG GAGTCGAAGT CGGAAGTGTG A
|
Protein sequence | MRILGIETSC DETGVAVIDA ERGLLSHALY SQVALHAEYG GVVPELASRD HIRKLLPLVR QALDGADTAA SDLTGIAYTS GPGLLGALLV GAGVARSLAW AWGLPAVGVH HMEGHLLAPL LGDEPPEFPF LALLVSGGHT MLVDVQGVGR YRILGESLDD AAGEAFDKTA KLLGLSYPGG PALAALAERG DPARFSFPRP MTDRPGLDFS FSGLKTRALT TLRETRTEQD RADVARAFEE AVVDTLVIKC LRAVQETGAE RLVVAGGVGA NRRLRQRLKE AVGAEGASVH YPPFEFCTDN GAMIALAGLM RFQAGAGEDL TIRARARWNL ESKSEV
|
| |