Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0402 |
Symbol | |
ID | 6316235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 417978 |
End bp | 419012 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642642786 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_001916586 |
Protein GI | 188585041 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0900376 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACTC AAAGAGAAAC TGTATATATT CTAGGAATCG AAACTTCATG TGACGAAACA GCAGTTTCTG TAGTCTCAAA TGGTCAGAAG GTTTTAAGTA ATGTAGTTAG CTCTCAAACC GATGTTCATT CCTTATATGG AGGGGTGGTT CCAGAAATAG CTTCAAGGGA ACATTCAAAA TTGCTTCCTC CACTGGTGAG TAAGGCCGTT GACGAAGCAG GGTTAAATAT GAATGATATC AATGCCATAG CTGTTACTCA AGGTCCTGGA CTGGTTGGCC CACTATTAGT TGGTGTATCG TTTGCTAAAT CTTTAGCATA TTCTCTAAAG ATACCTCTAA TTCCCGTTAA CCATATTAAA GCACATCTAT ATGCCAATTT TATACTTGCT GACGACGATA AAGATTCAAA CTTACCTAGT TTCCCTTTAA TTGCTTTAAT AGTTTCAGGA GGCCATACTA ATCTTTACTA TTTGTATGAT CACGATAATT GGAAGCTTCT GGGGCGCAGT AGAGACGATG CTTCCGGTGA GTCCTTTGAC AAAATTGCTA GAGCTTTAGA CCTGGGATAT CCGGGTGGAC CAGCTGTTGA AAAGGAAGCT CAAAAGGGCC AACCTAATGT TGATTTTCCG AAACCTAAGC TAGAAAACGA ATACGATTTT AGTTTCAGTG GTTTGAAAAG TGCTGTATTG AATTATTTGA ACCGGAAAAA AATGAAAGGT GAACAATACA GTAGCTCTGA TATATGCGCA AGTTTTCAAC AGGTGGTTGT TGACTCACTT GTGGAGAAAA CAATCTCTGC GGCTCGGGAT AATCAGGTTG ATACTATAGT TCTAGCGGGA GGAGTTTCTG CTAATGGACA GTTAAGAGCG AATTTTAAGG AAACAACTGC TGGAGAGGGA ATAAATTTAT TTCTTCCTAG ACTGGAGTAC TGTACTGATA ATGCAGCCAT GATAGCTGCA CTGGGTTATC ATAACTATCG TCGTGGCTGG ATTGCTTCCC TGGATCTGAA TGCGGTCCCC AATTTGAGTC CATAA
|
Protein sequence | METQRETVYI LGIETSCDET AVSVVSNGQK VLSNVVSSQT DVHSLYGGVV PEIASREHSK LLPPLVSKAV DEAGLNMNDI NAIAVTQGPG LVGPLLVGVS FAKSLAYSLK IPLIPVNHIK AHLYANFILA DDDKDSNLPS FPLIALIVSG GHTNLYYLYD HDNWKLLGRS RDDASGESFD KIARALDLGY PGGPAVEKEA QKGQPNVDFP KPKLENEYDF SFSGLKSAVL NYLNRKKMKG EQYSSSDICA SFQQVVVDSL VEKTISAARD NQVDTIVLAG GVSANGQLRA NFKETTAGEG INLFLPRLEY CTDNAAMIAA LGYHNYRRGW IASLDLNAVP NLSP
|
| |