Gene Nther_0402 details

Gene Information

Locus tag: Nther_0402
Symbol:
ID: 6316235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 417978
End bp: 419012
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 40%
IMG OID: 642642786
Product: O-sialoglycoprotein endopeptidase
Protein accession: YP_001916586
Protein GI: 188585041
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
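
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 417978
end_bp = 419012
gene_length_bp = 1035
protein_length_aa = 344


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the reported gene length (1035 bp) and protein length (344 aa) are consistent with the coordinates.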


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.0900376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAACTC AAAGAGAAAC TGTATATATT CTAGGAATCG AAACTTCATG TGACGAAACA 
GCAGTTTCTG TAGTCTCAAA TGGTCAGAAG GTTTTAAGTA ATGTAGTTAG CTCTCAAACC
GATGTTCATT CCTTATATGG AGGGGTGGTT CCAGAAATAG CTTCAAGGGA ACATTCAAAA
TTGCTTCCTC CACTGGTGAG TAAGGCCGTT GACGAAGCAG GGTTAAATAT GAATGATATC
AATGCCATAG CTGTTACTCA AGGTCCTGGA CTGGTTGGCC CACTATTAGT TGGTGTATCG
TTTGCTAAAT CTTTAGCATA TTCTCTAAAG ATACCTCTAA TTCCCGTTAA CCATATTAAA
GCACATCTAT ATGCCAATTT TATACTTGCT GACGACGATA AAGATTCAAA CTTACCTAGT
TTCCCTTTAA TTGCTTTAAT AGTTTCAGGA GGCCATACTA ATCTTTACTA TTTGTATGAT
CACGATAATT GGAAGCTTCT GGGGCGCAGT AGAGACGATG CTTCCGGTGA GTCCTTTGAC
AAAATTGCTA GAGCTTTAGA CCTGGGATAT CCGGGTGGAC CAGCTGTTGA AAAGGAAGCT
CAAAAGGGCC AACCTAATGT TGATTTTCCG AAACCTAAGC TAGAAAACGA ATACGATTTT
AGTTTCAGTG GTTTGAAAAG TGCTGTATTG AATTATTTGA ACCGGAAAAA AATGAAAGGT
GAACAATACA GTAGCTCTGA TATATGCGCA AGTTTTCAAC AGGTGGTTGT TGACTCACTT
GTGGAGAAAA CAATCTCTGC GGCTCGGGAT AATCAGGTTG ATACTATAGT TCTAGCGGGA
GGAGTTTCTG CTAATGGACA GTTAAGAGCG AATTTTAAGG AAACAACTGC TGGAGAGGGA
ATAAATTTAT TTCTTCCTAG ACTGGAGTAC TGTACTGATA ATGCAGCCAT GATAGCTGCA
CTGGGTTATC ATAACTATCG TCGTGGCTGG ATTGCTTCCC TGGATCTGAA TGCGGTCCCC
AATTTGAGTC CATAA
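
The sequence above is laid out in blocks of 10 bases, so the whitespace has to be removed before any calculation. As a minimal sketch of how a GC percentage such as the one in the gene information table could be recomputed, the helpers below (hypothetical names, not from any library) clean a blocked sequence and count G and C bases. Only the first row of the gene is used here for brevity; pasting the full sequence in its place would give the gene-wide figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGAAACTC AAAGAGAAAC TGTATATATT CTAGGAATCG AAACTTCATG TGACGAAACA"
print(len(clean_sequence(first_row)))
print(round(gc_content(first_row), 1))
```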
 
Protein sequence
METQRETVYI LGIETSCDET AVSVVSNGQK VLSNVVSSQT DVHSLYGGVV PEIASREHSK 
LLPPLVSKAV DEAGLNMNDI NAIAVTQGPG LVGPLLVGVS FAKSLAYSLK IPLIPVNHIK
AHLYANFILA DDDKDSNLPS FPLIALIVSG GHTNLYYLYD HDNWKLLGRS RDDASGESFD
KIARALDLGY PGGPAVEKEA QKGQPNVDFP KPKLENEYDF SFSGLKSAVL NYLNRKKMKG
EQYSSSDICA SFQQVVVDSL VEKTISAARD NQVDTIVLAG GVSANGQLRA NFKETTAGEG
INLFLPRLEY CTDNAAMIAA LGYHNYRRGW IASLDLNAVP NLSP
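
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the pipeline used to generate this record. It builds the codon table from the compact NCBI-style string for the standard code; translation table 11 uses the same codon assignments and differs in which start codons are permitted. As a simplifying assumption, alternative start codons are not handled specially, which does not matter here because this gene begins with ATG. The function name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a (possibly blocked) DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGAAACTC AAAGAGAAAC TGTATATATT"))  # METQRETVYI
print(translate("AATTTGAGTC CATAA"))                  # NLSP
```

Both fragments agree with the start and end of the protein sequence listed above.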