Gene Nther_2063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2063 
Symbol 
ID6317129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2180863 
End bp2181789 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content36% 
IMG OID642644451 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001918218 
Protein GI188586673 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000239523 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000119694 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAATT GGAACCAAAA CAAAAATAGA AATGAAAAAG AATTCGAATA TAAGGAGAAT 
AATCATTTAG ATAGGGATGA AGAGTTTGAA AATGAAGAAT TAGAAGATGA AGAATTTTAT
TTTCCTGAAG AATACTGGGA CTTTGAATTT GATGAAGAAG ACGATGAGGA CTGGACTGAG
TTTGATGAAA GAAAAAGCAA GATTCGAAAA ATATTTACAG CGGCTGTGTT AGTGATGTTT
GTTATTACAG CTATGACCGG AGTATTTAAT GTTTTAGCTA ATTTTCCTAT TGATGCGTAT
TTAGAATCTT TGGATTTGAG AGATAACCCC CAAGTAAAAG AACTGAAAAA AAGTGTTGTT
ATGGTATCAG GACATGGAGA AGCCCAAAAA AACTCTATTT CTACAAGACA GGCGGGCTCT
GGTTTTAATA TTGATCCTTC GGGTAAAATA CTGACTAATC GTCATGTTGT TGAAGACGCT
ACTAATATTT CCGTGAATTT CAGAGAAGAA GAAAAGGGAT TTCCTGTTGA GGAATGGCAC
GGTGCTCCTT ACCCAAATAT CGATATGGCT ATTTTAGAAA TTCAGGGAGA AAATTTACCC
TATGTTGAGC TTAAGGATGA TCCGATCGCA TCTTTAGACA AAGGTCAAGA TGTTTTGATT
ATTGGTAATC CCAGGGGGAT AGGGAGCCTG GCTGTAGAAG GTGAACTCAT GAAAATTCAC
GAACTTTCCG GGACACCTCA TAGTATTTTA GAAATAGATG CCCATATTCA TCCTGGACAT
AGTGGTAGCC CTGTATTTGA TGCAGAAGGT GAAGTAGTGG GAATTATATA CGCTTCGCGT
GAAACAAATG ATGGTAAACA AGTAGGTTTA GCAGTTTCTT TAAAAGACGT GAAAGATCTA
GAAAAATTTA AGGATAGAGG GGAATAA
 
Protein sequence
MKNWNQNKNR NEKEFEYKEN NHLDRDEEFE NEELEDEEFY FPEEYWDFEF DEEDDEDWTE 
FDERKSKIRK IFTAAVLVMF VITAMTGVFN VLANFPIDAY LESLDLRDNP QVKELKKSVV
MVSGHGEAQK NSISTRQAGS GFNIDPSGKI LTNRHVVEDA TNISVNFREE EKGFPVEEWH
GAPYPNIDMA ILEIQGENLP YVELKDDPIA SLDKGQDVLI IGNPRGIGSL AVEGELMKIH
ELSGTPHSIL EIDAHIHPGH SGSPVFDAEG EVVGIIYASR ETNDGKQVGL AVSLKDVKDL
EKFKDRGE