Gene Nther_1775 details

Gene Information

Locus tag: Nther_1775
Symbol:
ID: 6314296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1842184
End bp: 1843443
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 36%
IMG OID: 642644149
Product: peptidase U32
Protein accession: YP_001917935
Protein GI: 188586390
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
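
The coordinates and lengths in this record are consistent with each other, and the arithmetic can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, but both agree with its numbers. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1842184, 1843443)
print(length_bp)                  # 1260
print(protein_length(length_bp))  # 419
```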


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.768163
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.436207
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATCG AATATAAAGG TAAATCTAAA GTTGAACAAA ATCAAAATCC CTTAGAACTT 
TTAGCTCCTG CTGGCAATCT AGAAAAACTT AAATTTGCTA TTGATTACGG AGCAGATGCC
GTATATTTAG GTGGGAAATC TTACGGATTA AGGGCTTTTG CCGGTAATTT TTCAAGAGAG
GATATGAGAG CGGGCGTTGA TTATGCTCAT TCGCGCGGTA GAAAGGTTTA TATTACAGTT
AATATCTTTC CACACAACGA GGATTTAGAA GGTTTAGAAG AGTATTTAAA GGAATTACAA
TCAATAGGGG TAGATGCCAT AATAATATCT GATCCTGGTA TACTTAGAAT CTGCAAAGAA
ACTGTACCAG AGATGGAAAT TCACTTATCG ACTCAAGCTA ACTGTACAAA CTGGAGATCT
GCTAAATTTT GGAAAGATCA AGGTATAAAT CGAATTATTC TAGCTCGAGA GCTAAGTTTA
GCGGAAATAG AACAAATTCA TAGAATGGAA TCTAATATAG AATTTGAAAC TTTTGTCCAC
GGGGCAATGT GTATTTCGTA CTCAGGTAGG TGCTTACTAT CCAATTATTT GGCCAATAGA
GATGCAAATA GAGGTGAATG CGCTCATCCT TGTAGATGGC AGTATTATCT TATGGAACGT
GAGCGTCCGG GGGAGTACCT GCCAATAACT GAAGATCAGG AAGGTACCAA AATTATGAGT
TCCAAAGATT TATGTATGAT TAGGCATATT CCTGAACTTA TAGATGCAGG GATTAAAAAC
TTCAAAATAG AAGGACGTAT GAAGAGTGTA CATTACGTTG CTACTGTAGT AAGAGCTTAT
CGAAAAGCTA TTAATGCATA TCTTTCCGAT CCTGATAACT ATAGTTTTAA AAGAGAATGG
GAAGAAGAAT TAAAGAAAGC TTCTACAAGA CCTTTTAGTA CTGGTTTTTA TTTTGGCTCA
CCTGATGAAA CTGATCAAGA ATATACTAAA GAGCCAAGAC AATCAAATCG CGATTTTGTG
GGAATCGTTT TAAACAGTCA AGATGGATAT TTAACTATTC AGCAGAGGAA TCATTTTCAA
ATTGGTGATC GCATAGAAAT AATAGGACCT AACAAAACTT ATGGTGAATT TGTCATAAAC
GAGATAATCA ATAGTGAAGG TAAAAAAAGT CAAGCTGCTC CACATCCCAA AGAAGTTGTT
ACAGTTCCGT TGAATGTAGA TTGTGAACCT AATTCATTGA TACGTAAAAT TCTGGGTTGA
 
Protein sequence
MTIEYKGKSK VEQNQNPLEL LAPAGNLEKL KFAIDYGADA VYLGGKSYGL RAFAGNFSRE 
DMRAGVDYAH SRGRKVYITV NIFPHNEDLE GLEEYLKELQ SIGVDAIIIS DPGILRICKE
TVPEMEIHLS TQANCTNWRS AKFWKDQGIN RIILARELSL AEIEQIHRME SNIEFETFVH
GAMCISYSGR CLLSNYLANR DANRGECAHP CRWQYYLMER ERPGEYLPIT EDQEGTKIMS
SKDLCMIRHI PELIDAGIKN FKIEGRMKSV HYVATVVRAY RKAINAYLSD PDNYSFKREW
EEELKKASTR PFSTGFYFGS PDETDQEYTK EPRQSNRDFV GIVLNSQDGY LTIQQRNHFQ
IGDRIEIIGP NKTYGEFVIN EIINSEGKKS QAAPHPKEVV TVPLNVDCEP NSLIRKILG
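
The protein sequence above is the translation of the gene sequence, read in frame from its first base. The following is a minimal sketch of that translation, not the tool used to produce this record. It relies on the fact that translation table 11 (bacterial, archaeal and plant plastid) assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The `translate` helper is hypothetical. Only the first and last 60-base lines of the gene sequence are used, which works because each line holds a whole number of codons.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGACTATCG AATATAAAGG TAAATCTAAA GTTGAACAAA ATCAAAATCC CTTAGAACTT"
last_line = "ACAGTTCCGT TGAATGTAGA TTGTGAACCT AATTCATTGA TACGTAAAAT TCTGGGTTGA"

print(translate(first_line))  # MTIEYKGKSKVEQNQNPLEL
print(translate(last_line))   # TVPLNVDCEPNSLIRKILG
```

The two outputs match the first 20 and last 19 residues of the protein sequence. The final codon, TGA, is a stop and contributes no residue.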