Gene Nther_1155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1155 
Symbol 
ID6315721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1222402 
End bp1223790 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content34% 
IMG OID642643528 
Productpeptidase M28 
Protein accessionYP_001917326 
Protein GI188585781 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.447711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.173867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAAA ATATAGTCAC AAAAGATTTA CAAAAAGTTA GTGTGCTACA CTTATTACGT 
GATCAAAGAG GGGTTTTTGC CTACTCTTCT AAAATAACTA TAACAGACAA GGGGGTTGTA
ACAATGGAGA TTTTTGAAAG TGATGGATTG AGAATGTATG AAGATATTGT CGATTTAGCA
GCTTTAGGGG ATCGATTTGC AGGTTCTACA GCAGAAAAAG AAGCTGGTGA AATTGTTAAA
AAATCGTTTA TACAATCTGG ACTCGAAGTT TATGAAGAAT CTTTTGATGT ATTTTCTTTT
TATGAAAAAA ATTCCCAAAT AAAGTTAAGA AAGCCTCTAG AACTCACATT CGATACCAGG
GCAATGTATT ATTCACCGGG AACTCATAAA GAAGGATTAC AAGGTGAACT AGTTTATGTA
AAAAATGGTT TAGAACAAGA TTATGAAGGT AAAGATGTAG AAGGAAAAAT AGTAATTTTC
CATAGAGATG ACAAACAAAT CAAAGATCAT TTCTGGCCAG AAATAAAAAC AGCTTCAGAA
AAAGGTGCAA TTGGTGCAAT TCTTATTAAT TTCGATGATT GGGAATTTAT AACTACTCTA
GAAACTGGCT ATTTTGAACC ATCGAAAAGG TTTTTACCTA TCGAACCAAA TGAAATCCCA
TCTGTTATTG TTAGCAAAAA TAAAGGTGAC TTGATTCTTG ATTTAATGAA TCAAGAAAAA
GTTATTGTAG ACATTATTGT TGACACGCTA AACGAAAAAA TGCGTTCATC CAATATAAGA
GGGGTTAAAG CAGGAAGCCA AAATAATAAT GAGAAAATTC TTATATATGG ACATAGAGAC
TCTGCTGGAA CGCCTGGAGC GAATGATAAT GGATCAGGTA CCGTAATAAT GATGGAACTT
GCTCGTCTTT TAAAGGATAT GAAATTAAAT AGAACTATAG AATTTTTATC AACTGGGGCA
GAAGAACAAT TAGGGGCCGC AGGTGCATTG GAGTATATAA ATAGGCATAA AAGCGAACTA
AATAATATAA AAGCTGCTGT TGAATTAGAT ATGGTTGGTA ATGGAAATTC ATTATGTGTT
ATGAAGGGAG GAGAGTGGCC AGATAAAACA GTAAATTTCC CAGACAAAGT TTGTCAGTTT
TTTTACGAAA AAGCGCGGGA ATATGGATAT GCCGTGGAAT ATGGATTCAA TGATTTTGGG
ACACCAGATA GTGGTAAGTT TGCATCAGCA GGTGTTCCTA CCACTTGGAT ATGGGGACCT
GATGATATTT ATTATCATAG TCCAGAAGAT ACTCCTGAAA AGGTTGATCG AAATAAATTG
AAAATAGTGG CTGACATGCT GTTAAAAGTA ATTTTAGATT TAGATAAACA AGAAACCCTC
AATTTTTAG
 
Protein sequence
MMENIVTKDL QKVSVLHLLR DQRGVFAYSS KITITDKGVV TMEIFESDGL RMYEDIVDLA 
ALGDRFAGST AEKEAGEIVK KSFIQSGLEV YEESFDVFSF YEKNSQIKLR KPLELTFDTR
AMYYSPGTHK EGLQGELVYV KNGLEQDYEG KDVEGKIVIF HRDDKQIKDH FWPEIKTASE
KGAIGAILIN FDDWEFITTL ETGYFEPSKR FLPIEPNEIP SVIVSKNKGD LILDLMNQEK
VIVDIIVDTL NEKMRSSNIR GVKAGSQNNN EKILIYGHRD SAGTPGANDN GSGTVIMMEL
ARLLKDMKLN RTIEFLSTGA EEQLGAAGAL EYINRHKSEL NNIKAAVELD MVGNGNSLCV
MKGGEWPDKT VNFPDKVCQF FYEKAREYGY AVEYGFNDFG TPDSGKFASA GVPTTWIWGP
DDIYYHSPED TPEKVDRNKL KIVADMLLKV ILDLDKQETL NF