Gene Nther_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2221 
Symbol 
ID6315399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2359384 
End bp2360469 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content49% 
IMG OID642644609 
Productpeptidase S58 DmpA 
Protein accessionYP_001918375 
Protein GI188586830 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000010872 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.545e-18 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTCTATAA AGAGCTGTAG TATTACGGAA CTGACAGATT TTAAATTTGG TCATGCCCAA 
GATTTTCAGG CAGCCACGGG TGTAACCGTA ATTACCACTA CAGAAGGTCA GGGAGTTACG
GCCGGGGTAG ATGTCAGGGG TAGCGCCCCG GGCACTAGAG AAACAGATCT ACTAGACCCC
ACTAACCTGG TGGAAGAAGT TCACGGCATT TTTCTTGCCG GAGGCAGTGC CTTTGGATTA
GAAGCAGCAG GTGGTATCAT GAAGTACTTA GAAGAATCGG GTGTGGGTGT TCCCACAGGT
TATGCCAAAG TTCCCATAGT CCCAGGTGCA ATTTTGTTTG ACCTGGGAAT CGGCGATCCC
CGGACTCGTC CCGATGCCAA TATGGGCTAT CAGGCCGCAA AAAATGCCGC CAACTCAAAT
CCCCAGGAAG GTAATTACGG CTGTGGAACG GGAGCCACTA TCGGCAAGTT TGCCGGTGAA
GCCCATGCCA TGAAAGCAGG GGTCGGCGTA TCCGCCTTTA GAACGGGAGA TTTGATCGTG
GCTAGTTTGG TAGCAGTTAA CTGCTTTGGT GAAGTTATAG ACCCGGAAAC CGGTCAGATA
ATAGGTGGAG CTTATGATAG CAGTACCTAT CAGTTTATAA GGGCCAGGGA AGTCCTGGGC
CACGACAGTG AAAGTGATAA GGAAAACAGT AATAGTGCTG ACAATGGTAA AGATCAAGGT
AATAGCCACG GTGACAGTGA ACGCGATAGC GGCGGTTACA ATGACGATAA CGATGATGGT
GAAAGAAGTA TCTTCTCTCG CAATAATACC ACTATCGGAG TCGTTGTGAC CAATGCCCAG
CTAACTAAAG CAGCTGCTAC CAAGGTAAGC CAAATGGCCC ACAGCGGCAT CTCCCGCACT
ACCAGACCCG CTCATTCCAT GCTGGATGGT GACGCTTTAT TTACCATGGC TTCCGGTCGT
GTAACTTCTG ATTTGACTCT CATCGGTGAA CTGGCCGCCC TGTCCGTGGA AAAATCTATC
ATTAATGGTG TAACCAAAGC CCAGTCCAGC CATAATTTAC CCGCTCACAA TGATATATTT
TTTTAG
 
Protein sequence
MSIKSCSITE LTDFKFGHAQ DFQAATGVTV ITTTEGQGVT AGVDVRGSAP GTRETDLLDP 
TNLVEEVHGI FLAGGSAFGL EAAGGIMKYL EESGVGVPTG YAKVPIVPGA ILFDLGIGDP
RTRPDANMGY QAAKNAANSN PQEGNYGCGT GATIGKFAGE AHAMKAGVGV SAFRTGDLIV
ASLVAVNCFG EVIDPETGQI IGGAYDSSTY QFIRAREVLG HDSESDKENS NSADNGKDQG
NSHGDSERDS GGYNDDNDDG ERSIFSRNNT TIGVVVTNAQ LTKAAATKVS QMAHSGISRT
TRPAHSMLDG DALFTMASGR VTSDLTLIGE LAALSVEKSI INGVTKAQSS HNLPAHNDIF
F