Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2221 |
Symbol | |
ID | 6315399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 2359384 |
End bp | 2360469 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642644609 |
Product | peptidase S58 DmpA |
Protein accession | YP_001918375 |
Protein GI | 188586830 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000010872 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 4.545e-18 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGTCTATAA AGAGCTGTAG TATTACGGAA CTGACAGATT TTAAATTTGG TCATGCCCAA GATTTTCAGG CAGCCACGGG TGTAACCGTA ATTACCACTA CAGAAGGTCA GGGAGTTACG GCCGGGGTAG ATGTCAGGGG TAGCGCCCCG GGCACTAGAG AAACAGATCT ACTAGACCCC ACTAACCTGG TGGAAGAAGT TCACGGCATT TTTCTTGCCG GAGGCAGTGC CTTTGGATTA GAAGCAGCAG GTGGTATCAT GAAGTACTTA GAAGAATCGG GTGTGGGTGT TCCCACAGGT TATGCCAAAG TTCCCATAGT CCCAGGTGCA ATTTTGTTTG ACCTGGGAAT CGGCGATCCC CGGACTCGTC CCGATGCCAA TATGGGCTAT CAGGCCGCAA AAAATGCCGC CAACTCAAAT CCCCAGGAAG GTAATTACGG CTGTGGAACG GGAGCCACTA TCGGCAAGTT TGCCGGTGAA GCCCATGCCA TGAAAGCAGG GGTCGGCGTA TCCGCCTTTA GAACGGGAGA TTTGATCGTG GCTAGTTTGG TAGCAGTTAA CTGCTTTGGT GAAGTTATAG ACCCGGAAAC CGGTCAGATA ATAGGTGGAG CTTATGATAG CAGTACCTAT CAGTTTATAA GGGCCAGGGA AGTCCTGGGC CACGACAGTG AAAGTGATAA GGAAAACAGT AATAGTGCTG ACAATGGTAA AGATCAAGGT AATAGCCACG GTGACAGTGA ACGCGATAGC GGCGGTTACA ATGACGATAA CGATGATGGT GAAAGAAGTA TCTTCTCTCG CAATAATACC ACTATCGGAG TCGTTGTGAC CAATGCCCAG CTAACTAAAG CAGCTGCTAC CAAGGTAAGC CAAATGGCCC ACAGCGGCAT CTCCCGCACT ACCAGACCCG CTCATTCCAT GCTGGATGGT GACGCTTTAT TTACCATGGC TTCCGGTCGT GTAACTTCTG ATTTGACTCT CATCGGTGAA CTGGCCGCCC TGTCCGTGGA AAAATCTATC ATTAATGGTG TAACCAAAGC CCAGTCCAGC CATAATTTAC CCGCTCACAA TGATATATTT TTTTAG
|
Protein sequence | MSIKSCSITE LTDFKFGHAQ DFQAATGVTV ITTTEGQGVT AGVDVRGSAP GTRETDLLDP TNLVEEVHGI FLAGGSAFGL EAAGGIMKYL EESGVGVPTG YAKVPIVPGA ILFDLGIGDP RTRPDANMGY QAAKNAANSN PQEGNYGCGT GATIGKFAGE AHAMKAGVGV SAFRTGDLIV ASLVAVNCFG EVIDPETGQI IGGAYDSSTY QFIRAREVLG HDSESDKENS NSADNGKDQG NSHGDSERDS GGYNDDNDDG ERSIFSRNNT TIGVVVTNAQ LTKAAATKVS QMAHSGISRT TRPAHSMLDG DALFTMASGR VTSDLTLIGE LAALSVEKSI INGVTKAQSS HNLPAHNDIF F
|
| |