Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1478 |
Symbol | |
ID | 6315490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1550867 |
End bp | 1551853 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642643858 |
Product | Membrane dipeptidase |
Protein accession | YP_001917649 |
Protein GI | 188586104 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0000344928 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.899515 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATGGC AAGCACTTCA TGACAAAACT TTTATTGTCG ATGGACATAG TGATACTATT TTAAATTATG ACCGTTACGA GAATTTCGAT TTTCTCTACA GTAATGACAA TGTTCATATG GACTTACCTA AAATCGATAC AGGAGGGATT GATTTACAGT TTTTTGCTGT ATTTATAGAG GATCAATTTC TTCCTAATGC GGGTTTTAAA AATTGTGTCC GTTTATTAGA GACATTTCGG AACAATATTA TAGATAGTCC CAACTTTTCT ATGATTGAAA CTAAGAAAGA CTTAAGAAAT GCCATTGATG ATGGATCACA AAAGAAATAT GGTTTACTCA CCATAGAAGG TGGTGAAGTC CTAGAAGGTG ATATAAACCT ATTAAGAGCT CTTTATCGAT TGGGGATCAG GGGAATCACT TTAACGTGGA ATAGACGAAA TGAATTGGCA GATGGCTGTA GTTTAGGCAA GTATGCCGGT GGTTTAAGTG ATTTTGGCTG TCAAGTAGTT CGAGAAATGA GTAGATTGGG TATGATGGTA GATGTCAGCC ATTTGTCTTT AAATAGTTTT AATCATGTGT TGGAAATTCA TGACGAACCT GTAGTTGCAA CTCATTCAAA TGCTAGTTCA ATTTTAAATC ATCCAAGAAA TTTAGACGAT AATCAGCTTA AAAAAATTGC TGAAAGTGGT GGAGTTATAG GTCTGAATTA TGCATCCCAC TTCATCACCA ACTCTCAGAA AAGAGCTGGC TTAGATGAAT TATTTCAACA TTTACAATAC ATGATAAATC TAGTAGGAGA GGATCATGTG GCTCTAGGCA GTGACTTTGA TGGAATATCC AATCCACCCA AAGAAATAAA CACAGCCGCC GATTTACCTA AACTCACCGA ATATCTTTGC AAATGTAATC TTAGTGAAAC TACTATTCAG AAAGTTTTAG GGGATAATTG GCTCAGGGTA TTAAACGAGG TATTACCGGA GGATTGA
|
Protein sequence | MEWQALHDKT FIVDGHSDTI LNYDRYENFD FLYSNDNVHM DLPKIDTGGI DLQFFAVFIE DQFLPNAGFK NCVRLLETFR NNIIDSPNFS MIETKKDLRN AIDDGSQKKY GLLTIEGGEV LEGDINLLRA LYRLGIRGIT LTWNRRNELA DGCSLGKYAG GLSDFGCQVV REMSRLGMMV DVSHLSLNSF NHVLEIHDEP VVATHSNASS ILNHPRNLDD NQLKKIAESG GVIGLNYASH FITNSQKRAG LDELFQHLQY MINLVGEDHV ALGSDFDGIS NPPKEINTAA DLPKLTEYLC KCNLSETTIQ KVLGDNWLRV LNEVLPED
|
| |