Gene Nther_1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1478 
Symbol 
ID6315490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1550867 
End bp1551853 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content36% 
IMG OID642643858 
ProductMembrane dipeptidase 
Protein accessionYP_001917649 
Protein GI188586104 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000344928 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.899515 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATGGC AAGCACTTCA TGACAAAACT TTTATTGTCG ATGGACATAG TGATACTATT 
TTAAATTATG ACCGTTACGA GAATTTCGAT TTTCTCTACA GTAATGACAA TGTTCATATG
GACTTACCTA AAATCGATAC AGGAGGGATT GATTTACAGT TTTTTGCTGT ATTTATAGAG
GATCAATTTC TTCCTAATGC GGGTTTTAAA AATTGTGTCC GTTTATTAGA GACATTTCGG
AACAATATTA TAGATAGTCC CAACTTTTCT ATGATTGAAA CTAAGAAAGA CTTAAGAAAT
GCCATTGATG ATGGATCACA AAAGAAATAT GGTTTACTCA CCATAGAAGG TGGTGAAGTC
CTAGAAGGTG ATATAAACCT ATTAAGAGCT CTTTATCGAT TGGGGATCAG GGGAATCACT
TTAACGTGGA ATAGACGAAA TGAATTGGCA GATGGCTGTA GTTTAGGCAA GTATGCCGGT
GGTTTAAGTG ATTTTGGCTG TCAAGTAGTT CGAGAAATGA GTAGATTGGG TATGATGGTA
GATGTCAGCC ATTTGTCTTT AAATAGTTTT AATCATGTGT TGGAAATTCA TGACGAACCT
GTAGTTGCAA CTCATTCAAA TGCTAGTTCA ATTTTAAATC ATCCAAGAAA TTTAGACGAT
AATCAGCTTA AAAAAATTGC TGAAAGTGGT GGAGTTATAG GTCTGAATTA TGCATCCCAC
TTCATCACCA ACTCTCAGAA AAGAGCTGGC TTAGATGAAT TATTTCAACA TTTACAATAC
ATGATAAATC TAGTAGGAGA GGATCATGTG GCTCTAGGCA GTGACTTTGA TGGAATATCC
AATCCACCCA AAGAAATAAA CACAGCCGCC GATTTACCTA AACTCACCGA ATATCTTTGC
AAATGTAATC TTAGTGAAAC TACTATTCAG AAAGTTTTAG GGGATAATTG GCTCAGGGTA
TTAAACGAGG TATTACCGGA GGATTGA
 
Protein sequence
MEWQALHDKT FIVDGHSDTI LNYDRYENFD FLYSNDNVHM DLPKIDTGGI DLQFFAVFIE 
DQFLPNAGFK NCVRLLETFR NNIIDSPNFS MIETKKDLRN AIDDGSQKKY GLLTIEGGEV
LEGDINLLRA LYRLGIRGIT LTWNRRNELA DGCSLGKYAG GLSDFGCQVV REMSRLGMMV
DVSHLSLNSF NHVLEIHDEP VVATHSNASS ILNHPRNLDD NQLKKIAESG GVIGLNYASH
FITNSQKRAG LDELFQHLQY MINLVGEDHV ALGSDFDGIS NPPKEINTAA DLPKLTEYLC
KCNLSETTIQ KVLGDNWLRV LNEVLPED