Gene Nther_1914 details


Gene Information

Locus tag: Nther_1914
Symbol:
ID: 6315294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 2009625
End bp: 2010941
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 34%
IMG OID: 642644296
Product: diaminopimelate decarboxylase
Protein accession: YP_001918073
Protein GI: 188586528
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID: [TIGR01048] diaminopimelate decarboxylase
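
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = codons - 1           # the stop codon encodes no residue
    return gene_length, protein_length


print(cds_lengths(2009625, 2010941))  # (1317, 438)
```

Under these assumptions the result agrees with the Gene Length and Protein Length values listed in the record.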


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGAAT CAGTAATAAT CAATGAAGAT GGAAATTTGG AAATAGGTGG TTGTGATTTA 
GTAAAATTAG CTCAACAATA TGATACACCT TTGAATATAT TAGATGAAGA ACATATTAGG
AATAAATGTA GAAAATATCG AAGAATTTTA GAACAATATT ATCCTACAAC TGAATTGGCT
TATGCTGGTA AAGCTTTTTT GTCAACAACC ATTTGTAAAA TAGTTGAAGA AGAAGGCTTG
TTCCTTGATG TAGTCTCGGA AGGGGAATTA TTTACAGCGG TTCAGGCTGG TTTTCCTGGG
GACAAGCTGT TATTTCATGG AAATAATAAA ACCTCAAGAG AATTAGAATA TGCCTTAGAT
TGTAATGTGG GAAGAATAGT AGTTGATAGC TTCTTGGAAC TTGAACTGAT TGAAAAGTTA
GCACAAAAAA AGAGTACTAG GGTCCAAATT TATTTTAGAG TTAAACCCGG GATAGAGGCT
CATACCCACA AATATATATT AACTGGTCAA GAGGACTCCA AATTTGGCTT CAGCTTACGT
CAAGACGATA TAATGAAAGC TGTAAACTTT GCAATCAAAT CTTCATATAT TGATTTACAA
GGACTACACT GTCATATTGG TTCACAAATT AATGAAAGTA AACCTTATAG ATTAGCTGCT
AAAACCATGA TGGAATTATT AATTTCAATT CAAAAAATAT ATAATTACAG TATTTCGGAA
CTGGATTTAG GTGGGGGTAT GGGAATAAAA TATCGAAGTG ATGATTCGAA ATTAGATATT
GATTCACTTA TGAATGAAAT ATCCCAACTT ATCCTGGATA TGGCAAAAGA CTCTAAAATA
GAATTGCCGA AATTAATTTT TGAACCCGGG CGTTCAATTA TTGGAGAAGC GGGTGTCATG
TTGTATCGTG TTGGAGTCAT TAAAGAAATA CCAGGTATTA GAAATTATGT TTCAGTAGAC
GGAGGAATGA GCGATAATAT TAGACCAGCT CTTTACGGAG CTGAATACTC TGCTGTAGTG
GCAAACAAGG CCAATTATCC TCATAATAAT AGTTATACTG TTGTAGGTAA AATCTGTGAA
TCAGGTGATG TTCTCCTACA AAATATAGAT TTAGTAGATA ATCTAACACC TGGAGATTTA
CTTACCGTTT TTAGCTGTGG TGCGTATACT TATTCCATGT CAAGTAATTA TAATAGACTA
CCAAAACCAG CAGTGGTGTT AGTAAACCAA GGTCAAGCTG ATCTCATAGT TCGTCGAGAA
AGTTTAGAAG ATGTGATCCA AAATGACCTT ATCCCAGAAA GGTTTAGGAG GAACTAA
 
Protein sequence
MSESVIINED GNLEIGGCDL VKLAQQYDTP LNILDEEHIR NKCRKYRRIL EQYYPTTELA 
YAGKAFLSTT ICKIVEEEGL FLDVVSEGEL FTAVQAGFPG DKLLFHGNNK TSRELEYALD
CNVGRIVVDS FLELELIEKL AQKKSTRVQI YFRVKPGIEA HTHKYILTGQ EDSKFGFSLR
QDDIMKAVNF AIKSSYIDLQ GLHCHIGSQI NESKPYRLAA KTMMELLISI QKIYNYSISE
LDLGGGMGIK YRSDDSKLDI DSLMNEISQL ILDMAKDSKI ELPKLIFEPG RSIIGEAGVM
LYRVGVIKEI PGIRNYVSVD GGMSDNIRPA LYGAEYSAVV ANKANYPHNN SYTVVGKICE
SGDVLLQNID LVDNLTPGDL LTVFSCGAYT YSMSSNYNRL PKPAVVLVNQ GQADLIVRRE
SLEDVIQNDL IPERFRRN
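
The gene and protein sequences above are displayed in space-separated blocks of 10. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene sequence using the amino-acid assignments of translation table 11 (which are the same as those of the standard genetic code). The helper names `clean`, `gc_content` and `translate` are hypothetical, and the translation simply stops at the first stop codon; it does not handle alternative start codons.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order; the amino-acid
# string is the assignment shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the display spaces and line breaks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence in this record.
first_line = "ATGTCAGAAT CAGTAATAAT CAATGAAGAT GGAAATTTGG AAATAGGTGG TTGTGATTTA"
last_line = "AGTTTAGAAG ATGTGATCCA AAATGACCTT ATCCCAGAAA GGTTTAGGAG GAACTAA"
print(translate(first_line))  # MSESVIINEDGNLEIGGCDL
print(translate(last_line))   # SLEDVIQNDLIPERFRRN
```

Both fragments happen to be in frame because each full display line holds 60 bases, a multiple of 3. Passing the whole gene sequence to `gc_content` should give a value comparable to the GC content field in the record.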