Gene Nther_2022 details


Gene Information

Locus tag: Nther_2022
Symbol: tnaA
ID: 6315834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 2133139
End bp: 2134527
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 37%
IMG OID: 642644410
Product: tryptophanase
Protein accession: YP_001918177
Protein GI: 188586632
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID: [TIGR02617] tryptophanase, leader peptide-associated
            [TIGR02618] tyrosine phenol-lyase
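
The Start bp, End bp, Gene Length and Protein Length values in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2133139
end_bp = 2134527


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1389 462
```

Both results agree with the Gene Length (1389 bp) and Protein Length (462 aa) fields.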


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAATA TTATTGATGA AATGATGGCG GCTGAACCGT TTAGGATTAA AATGGTTGAG 
CCGATTAAGA CAACTACCAA GGAAGAACGC CAGAGAAAAA TTAAAGAAGC GGGTTATAAT
GTTTTTAATT TGGAATCTAA GGATGTTTAT ATCGATTTAT TAACAGATAG TGGAACAAGT
GCCATGAGTG ATAATCAATG GGCTGGTATG ATGTTAGGAG ATGAGGCATA TGCTGGTAGT
AAAAACTTCT ATAATCTCAA GGATGCCATC AAAGAAGTGA TGGGTTATGA CTACTTTTTA
CCTACACATC AAGGGAGAGC CGCAGAAAAT GTACTTTTTC AATTATACGT GGAAGAAGGA
AATTATGTAC CTAATAATAT GCATTTTGAT ACCACAAAGG CACATGTTCA AGATAAACGT
GGGCGACCTG TCAATTTAGT TATAGATGAA GCTTATGATG CCAAGAAAAA TGTGCCATTT
AAAGGAAATA TGGATATTGA TAAACTAGAT AATTTCTTAA AAGAGCACAG TGATAACGTT
CCCATTGTAT TATTGACAGT AACGTGTAAT AGCGGTGGTG GTCAACCCGT TTCTATGGAA
AATATCAAAG AAGTGTCAGA AATTTGTAAG AAATATAATA AACCTTTCTA TTTAGATGCT
TGTCGCTTCG CAGAAAATGC TTATTTTATT AAACTAAGAG AGCCAGGCTA TGCCGATAAA
TCCATTCCAG AGATAACCCG GGAGATGTTC TCTTATGCAG ATGGTTGCAC CATGAGTTCC
AAAAAAGATG CTTTAGTTAA TATTGGTGGT TTTTTAGCAA TGGAAGATGA GAAACTACTT
GATAAAGCAA AAGTTTTAGG AGTACTATAT GAAGGGTTCC CTACTTACGG GGGTATGGCT
GGCAGAGATA TGGAAGCCAT GGCCCGTGGA TTACTTGAAG TTGTGGAAGA AGAATATTTA
CAATACAGAA TAAACCAGGT TAATTATTTA GGTGAAAAGT TGAAGGAAAA GAAAGTACCT
ATATTGGAAC CCGTTGGTGG ACACGCTGTA TATCTAGATG CTGGTCGTTT CTTACCAAAT
ATTCCAAAAG ACCAATTTCC TGGTCAAGCT TTAACAGTAG CACTTTACGA AGAGGGCGGT
ATTAGAGGTG TTGAAATAGG AACTGTCTTA AGTGGAAGAG ATCCTGAAAC CGGAGACCAT
GATTATCCCG AATTAGAACT GGTTCGTTTA ACTATACCCA GAAGAGTATA TACTTATCGA
CATATGGATG TTGTGGCAGA GGCTGCTAAA ATAATTCATG ATAAGCGAGA TGAAATTGGT
GGCTATAAAT TTACTTATGA ACCTGAGATC TTAAGGCATT TTACTGCCAG ATTTGCTCCT
GTAAAATAA
 
Protein sequence
MGNIIDEMMA AEPFRIKMVE PIKTTTKEER QRKIKEAGYN VFNLESKDVY IDLLTDSGTS 
AMSDNQWAGM MLGDEAYAGS KNFYNLKDAI KEVMGYDYFL PTHQGRAAEN VLFQLYVEEG
NYVPNNMHFD TTKAHVQDKR GRPVNLVIDE AYDAKKNVPF KGNMDIDKLD NFLKEHSDNV
PIVLLTVTCN SGGGQPVSME NIKEVSEICK KYNKPFYLDA CRFAENAYFI KLREPGYADK
SIPEITREMF SYADGCTMSS KKDALVNIGG FLAMEDEKLL DKAKVLGVLY EGFPTYGGMA
GRDMEAMARG LLEVVEEEYL QYRINQVNYL GEKLKEKKVP ILEPVGGHAV YLDAGRFLPN
IPKDQFPGQA LTVALYEEGG IRGVEIGTVL SGRDPETGDH DYPELELVRL TIPRRVYTYR
HMDVVAEAAK IIHDKRDEIG GYKFTYEPEI LRHFTARFAP VK
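
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below strips the whitespace, translates DNA with the codon assignments of translation table 11 (whose codon-to-amino-acid mapping matches the standard code), and computes a GC fraction. The helper names `clean`, `translate` and `gc_fraction` are hypothetical and chosen for illustration only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Translation table 11 assigns
# the same amino acids to codons as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence shown above.
first_block = "ATGGGTAATA TTATTGATGA AATGATGGCG"
last_block = "GTAAAATAA"
print(translate(first_block))  # MGNIIDEMMA
print(translate(last_block))   # VK
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.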