Gene Nther_1802 details


Gene Information

Locus tag: Nther_1802
Symbol:
ID: 6316372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1865992
End bp: 1867368
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 37%
IMG OID: 642644176
Product: tyrosine phenol-lyase
Protein accession: YP_001917962
Protein GI: 188586417
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID: [TIGR02618] tyrosine phenol-lyase
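
The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1865992
end_bp = 1867368
reported_gene_length = 1377      # bp
reported_protein_length = 458    # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length == reported_gene_length)        # coordinates agree with Gene Length
print(gene_length % 3 == 0)                       # CDS is a whole number of codons
print(protein_length == reported_protein_length)  # agrees with Protein Length
```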


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.106432
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.00367591
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCAAGAG AGTTTCAAGC AGAACCTTTC AGAATTAAAA CAGTAGAGCC AATCAAAATG 
ACTACAAAAG AAGAGCGAGA AGAGGCTATA CAAAAAGCAG GGTTCAACAC TTTCCTATTA
GATTCTAAAG ATGTTTACAT CGATTTATTA ACTGACAGCG GAACTAATGC CATGAGTGAT
TATCAATGGG CAGGTCTTAT GATGGGTGAT GAGGCTTATG CCGGAAGTAA AAACTGGTAT
AACTTAGAAA AAGCAGTACA AGAATGTTTT GGTTTCGAAT ATGTAGTTCC AACTCATCAA
GGTAGGGGCG CAGAAAACAT TTTATCTCAG ATCATGATTA AAGAGGGAGA TTACATCCCA
GGAAATATGT ATTTTACAAC TACTAAGGCC CATCAAGAAT TTGCCGGAGG TACTTTTAGA
GATGTGATAA TTGACAAGGC ACATGATTCA CAAGCTGAGC ATCCTTTTAA GGGAAATGTA
GACCTAGAAA AATATCAAGC TTTGATTGAT GAAGTAGGTC CGGAACGAAT TCCATATATC
TGTGTTGCAG TTACTGTAAA TATGGCGGGT GGACAGCCTG TTAGCATGGA AAATTTACGC
AAGGTTAAAG AAATATCTGA TAAATATGGT ATAAAAGTAA TGTTTGATGC TACTCGTTGT
GTGGAAAATG CTTACTTTAT CAAAGAAAGA GAAGAAGGGT ATCAGGATAA ATCCATTGCT
GAAATTTTAA AAGAAATGAT GAGTTATGGT GACGGAGCTA CTATGAGTGG TAAAAAAGAT
CCCTTAGTAA ATATTGGTGG TTTCTTGGCT ATGAATGACG AAGAGCTATA TCAAAGAGCT
ACTCAACTAG TTGTTGTGTA CGAGGGAATG CCCACTTATG GTGGAATGGC AGGAAGAGAT
ATGGAAGCCA TGGCGAGAGG AATATATGAA TCACTAGATT ATCACTACAT TAGTCACAGA
GTCAATCAGG TAAGGTATTT AGGAGAGAAG CTAGAGGCTG GCGGAGTTCC AATTGTCAAA
CCTATCGGTG GTCATGCAAT ATTTGTAGAT GCCGAGAGAT TCTTGTCCCA TTTACCTAGA
GAAGAATTTC CAGCCCAGGC CTTGGCTGCT AATATTTATA GAGATTCTGG AGTAAGGACT
ATGGAAAGAG GTACTGTTTC TGCTGGAAGA GATAAAGATG GCAACAATAT TTTCCCCAAA
TTAGAAACAG TTAGATTAAC TATTCCTAGA AGAGTTTACA CTTATGCGCA TATGGACGTT
GTTGCGGACT CGATTATCAA TCTTTATGAA AATAGAGAAA AAATCAAAGG TTTACGATTT
GCTTATGAAC CACCAGTATT AAGATTCTTT ACCGCAAAAT TTGAAGAAAT AAAATAA
 
Protein sequence
MSREFQAEPF RIKTVEPIKM TTKEEREEAI QKAGFNTFLL DSKDVYIDLL TDSGTNAMSD 
YQWAGLMMGD EAYAGSKNWY NLEKAVQECF GFEYVVPTHQ GRGAENILSQ IMIKEGDYIP
GNMYFTTTKA HQEFAGGTFR DVIIDKAHDS QAEHPFKGNV DLEKYQALID EVGPERIPYI
CVAVTVNMAG GQPVSMENLR KVKEISDKYG IKVMFDATRC VENAYFIKER EEGYQDKSIA
EILKEMMSYG DGATMSGKKD PLVNIGGFLA MNDEELYQRA TQLVVVYEGM PTYGGMAGRD
MEAMARGIYE SLDYHYISHR VNQVRYLGEK LEAGGVPIVK PIGGHAIFVD AERFLSHLPR
EEFPAQALAA NIYRDSGVRT MERGTVSAGR DKDGNNIFPK LETVRLTIPR RVYTYAHMDV
VADSIINLYE NREKIKGLRF AYEPPVLRFF TAKFEEIK
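
The sequence blocks above are printed in groups of ten characters, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that, compute a GC percentage, and translate a coding sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the function names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not part of any database tool. Only the first row of the gene sequence is used as sample input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares its codon-to-amino-acid assignments
# with the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGTCAAGAG AGTTTCAAGC AGAACCTTTC AGAATTAAAA CAGTAGAGCC AATCAAAATG"
print(translate(first_row))  # MSREFQAEPFRIKTVEPIKM, the first 20 residues of the protein
```

Passing the full gene sequence block to `translate` and `gc_percent` in the same way allows a comparison against the listed protein sequence and the reported GC content.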