Gene Pnuc_1900 details


Gene Information

Locus tag: Pnuc_1900
Symbol:
ID: 5053864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1
Kingdom: Bacteria
Replicon accession: NC_009379
Strand:
Start bp: 1978285
End bp: 1979805
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 46%
IMG OID: 640472074
Product: threonine dehydratase, biosynthetic
Protein accession: YP_001156676
Protein GI: 145590079
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01124] threonine ammonia-lyase, biosynthetic, long form
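
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1978285
END_BP = 1979805
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1521
print(expected_protein_length(GENE_LENGTH_BP))  # 506
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.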


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACGA ACTATTTAAA GAAAATTTTA TCGGCTCGCG TCTATGACGT AGCTAGAGAA 
ACCGAGCTCC AAGTAGCTCC CGAACTTACC AGGCGATTGG GCAACCAGGT CTTACTGAAA
AGGGAGGATA ACCAGCCGGT TTTTTCCTTC AAATTGCGTG GCGCCTATAA CAAAATGGCC
CATTTACCCT TGGAAGCCCT AAAACGCGGG GTAATTGCTG CTTCTGCAGG CAATCATGCA
CAAGGGGTAG CCCTTTCTGC CGCCAAAATG AAGTGCAAAG CAGTCATCGT GATGCCGGTT
ACCACCCCTA GCGTCAAAAT TGATGCGGTA AAGGCCCGTG GCGGCTCTTG GGTCGAAATT
ATTCTGCACG GCGAATCCTA TAGCGACGCT TTTAAGTATT CAGAAGTTCT GGGTAAAAAA
CGCGGCCTCA CCTTCGTTCA CCCGTTTGAT GATCCTGACG TCATTGCCGG GCAAGGAACC
ATTGCTCACG AAATTTTTAC GCAATATGAA AAACCCATTG ATGCAGTATT TGTGGCAATT
GGTGGTGGCG GCTTAATTTC CGGAATTGGT GAATACATCA AAGCAGTGAG CCCAAAGACT
AAAGTGATTG GCGTTCAAGC ATCGGACTCT GATGCCATGA ACCAATCTCT CAAAGCAAAC
AAGCGCATTG AAATGAAAGA TGTCGGTTTA TTCTCTGACG GCACTGCAGT AAAGCTAGTA
GGCAAAGAAA CCTTTCGCAT TTGCAAAAAA GTGGTTGATG AAATCATCAC CGTTGATACC
GATGAAATCT GCGCAGCAAT TAATGATGTG TTCACTGATA CCCGTAGCAT CCTTGAACCA
GCAGGCGCAC TAGCTATTGC AGGCATGAAG AAGTACGTCG AAAAGAAGCG TATTAAGAAG
AAAACTTTAG TGGCTGTGGC TTGTGGAGCC AATATGAACT TTAGCCGCCT GCGCTTTGTA
GCTGAACGTG CAGACGTTGG CGAGTTCCGT GAAGCGGTAT TTGCTGTCAC CATTCCTGAA
GAGCGAGGAT CACTCAAGCG CTTTTGTGAG TTACTTGGAA AACGCAACGT TACCGAATTT
AATTATCGAA TTGGCAACCA AAGTGAAGCA CATATTTTTG TTGGTATTAG CACGCAAAAA
TCTGGTGATA GCGAAGTCAT TGCCAAGCAT TTCCGCAAAG CCAAATTTGC AACTATCGAT
CTGACGCATG ATGAGTTAGC CAAGTCTCAC TTACGCCACA TGGTGGGTGG ACATTCAGCA
CTCGCAAATG ATGAGCTGTT GTACCGCTTT GAATTTCCAG AGCGCCCAGG TGCTTTGATG
AAGTTCTTGA CCAGCATGGC GCCCAATTGG AATATCAGCT TATTTCACTA CCGCAATCAT
GGTGCAGACT ATGGTCGCAT TCTAGTAGGC CTACAAGTTC CTAAGAATGA GCAAAAGAAA
TTCCAAAACT TCTTGGCTAG TCTTGGCTAT CCCCACTGGG ATGAGACCAA CAATCCTGCC
TACCATCTCT TCCTTAAATA G
 
Protein sequence
MATNYLKKIL SARVYDVARE TELQVAPELT RRLGNQVLLK REDNQPVFSF KLRGAYNKMA 
HLPLEALKRG VIAASAGNHA QGVALSAAKM KCKAVIVMPV TTPSVKIDAV KARGGSWVEI
ILHGESYSDA FKYSEVLGKK RGLTFVHPFD DPDVIAGQGT IAHEIFTQYE KPIDAVFVAI
GGGGLISGIG EYIKAVSPKT KVIGVQASDS DAMNQSLKAN KRIEMKDVGL FSDGTAVKLV
GKETFRICKK VVDEIITVDT DEICAAINDV FTDTRSILEP AGALAIAGMK KYVEKKRIKK
KTLVAVACGA NMNFSRLRFV AERADVGEFR EAVFAVTIPE ERGSLKRFCE LLGKRNVTEF
NYRIGNQSEA HIFVGISTQK SGDSEVIAKH FRKAKFATID LTHDELAKSH LRHMVGGHSA
LANDELLYRF EFPERPGALM KFLTSMAPNW NISLFHYRNH GADYGRILVG LQVPKNEQKK
FQNFLASLGY PHWDETNNPA YHLFLK
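
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. The sketch below is not the database's own tooling. It builds a codon table from the standard compact amino-acid string, on the understanding that translation table 11 assigns sense and stop codons the same way as the standard code and differs only in the start codons it permits. The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored and '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGGCAACGA ACTATTTAAA GAAAATTTTA TCGGCTCGCG TCTATGACGT AGCTAGAGAA"
# Last line of the gene sequence; the 1500 bases before it keep it in frame.
last_line = "TACCATCTCT TCCTTAAATA G"

print(translate(first_line))  # MATNYLKKILSARVYDVARE
print(translate(last_line))   # YHLFLK*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final six residues followed by the stop codon.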