Gene Pnap_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_0644 
Symbol 
ID4688043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp681977 
End bp683626 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content64% 
IMG OID639833638 
Productthreonine dehydratase 
Protein accessionYP_980884 
Protein GI121603555 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01124] threonine ammonia-lyase, biosynthetic, long form 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0883544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCCT CCAAAAAATC CATCCCCGGC GCTGCAGCCA AAGCAGCCGC CACCGTCCCT 
CCTAAAGCTT CAGCCAAACC GGCCCACAAG CCTTCGGCCA AAACGCTCAA GCCCGCCGAC
TACCTGAAAA AAATCCTGAC CGCGCGGGTT TATGACGTGG CGGTCGAATC GGCGCTGGAG
CCGGCCAAAA CCTTAAGCCT GCGCCTGAAC AACACCGTGC TGCTCAAGCG CGAGGACCAG
CAGCCGGTGT TCAGCTTCAA GCTGCGCGGC GCCTACAACA AGATGGCCCA CCTGACGCCG
GCGCAGCTCA AAAAGGGCGT GATTTGCGCC TCGGCTGGCA ACCATGCCCA GGGCGTGGCC
ATGAGCGCGC AAAAGCTCGG CACGCGCGCC GTCATCGTCA TGCCGACGAC CACGCCGCAG
CTCAAAATTG ACGCGGTCAA GGGCTGGGGC GGCGAGGTCG TGCTGCACGG CGACAGCTAT
TCGGACGCCT ACACCCACTC GGTGGCCCTG CAAAAAGAAC AGGGCCTGAC CTTCGTGCAT
CCCTTCGACG ACCCGGACGT GATTGCCGGC CAGGGCACGA TTGCCATGGA AATCCTGCGC
CAGCTGCAAA CGCTGGGCTC GCGCCGGCTC GATGCTGTGT TTGTCGCCAT TGGCGGTGGC
GGGCTGATTT CCGGCGTGGC CAACTACATC AAGGCGGTGC GGCCCGAGAT CAAGGTCATC
GGCGTGCAGA TGAACGACTC CGACGCCATG ATGCAGTCGG TGGCGGCAAA AAAACGCGTC
ACCCTTTCCG ACGTGGGCCT GTTCTCGGAC GGCACGGCGG TCAAGCTGGT GGGCGAGGAA
ACCTTCCGCA TCAGCCGCGA ACTGGTCGAT GAATTCATGA CGGTGGACAC CGACGCGGTC
TGCGCCGCCA TCAAGGATAT TTTTGTCGAT ACGCGCAGCA TTGTCGAGCC GGCCGGCGCG
CTGGCCGTGG CCGCCATCAA GCAGTACGTC GCCCGGCACA AGACCAAGGG CGAAACCTAT
GCCGCCATCC TGTGCGGCGC CAACATGAAC TTCGACCGCC TGCGCTTTGT CGCCGAGCGC
GCCGAAGTGG GCGAGGAGCG CGAAGCGCTG TTCGCGGTGA CGATTCCCGA GGAGCGCGGC
AGCTTCCGGC GTTTTTGCGC GCTGATCGAC AAGGCGCCCG GCGGGCCGCG CAGCGTGACC
GAGTTCAACT ACCGCATCAA CGACCAGGCC GTGGCGCATG TGTTTGTCGG CCTGACGACC
TCGGCCAAGG GCGAGAGCAG CAAGATTGCC GCCCAGTTCA CCCGCCACGG CTTCAAGGCG
CTGGACCTGA CGCATGACGA ACTGGCCAAG GAACACATCC GCCACATGGT GGGCGGGCAC
AGCGCGCTGT CCAAAGACGA GCGCCTGCTG CGCTTCATCT TTCCCGAGCG GCCCGGCGCG
CTGATGAAGT TCCTGTCGAG CATGCGGCCG GACTGGAACA TCACCCTGTT CCACTACCGC
AACCAGGGCG CCGACTATGG CCGCATCCTG GTCGGCCTGC AGGTGCCCAA GGCGGACAGC
GGCGCGTTCC AGGAATTCCT GGACACCCTC GGCTACCCCC ATGTGGAAGA AACCGACAAC
CCGGTGTACC GGCTGTTTTT GCAAAGCTGA
 
Protein sequence
MPSSKKSIPG AAAKAAATVP PKASAKPAHK PSAKTLKPAD YLKKILTARV YDVAVESALE 
PAKTLSLRLN NTVLLKREDQ QPVFSFKLRG AYNKMAHLTP AQLKKGVICA SAGNHAQGVA
MSAQKLGTRA VIVMPTTTPQ LKIDAVKGWG GEVVLHGDSY SDAYTHSVAL QKEQGLTFVH
PFDDPDVIAG QGTIAMEILR QLQTLGSRRL DAVFVAIGGG GLISGVANYI KAVRPEIKVI
GVQMNDSDAM MQSVAAKKRV TLSDVGLFSD GTAVKLVGEE TFRISRELVD EFMTVDTDAV
CAAIKDIFVD TRSIVEPAGA LAVAAIKQYV ARHKTKGETY AAILCGANMN FDRLRFVAER
AEVGEEREAL FAVTIPEERG SFRRFCALID KAPGGPRSVT EFNYRINDQA VAHVFVGLTT
SAKGESSKIA AQFTRHGFKA LDLTHDELAK EHIRHMVGGH SALSKDERLL RFIFPERPGA
LMKFLSSMRP DWNITLFHYR NQGADYGRIL VGLQVPKADS GAFQEFLDTL GYPHVEETDN
PVYRLFLQS