Gene Pnap_4102 details


Gene Information

Locus tag: Pnap_4102
Symbol: metX
ID: 4688687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 4387713
End bp: 4388822
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 62%
IMG OID: 639837114
Product: homoserine O-acetyltransferase
Protein accession: YP_984313
Protein GI: 121606984
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
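
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (both assumptions are consistent with the values in this record but are not stated by it). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4387713, 4388822)
print(length)                     # 1110
print(protein_length_aa(length))  # 369
```

Both results agree with the Gene Length and Protein Length fields of the record.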


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.23189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTTTG CCGACGTCCT GCCCTTGCGC AGCGGCGCCT CGATTCGCGC CTATGACCTG 
AGCTATGAAA CCTATGGCCA GCTCAACGCC GACAAGTCCA ACGCGGTGCT GATCTGCCAT
GCGCTGAACG CGTCGCACCA TGTGGCCGGC GTTTACCTGG ACGAATCCGG CCAGATTCAG
AAAAAATCCG AAGGCTGGTG GGACACCATG ATCGGCCCGG GCAAGCCGGT CGATACCAAC
CGGTTTTTTG TCATCGGCGT GAACAACCTC GGTTCGTGCT TCGGCTCGAC CGGGCCGATG
CAGACCAACC CCGACACGAA TGAGGTGTAC GGCGCCGATT TCCCGGTCGT CACGGTGCAG
GACTGGGTTG ATGCGCAGGC CAGATTGCTC GATGCGCTGG GCATCCAGAC GCTGGCCGCC
GTCATGGGCG GCAGCCTGGG CGGCATGCAG GCGCTGAGCT GGACGCTGCA ATACCCCGGG
CGTGTCCGCC ATGCGGTGGT GGTGGCCAGC GCGCCCAACC TGACCGCCGA GAACATTGCC
TTCAACGAAG TCGCGCGCCG CGCCATCGTC ACCGACCCCG ACTTTCACGG CGGACATTTT
TACAAACACG GCGTGCTGCC CAAGCGCGGC CTGCGCATTG CCCGCATGAT CGGCCACATC
ACCTACCTGA GCGACGACGT GATGAACGAG AAGTTCGGAC GCCAGCTGCG CGACGCCGCA
GGCATCAAGT TTTCGACGCA GGACGTCGAG TTCCAGATCG AAAGCTACCT GCGCTACCAG
GGCGACAAGT TCGCCGAATA CTTCGACGCC AATACCTATC TCTTGATCAC GCGCGCGCTC
GACTACTTCG ACCCGGCCGG TGAATTCGGC GGCGACCTGA GCCGCGCGCT GGCCCAGGCC
AGCGCCAAGT TCTTGCTGGT CAGCTTCACC ACCGACTGGC GGTTTTCCCC GGCGCGCAGC
CGCGAAATCG TCAAGGCCCT GCTCGACAAC CAGATTGATG TGAGCTACGC CGAAATCGAC
GCGCCCCATG GCCATGATGC ATTTTTGCTC GATGATGCAC GCTACATGGG CGTGGTGCGC
TCCTATTTCG AGAGCAAGGT GAGCGCATGA
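
The gene sequence is printed in space-separated groups of 10 bases, so it has to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names) strip the whitespace and compute the GC fraction, the quantity this record reports as 62% for the full gene. Only the first line of the sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGCACTTTG CCGACGTCCT GCCCTTGCGC AGCGGCGCCT CGATTCGCGC CTATGACCTG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC fraction of the sample line only
```

Passing the whole sequence block through `clean_sequence` before calling `gc_fraction` gives the gene-wide value to compare against the GC content field.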
 
Protein sequence
MHFADVLPLR SGASIRAYDL SYETYGQLNA DKSNAVLICH ALNASHHVAG VYLDESGQIQ 
KKSEGWWDTM IGPGKPVDTN RFFVIGVNNL GSCFGSTGPM QTNPDTNEVY GADFPVVTVQ
DWVDAQARLL DALGIQTLAA VMGGSLGGMQ ALSWTLQYPG RVRHAVVVAS APNLTAENIA
FNEVARRAIV TDPDFHGGHF YKHGVLPKRG LRIARMIGHI TYLSDDVMNE KFGRQLRDAA
GIKFSTQDVE FQIESYLRYQ GDKFAEYFDA NTYLLITRAL DYFDPAGEFG GDLSRALAQA
SAKFLLVSFT TDWRFSPARS REIVKALLDN QIDVSYAEID APHGHDAFLL DDARYMGVVR
SYFESKVSA
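
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G) and translates until the first stop codon. Translation table 11 assigns the same amino acids as the standard code; its additional start codons are not handled here, which does not matter for this gene because it starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop grouping spaces and newlines
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 20 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCACTTTG CCGACGTCCT"))  # MHFADV
print(translate("GTGAGCGCATGA"))           # VSA
```

The two outputs match the first six and last three residues of the protein sequence above.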