Gene Pnap_3801 details

Gene Information

Locus tag: Pnap_3801
Symbol:
ID: 4686744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 4053155
End bp: 4054690
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 64%
IMG OID: 639836819
Product: methylmalonate-semialdehyde dehydrogenase
Protein accession: YP_984018
Protein GI: 121606689
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
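
The start, end, and length values in the table above are mutually consistent. The short Python sketch below reproduces the arithmetic. It assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4053155
end_bp = 4054690

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1536 511"
```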


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.228893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.75946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCAT CGCCCGCCGA AATCGCTCCC CAGATCACCA TTGGCCACTA CCTGAACGGC 
AGCGCTGTCA CGCCGGCCGC CGGGCGCAGC CAGGAGGTGT TCAATCCGGC CACCGGCGCG
GTCTCTGGCC ATGTGGCGCT GGGCAGCGCA GCCGATGTGG ATGTCGCCGT TGCCAGCGCG
CAGGCCGCCT TTCCGGCCTG GTCCGACATC CCGCCGATTC GCCGTGCCCG CGTCATGTTC
AAGTTCCTGG AACTGCTCAA CCAGAACAAG GACAAGCTCG CCCACCTGAT CACCGCCGAG
CACGGCAAGG TGTTCACCGA CGCGCAGGGC GAAGTCTCGC GTGGCATCGA CATCGTCGAA
TTCGCCTGCG GCATTCCGCA ACTGCTCAAG GGCGATTTCA CCGACCAGGT GTCAACCGGC
ATCGACAACT GGACGCTGCG CCAGCCGCTG GGCGTCGTGG CCGGCATCAC GCCGTTCAAC
TTTCCGGTCA TGGTGCCGAT GTGGATGTTC CCGGTGGCGA TTGCTGCGGG CAACTGCTTT
GTCCTCAAGC CCAGCCCGAT TGACCCGAGC GCCAGCCTCT TCATGGCTGA CTTGCTCAAA
CAGGCCGGTT TGCCCGACGG CGTGTTCAAC GTCGTGCAGG GCGACAAAGA AGCGGTCGAT
GCGCTGCTGG TGCATCCCGA TGTCAAGGCC GTGTCCTTCG TCGGCTCGAC ACCGATTGCC
AACTACATCT ATGAAACCGG CGCCCGCCAC GGCAAGCGCG TGCAGGCGCT GGGCGGCGCC
AAGAACCACA TGGTGGTCAT GCCCGACGCC GACCTGGAGC AGGCGGTCGA TGCGCTGATC
GGCGCGGGCT ACGGCTCGGC CGGCGAGCGC TGCATGGCGA TTTCGGTGGC GGTGCTGGTC
GGCGACGTGG CCGAAAAGCT CATTCCGATG CTCAAGGCCC GGGCCGAAAC GCTGGTGATC
AAGAACGGCG TGAACCTGGA TGCGGAAATG GGTCCGATCG TCACCGGCAT TGCGCACCAG
CGCATCACCG GCTACATCGA CCTGGGCGTT GAAGAGGGCG CCGAGCTGCT GGTCGATGGC
CGCACCTTCA CTGGCGCGCA AGCGGGCGAG GGCTGCGCGC AGGGTTTCTG GATGGGTGGC
ACGCTGTTTG ACCATGTCAC GCCCGACATG CGCATCTACA AGGAAGAAAT CTTCGGCCCG
GTGCTGGGCT GCGTGCGGGT GAACGATTTG GCCGAAGCGG TCGAGTTGAT CAACGCCCAC
GAGTTTGGCA ACGGCGTGAG CTGCTTCACC CGCGACGGCC ATGTGGCACG CGAATTCAGC
CGCCGCATCC AGGTCGGCAT GGTCGGCATC AATGTGCCGA TTCCGGTGCC CATGGCCTGG
CACGGCTTTG GCGGCTGGAA GAAGAGCCTG TTCGGCGACA TGCACGCCTA CGGCGAGGAA
GGCGTGCGCT TCTACACGAA GCAAAAATCC ATCATGCAGC GCTGGCCAGA AAGCATTGGC
AAAGGCGCCG AGTTCGTGAT GCCGACAGCG AAATAA
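
The GC content listed in the Gene Information section can be recomputed from the gene sequence above. The following is a minimal sketch. The helper name `gc_content` is hypothetical, and it assumes the sequence is pasted in as a string whose space-separated blocks of ten and line breaks are to be ignored.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    # Remove the spaces and line breaks used to lay out the sequence.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first line of the gene sequence only.
first_line = "ATGTTTTCAT CGCCCGCCGA AATCGCTCCC CAGATCACCA TTGGCCACTA CCTGAACGGC"
print(round(gc_content(first_line), 1))
```

Passing the full 1536 bp sequence gives a value that can be compared with the percentage reported in the record.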
 
Protein sequence
MFSSPAEIAP QITIGHYLNG SAVTPAAGRS QEVFNPATGA VSGHVALGSA ADVDVAVASA 
QAAFPAWSDI PPIRRARVMF KFLELLNQNK DKLAHLITAE HGKVFTDAQG EVSRGIDIVE
FACGIPQLLK GDFTDQVSTG IDNWTLRQPL GVVAGITPFN FPVMVPMWMF PVAIAAGNCF
VLKPSPIDPS ASLFMADLLK QAGLPDGVFN VVQGDKEAVD ALLVHPDVKA VSFVGSTPIA
NYIYETGARH GKRVQALGGA KNHMVVMPDA DLEQAVDALI GAGYGSAGER CMAISVAVLV
GDVAEKLIPM LKARAETLVI KNGVNLDAEM GPIVTGIAHQ RITGYIDLGV EEGAELLVDG
RTFTGAQAGE GCAQGFWMGG TLFDHVTPDM RIYKEEIFGP VLGCVRVNDL AEAVELINAH
EFGNGVSCFT RDGHVAREFS RRIQVGMVGI NVPIPVPMAW HGFGGWKKSL FGDMHAYGEE
GVRFYTKQKS IMQRWPESIG KGAEFVMPTA K
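
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. The names `CODON_TABLE` and `translate` are illustrative. The amino-acid string is the standard genetic code in TCAG order; translation table 11 assigns the same amino acids to codons and differs in which codons may act as start codons. That difference does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to lay out the sequence.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGTTTTCAT CGCCCGCCGA AATCGCTCCC"))  # prints "MFSSPAEIAP"
```

Translating the full gene sequence and comparing the result with the protein sequence (after removing its spaces) confirms whether the two blocks of the record agree.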