Gene Pnap_0434 details


Gene Information

Locus tag: Pnap_0434
Symbol:
ID: 4687603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 448360
End bp: 449985
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 65%
IMG OID: 639833431
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_980677
Protein GI: 121603348
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
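
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 448360
end_bp = 449985
gene_length_bp = 1626
protein_length_aa = 541

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in exactly one stop codon, which is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.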


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGAT TGAACGACCC AATCAAGGAC GGCCTGCAGC GCGGCTGGAA AGTCTCGGGC 
GGCGCCCTGG GGCCGATTCC AGAAAAAATC GTCTGCGATG TCGCCATCAT CGGCAGTGGC
GCGGGCGCGG GCATCACGGC CGAACTGCTG GCCAAGGCCG GGCTGCAGGT GGTGGTGATT
GAAGAGGGCC CGCTCAAGAG CAGCAGCGAC TTCAACCAGA AGGAGTCCGA AGCCTATCCT
TCGCTGTACC AGGAAAGCGC GGCGCGCAAG ACCGAAGACA AGGCGATCAA CATCCTGCAG
GGCCGCTGCG TCGGGGGCTC GACGACGGTG AACTGGACCA GCTCCTTTCG AACGCCGCCG
GCCACGCTCC AGTTTTGGCA GGACCAGTTT GGACTGGGCA GCTACACCGC CGAGGCGCTG
GCGCCTTATT TCGCCCAGGC CGAGCGGCGG CTGAACATTG CGCCGTGGCC GGTGGCGCCC
AACGAAAACA ATGAACTGCT GCGCCGTGGC GCCGTCAAGC TCGGGATTCC CGCCGCGTCC
ATCGCGCGCA ACGTCAAGGG CTGCTGGAAT CTGGGCTCGT GCGGACTGGG CTGCCCGACC
AACGCCAAGC AGTCGATGCT GGTCACGACG ATTCCGGCCG CGCTGGACCT GGGCGCGCAA
CTGCTGACCG AAACCCGGGC CGAGCGCTTC GAACTGGCCA ACGGCAGGGT GACGGCGCTG
GTGTGCCGAA ATGTAGAGCC AAATGGGGCT TTTGCGCAAT ATGGACGGGC GCAAACAGCT
ATTAAAATAA TAGCAAAACA TTATGTGCTG GCCGGCGGCG CCATCAACTC GCCCGCCGTG
CTGCTGCGCT CTGGCGCGCC CGACCCGCAT GGCCGGCTGG GAGTGCGGAC CTTTTTGCAC
CCGGTGCTCA TGTCTTCCGG CATCTTTGCG CAACAAGTCG CGGCCTGGAG CGGCGCGCCG
CAGTCGATCT ACAGCGACCA TTTTCTGCAA ACCCAGCCGA TGGACGGCCC CATGGGCTAC
AAGCTGGAGG CGCCGCCGCT GCATCCGCTG ATTTTTGCGT CCACCGTGCC GGGCTTTGGC
GAAGGGCAGC ATGCGCTGCT GAAGGCTTTT GCGCACAACC ACACCTTGCT GGCGCTGCTG
CGCGACGGCT TTCACGATGA AGCGCCCGGT GGCAAGGTCA GGCTGCGCGG TGACGGCTCG
GCCGTGCTCG ATTACCCGCT GAGCGACTAC GTCATGGACG GCGGCCGCCG GGCGCTGCTG
TCGATGATGC AGATCCAGTT CGCGGCCGGG GCGCAGCAGG TGCTGCCGCT GCACGAAATG
GCCGCGCCCT ACAGCTCCTG GGTGCAGGCA CGGGCTGCCG TCATGGCGCT GCCGATGAAG
CCGCGCCTGG TGAAAATCGT CAGCGCGCAT GTGATGGGCG GCTGCGGTCT GGCCGCCACC
GAAGCGCAGG GCGTGACGCG GCCTGACGGC CTGCATTGGC AACTGGACAA TCTCTCGATT
CACGACGGCT CGCTGTTCCC GACCAGCATA GGCGCCAATC CGCAGCTGTC GGTGTATGGC
ATGGTCAACC GCCTGGCGCA GGGACTGGCC AAAACGCTGA CCGGCCGGGA TGTGGCGCTG
GCCTGA
 
Protein sequence
MSRLNDPIKD GLQRGWKVSG GALGPIPEKI VCDVAIIGSG AGAGITAELL AKAGLQVVVI 
EEGPLKSSSD FNQKESEAYP SLYQESAARK TEDKAINILQ GRCVGGSTTV NWTSSFRTPP
ATLQFWQDQF GLGSYTAEAL APYFAQAERR LNIAPWPVAP NENNELLRRG AVKLGIPAAS
IARNVKGCWN LGSCGLGCPT NAKQSMLVTT IPAALDLGAQ LLTETRAERF ELANGRVTAL
VCRNVEPNGA FAQYGRAQTA IKIIAKHYVL AGGAINSPAV LLRSGAPDPH GRLGVRTFLH
PVLMSSGIFA QQVAAWSGAP QSIYSDHFLQ TQPMDGPMGY KLEAPPLHPL IFASTVPGFG
EGQHALLKAF AHNHTLLALL RDGFHDEAPG GKVRLRGDGS AVLDYPLSDY VMDGGRRALL
SMMQIQFAAG AQQVLPLHEM AAPYSSWVQA RAAVMALPMK PRLVKIVSAH VMGGCGLAAT
EAQGVTRPDG LHWQLDNLSI HDGSLFPTSI GANPQLSVYG MVNRLAQGLA KTLTGRDVAL
A
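
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks could be joined and translated is given below. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool, and the codon table is the standard amino acid assignment, which translation table 11 shares for elongation codons. Only the first and last lines of the gene sequence are used here, to keep the example short.

```python
# Build the standard codon table (amino acid assignments shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; a stop codon is shown as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from this record.
first_line = clean("ATGAGCCGAT TGAACGACCC AATCAAGGAC GGCCTGCAGC GCGGCTGGAA AGTCTCGGGC")
last_line = clean("GCCTGA")

# First 20 residues of the protein sequence, copied from this record.
protein_start = clean("MSRLNDPIKD GLQRGWKVSG")

print(translate(first_line) == protein_start)
print(translate(last_line))
print(round(gc_fraction(first_line), 2))
```

Applying `clean` to the full gene sequence and passing the result to `translate` would, under the same assumptions, give the complete protein followed by a terminal `*`.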