Gene Pnap_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_0020 
Symbol 
ID4687095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp18854 
End bp19921 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content70% 
IMG OID639833014 
Productglycine oxidase ThiO 
Protein accessionYP_980267 
Protein GI121602938 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.504997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTCAC CCATGAACTC CCTTCACATC GGCATTGCCG GCGCCGGCCT GGCCGGCCGC 
ACGCTGGCCT GGCGGCTGCT GCGCGCGGGC TGCCGCGTCA CTTTGTTCGA TTCGCGCCAG
CGCGCCGAAC TGGACACCGC TTCGATGACC GCAGCGGCCA TGCTGTCGCC GCTGGCCGAA
CTGTCGGTAT CAGACGAAGT GGTGTTTCAG TTGGGCCGGC GCTCGATGGA GTTATGGCCG
CGCTGGGTCG CCGAACTGGC CGAGGGCGGC GGCGAGCCAG TGTATTTCCG CCAGAAAGGC
ACGCTGGTCG TGGCGCACGC GCCTGACCAG AGTTCGCTCG ACCACTTCAG CGGCCTGTTG
CACCACCGGC TGCCCGAGGC CTGCCGCGCC GAGGTGCACA CGCTGGACGC GGCCGCGCTG
GCGCAGCGCG AGCCGGCGCT GGCCGGGCGC TTTGGCGGCG GCCTGTTCCT GGAGAGCGAA
GGCCAGCTGG CCAATGACCA GTGGATGGCC GTGCTGGGCC GGGAGATCGA CCGGCTCGGC
GTGACCTGGC ATGAAGGCCA GGCGGTTGAC CGGGTGGAAG AGGGGCGCAT CATTTGCGCC
AGCGGCGAAT ACGCAGTGGA TGTGGCGGTC GATGCGCGCG GCGTGGGCAG CAAGGCGCAG
TGGCCGCAGC TACGCGGCGT GCGCGGCGAG GTGCTGCGCG TCGAATGCCA CGGCGTGACT
TTGCAGCGCC CGGTGCGGCT GATGCATCCG CGCTACGCGC TCTACGTCGC GCCCCGGCCC
GACCACCAGT TCGTCGTCGG CGCGACCGAA CTCGAATCGG AAGACACCGG CCCGGTCACG
CTGCGTTCAA CGCTGGAGCT GGGCAGCGCG CTGTACAGCC TGCACCCGGC CTTTGGCGAG
GCGCGCGTGC TGCGGCTGTC GGCCGCGCTG CGTCCGGCGC TGGACGACCA CCGGCCGGCC
GTGGCGCTGC GCGATGGCGT GTGGCACATC AACGGCCTGT ACCGGCATGG CTATTTGTGC
GCGCCGGCGG TGGTCGATGA ACTGGCCCAT AAACTGTTGG CAACATGA
 
Protein sequence
MHSPMNSLHI GIAGAGLAGR TLAWRLLRAG CRVTLFDSRQ RAELDTASMT AAAMLSPLAE 
LSVSDEVVFQ LGRRSMELWP RWVAELAEGG GEPVYFRQKG TLVVAHAPDQ SSLDHFSGLL
HHRLPEACRA EVHTLDAAAL AQREPALAGR FGGGLFLESE GQLANDQWMA VLGREIDRLG
VTWHEGQAVD RVEEGRIICA SGEYAVDVAV DARGVGSKAQ WPQLRGVRGE VLRVECHGVT
LQRPVRLMHP RYALYVAPRP DHQFVVGATE LESEDTGPVT LRSTLELGSA LYSLHPAFGE
ARVLRLSAAL RPALDDHRPA VALRDGVWHI NGLYRHGYLC APAVVDELAH KLLAT