Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnuc_0801 |
Symbol | |
ID | 5052203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 |
Kingdom | Bacteria |
Replicon accession | NC_009379 |
Strand | + |
Start bp | 795830 |
End bp | 797185 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640470958 |
Product | oxidoreductase, molybdopterin binding |
Protein accession | YP_001155583 |
Protein GI | 145588986 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.325021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGATC CAAAACGAAG AGGTCAAATT CAGAAGGCCC CTGAAAACTT TTTATCTCAA GATCTTATTG CTGATTTAAA TAAAAACGGC ATGGACAATA CACGTCGCGG CTTTATGCGA AATGGTTTCA TGGCCGCTGT AGGTGGTGCG GTAGGTTTGG GTACATCCAT GAATGCTCTT GCATCATCAG AAGGTGATCC AGCAATTCTC GAGAAGCAAG TTTGGCAGAC TACTTTAGGT AAAAACGTTG CCACAATGCC TTATGGCACA CCATCTATTT ACGAGGCGAA CTTGATTCGT CGCGAATCGC CAGGATTGAC AAGGGTATCT GCGGCATCAG TTGCATTCAC GCCACTACAA GGTTTGTTTG GAATCATTAC GCCAAATGGT TTACATTTTG AAAGGCAGCA TCAGGGTTGG TACAACATTG ATCCTGAGAA GCATCGTTTA ATGGTAAATG GTTTAGTTAA GAACAATCGT GTGTTTACGA TGAATGATTT GATGCGCTTA CCATCAGTTT CGCGCATTCA TTTTATTGAA TGTGGAGCAA ATACCGGCTT GGAGTGGGGT AATGTTGCAG TACCAACTGT GCAGTACACC CATGGAATGC TCTCTTGCTG TGAATTTACC GGCGTTCCTC TCTCAGTGCT TTTGGAGGAG TGTGGTGCTG ACCTCAAAAA AGGAAAATAT TTGTTGGCGG AAGGTGGAGA CGGTTCGGGA ATGACCCGAA CCATTAATTT AGAAGATGTC TTAAATGACG CTATTGTTGC GTGGGGCATG AACGGAGAAA TGCTCCGTCC CGAAAACGGC TTTCCTTTGC GCTTAGTCGT TCCTGGAGTT CAGGGTGTTA GTTGGGTTAA GTGGTTGCGT CGCTTAGAGG TTGGCGATAT GCCCTATGCA ACCAAGGACG AGGCTGTTCA TTACAACGAT TTAATGCCCG ATGGGCTGGC ACGTCAATAC ACCTCGATAC AGGAATGCAA GTCAGTGATC ACAACGCCTT CAGGTGGCCA GCAATTGTTG GACAAAGGCT TTTATAACGT CAGCGGATTG GCGTGGTCCG GAAGAGGAAA GATCAAGCGA GTAGATGTTT CTTTCGATGG TGGAAATAAT TGGAAAACTG CTCGCTTAGA AACCCCAGTT CTCACAAAGG CGCTCACTCG TTTCAATATT AATTGGGTGT GGGATGGTGC ACCAGCTATT TTGCAGTCAC GAGCCATTGA TGACACGGGT TACGTGCAAC CGTCAATCAA GCTCTTAAGA GATGTGCGTG GTACGCGATC TATTTATCAC AACAACGCTA TTCAATCTTG GAAAGTTGGT ACAAACGGGG AGGTTAGCAA TGTTCAAGTT GGCTAA
|
Protein sequence | MSDPKRRGQI QKAPENFLSQ DLIADLNKNG MDNTRRGFMR NGFMAAVGGA VGLGTSMNAL ASSEGDPAIL EKQVWQTTLG KNVATMPYGT PSIYEANLIR RESPGLTRVS AASVAFTPLQ GLFGIITPNG LHFERQHQGW YNIDPEKHRL MVNGLVKNNR VFTMNDLMRL PSVSRIHFIE CGANTGLEWG NVAVPTVQYT HGMLSCCEFT GVPLSVLLEE CGADLKKGKY LLAEGGDGSG MTRTINLEDV LNDAIVAWGM NGEMLRPENG FPLRLVVPGV QGVSWVKWLR RLEVGDMPYA TKDEAVHYND LMPDGLARQY TSIQECKSVI TTPSGGQQLL DKGFYNVSGL AWSGRGKIKR VDVSFDGGNN WKTARLETPV LTKALTRFNI NWVWDGAPAI LQSRAIDDTG YVQPSIKLLR DVRGTRSIYH NNAIQSWKVG TNGEVSNVQV G
|
| |