Gene Information |
Locus tag | Pnap_3735 |
Symbol | |
ID | 4686771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | + |
Start bp | 3977368 |
End bp | 3978606 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639836753 |
Product | hypothetical protein |
Protein accession | YP_983952 |
Protein GI | 121606623 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase |
TIGRFAM ID | |
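
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are both inclusive positions and that the final codon of the CDS is a stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3977368
end_bp = 3978606
gene_length_bp = 1239
protein_length_aa = 412


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the listed Gene Length (1239 bp) and Protein Length (412 aa).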
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.756384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0338258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCC ACACACCTAC TTACACTCCC CTCCATGGCC ATCGCACAGG CTCCCGCCTG GCCACCACGA TGCGCTGCAC CACCTTGCTC GTTGCGATGG CAGGCGCCTT CACGGCCCAT GGCCGCACCC TGACCGTCAT TGGCGACACG CAGCCCGCAG GCAGCACCTG GCAAGACAGT TCCGCCACGG AATCCAGCTC CACCAGGACG TGGCAACGGA GAAAACACAC AAACACACAA ACGCCAACAA CGCCGACTCC AACGCCAACG CCTACGCCTA CGCCTACGCC TACGCCTACG CCTACGCCTA CGCCTACGCC TACGCCTACG CCTACGCCTA CGCCCACCCC AACGCCCCCA GCAACGACTT CCAGCCGAGG CTTCCCCACC GCCGGTCCGT GGGCGTCCTT CTATGGGTCG GCGGACAGCA TCGATCTGCC CAAATTGGCG GCGACGTACC GCATCCTGGA CATTGACGCC GACCCGGACA TGGGCAACTT CAGCGTCAGC CAGATCAAGA CGCTCAAGAA CGGCGGCGCC AACAAGGTGT TGAGCTACCT GAACCTGGGC TCGTGCGAAA ACTTCCGTGG CTACTGGTCA AAGGTGCCGT CGGGATTCCT CTCATGCTCG GCCAACAAGG CGGCGCAATT GGGTACCTAC TCGGGTTACA GCAACGAGGT CTGGATGAAT GTCGGCAATG CCGCTTACCA AAACCTGGTC ATCAACTACA TCGTGCCCCG GCTCGCCGCC CAGGGCGTGG ACGGTTTTTA CTTCGACAAC ATGGAAATCG TCGAGCACGG AACGAACACC AAGAACGGCC CCTGCGACGC CCAGTGCAGC CAGGGCGGCC TCGACCTGAT TGCCAAGCTG CGCGACAAGT ACCCTTCGAT GCTCTTCGTG CTGCAGAACG CGACCAGCGA CAAGACGCGC CTCGGCCGGG CAACGGGCGC ATCCGGCACA GTCGCCTTCC CGAGCCTGCT CGACGGCATC GCGCACGAAG AGGTGTACAA GCCCGTCCAT GACACGTCCG TCGAAGCCGA ACTGGTCAGC TGGTCGGGCA TGAACCTGAT GCCGGGCGGC CGCAAGTTCT GGATCGGAAC GCTGGACTAC GCCAGCAGCT GCACCAACAC CAGCGCAGCC CAGTCGGCCT TCCAGTCCAG CCGTGCGCGC GGCTTCTCGC CTTCAGTCTC CGACTCCAGC GCGGGACAGC AGACCGTGTG TTACTGGCCT GCTTTTTAA
|
Protein sequence | MSTHTPTYTP LHGHRTGSRL ATTMRCTTLL VAMAGAFTAH GRTLTVIGDT QPAGSTWQDS SATESSSTRT WQRRKHTNTQ TPTTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPP ATTSSRGFPT AGPWASFYGS ADSIDLPKLA ATYRILDIDA DPDMGNFSVS QIKTLKNGGA NKVLSYLNLG SCENFRGYWS KVPSGFLSCS ANKAAQLGTY SGYSNEVWMN VGNAAYQNLV INYIVPRLAA QGVDGFYFDN MEIVEHGTNT KNGPCDAQCS QGGLDLIAKL RDKYPSMLFV LQNATSDKTR LGRATGASGT VAFPSLLDGI AHEEVYKPVH DTSVEAELVS WSGMNLMPGG RKFWIGTLDY ASSCTNTSAA QSAFQSSRAR GFSPSVSDSS AGQQTVCYWP AF
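
Both sequences are printed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The following is a minimal sketch that strips the spacing, computes a GC fraction, and translates codons. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two tables differ in permitted start codons). The names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers, and only the first three blocks of the gene sequence are used here as an example.

```python
# Codon table in TCAG order; amino-acid assignments of the standard code,
# assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean_sequence(raw: str) -> str:
    """Remove the block spacing (and any other whitespace) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = clean_sequence("ATGTCCACCC ACACACCTAC TTACACTCCC")
print(len(prefix), translate(prefix))
```

Translating this prefix gives the first block of the listed protein sequence, MSTHTPTYTP. Applying `gc_fraction` to the full cleaned gene sequence would allow a comparison with the GC content field of the record.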