Gene Information |
Locus tag | Pnap_2938 |
Symbol | |
ID | 4686376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | - |
Start bp | 3093570 |
End bp | 3094691 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639835945 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_983158 |
Protein GI | 121605829 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
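The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3093570
end_bp = 3094691
gene_length_bp = 1122
protein_length_aa = 373


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the table.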
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.135746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCG TACGCCGCAG CTTCCTCAAG AACACCGCAG CCGCCGCCAT GGCCGCTGCC TTGCCCGCCC TTGGTTTCGC CCAGGCACCC GCGGCACCTG CTTCACGCCG CAACTTCGCT CCCCAAAGCG GCGGCTGGCG CACCTTTGAG GTCACCACCC GCGTGGACAT TCCCAAGCCC GAAGGCGTGA CCCGGGTGTG GCTGCCGATT CCGTCGGTCA ACAGCGACTA CCAGCATTCG CTCGAAAACG GATTTTCAAG CAACGGAACG GCCAAGCTGG TGCAGGACGG CCAGGACGGC GCAAAAATGC TCTACGTTGA ATTTGCTGCC AGCGAAGCCA AGCCGTTTGT CGAAATCACC AGCCGCGTGC AGACGCAGGG CCGCGCGATG GACTGGTCGC AAAAAACCGC CAAGGCCGAG GAAGCCGACA CGCTGCGCTA TTTCACCCGC GCCACCACCT TGATTCCGAC CGACGGCATC GTGCGCAAGA CCGCGCTGGC CGCCACGCAG GGCGCTAGAG GCGATGTCGA AAAAGCCCAG AAGCTCTATG ACTGGATCGT GGCCAACACC TACCGCGAAC CCAAGGTGCG CGGCTGCGGC GAAGGCGACA TCAAGACCAT GCTGGAAACC GGCAACCTGG GCGGCAAATG CGCCGACCTG AACGCGCTGT TTGTCGGCCT GTGCCGCTCG GTGGGTGTGC CCGCGCGCGA TGTGTACGGC ATCCGGCTGG TGCCATCGGC CTTTGGCTAC AAGGAGCTGT CGGGCAACCC GGCCAGCCTC AAGGGCGCGC AGCACTGCCG CTCCGAGGTG TACTTGAAGG GCTATGGCTG GGTGGCGATG GACCCGGCCG ACGTGGCCAA GGTCATGCGC CTGGAAACCG CCGACTGGAT CAAGAACACC ACCAACCCGG TGGTCGCGCC GGTCAACAAG GCGCTGTTCG GCGGCTGGGA AGGCAACTGG ATGGCCTACA ACACCGCGCA CGATGTGGCC TTGCCCAATT CCAAGGGCGA CAAGCTCGGT TTCCTGATGT ACCCAGTTGG CGAGAATGCC GCCGGCCGCT TCGACTCCTA CGCGCCGGAT GACTTCAAGT ACCAGATCAC CGCCAGGGAA ATCAAGGCCT GA
|
Protein sequence | MTTVRRSFLK NTAAAAMAAA LPALGFAQAP AAPASRRNFA PQSGGWRTFE VTTRVDIPKP EGVTRVWLPI PSVNSDYQHS LENGFSSNGT AKLVQDGQDG AKMLYVEFAA SEAKPFVEIT SRVQTQGRAM DWSQKTAKAE EADTLRYFTR ATTLIPTDGI VRKTALAATQ GARGDVEKAQ KLYDWIVANT YREPKVRGCG EGDIKTMLET GNLGGKCADL NALFVGLCRS VGVPARDVYG IRLVPSAFGY KELSGNPASL KGAQHCRSEV YLKGYGWVAM DPADVAKVMR LETADWIKNT TNPVVAPVNK ALFGGWEGNW MAYNTAHDVA LPNSKGDKLG FLMYPVGENA AGRFDSYAPD DFKYQITARE IKA
|
| |
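As a rough illustration of how the Gene sequence row relates to the Protein sequence row and the GC content field, the following sketch strips the 10-base spacing, computes a GC fraction, and translates codons. It assumes the gene sequence is shown in coding orientation (5' to 3') and uses the standard amino-acid assignments, which translation table 11 shares with the standard code. The helper names are hypothetical, and only the first 30 bases of the record are used here.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; a stop codon is rendered as '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First three 10-base blocks of the Gene sequence row.
prefix = "ATGACCACCG TACGCCGCAG CTTCCTCAAG"
prefix_protein = translate(prefix)
prefix_gc = gc_fraction(prefix)
print(prefix_protein)
```

The translated prefix matches the first ten residues of the Protein sequence row (MTTVRRSFLK). Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.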