Gene Pnap_2938 details


Gene Information

Locus tag: Pnap_2938
Symbol:
ID: 4686376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3093570
End bp: 3094691
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 64%
IMG OID: 639835945
Product: transglutaminase domain-containing protein
Protein accession: YP_983158
Protein GI: 121605829
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
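
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3093570
end_bp = 3094691
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the three checks agree with the listed values (1122 bp, 374 codons, 373 aa).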


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.135746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCG TACGCCGCAG CTTCCTCAAG AACACCGCAG CCGCCGCCAT GGCCGCTGCC 
TTGCCCGCCC TTGGTTTCGC CCAGGCACCC GCGGCACCTG CTTCACGCCG CAACTTCGCT
CCCCAAAGCG GCGGCTGGCG CACCTTTGAG GTCACCACCC GCGTGGACAT TCCCAAGCCC
GAAGGCGTGA CCCGGGTGTG GCTGCCGATT CCGTCGGTCA ACAGCGACTA CCAGCATTCG
CTCGAAAACG GATTTTCAAG CAACGGAACG GCCAAGCTGG TGCAGGACGG CCAGGACGGC
GCAAAAATGC TCTACGTTGA ATTTGCTGCC AGCGAAGCCA AGCCGTTTGT CGAAATCACC
AGCCGCGTGC AGACGCAGGG CCGCGCGATG GACTGGTCGC AAAAAACCGC CAAGGCCGAG
GAAGCCGACA CGCTGCGCTA TTTCACCCGC GCCACCACCT TGATTCCGAC CGACGGCATC
GTGCGCAAGA CCGCGCTGGC CGCCACGCAG GGCGCTAGAG GCGATGTCGA AAAAGCCCAG
AAGCTCTATG ACTGGATCGT GGCCAACACC TACCGCGAAC CCAAGGTGCG CGGCTGCGGC
GAAGGCGACA TCAAGACCAT GCTGGAAACC GGCAACCTGG GCGGCAAATG CGCCGACCTG
AACGCGCTGT TTGTCGGCCT GTGCCGCTCG GTGGGTGTGC CCGCGCGCGA TGTGTACGGC
ATCCGGCTGG TGCCATCGGC CTTTGGCTAC AAGGAGCTGT CGGGCAACCC GGCCAGCCTC
AAGGGCGCGC AGCACTGCCG CTCCGAGGTG TACTTGAAGG GCTATGGCTG GGTGGCGATG
GACCCGGCCG ACGTGGCCAA GGTCATGCGC CTGGAAACCG CCGACTGGAT CAAGAACACC
ACCAACCCGG TGGTCGCGCC GGTCAACAAG GCGCTGTTCG GCGGCTGGGA AGGCAACTGG
ATGGCCTACA ACACCGCGCA CGATGTGGCC TTGCCCAATT CCAAGGGCGA CAAGCTCGGT
TTCCTGATGT ACCCAGTTGG CGAGAATGCC GCCGGCCGCT TCGACTCCTA CGCGCCGGAT
GACTTCAAGT ACCAGATCAC CGCCAGGGAA ATCAAGGCCT GA
 
Protein sequence
MTTVRRSFLK NTAAAAMAAA LPALGFAQAP AAPASRRNFA PQSGGWRTFE VTTRVDIPKP 
EGVTRVWLPI PSVNSDYQHS LENGFSSNGT AKLVQDGQDG AKMLYVEFAA SEAKPFVEIT
SRVQTQGRAM DWSQKTAKAE EADTLRYFTR ATTLIPTDGI VRKTALAATQ GARGDVEKAQ
KLYDWIVANT YREPKVRGCG EGDIKTMLET GNLGGKCADL NALFVGLCRS VGVPARDVYG
IRLVPSAFGY KELSGNPASL KGAQHCRSEV YLKGYGWVAM DPADVAKVMR LETADWIKNT
TNPVVAPVNK ALFGGWEGNW MAYNTAHDVA LPNSKGDKLG FLMYPVGENA AGRFDSYAPD
DFKYQITARE IKA
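
The gene and protein sequences can be related by translating the gene sequence codon by codon. The sketch below is one way to do that in plain Python, not the method used to produce this record. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and this gene begins with ATG). The helper names `translate` and `gc_fraction` are hypothetical.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and final line of the gene sequence listed above.
first_bases = "ATGACCACCG TACGCCGCAG CTTCCTCAAG"
last_line = "GACTTCAAGT ACCAGATCAC CGCCAGGGAA ATCAAGGCCT GA"

print(translate(first_bases))  # MTTVRRSFLK
print(translate(last_line))    # DFKYQITAREIKA
```

The two printed fragments correspond to the first ten and last thirteen residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.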