Gene Pnap_4924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4924 
Symbol 
ID4685774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008759 
Strand
Start bp111722 
End bp114829 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content60% 
IMG OID639826566 
Producthypothetical protein 
Protein accessionYP_973730 
Protein GI121583294 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAA TAGCCAACGT CCACACCGAA CGCGTCTTCG AAAACGAGCT TTGCGAGCAC 
CTGGCCGCCA ACGGCTGGTC CGTGCGCACC CACCTGCAGG ATGCCAAGTC CTACAGCCGT
GAGCTGGCCC TCTTCCCCGA AGACCTGATT GCCTTCGTCC AGGAGACGCA GCCCAAGGAG
TGGGCCAAGT TCAAGCAGTG GCACAACGGC CAGTCCGAGG CGATGTTCAC CAAGCGTGTG
GCCGACCAGC TCAACAGGCA CGGCACCCTG CACCTGCTGC GCCACGGGTT CAAGGACTCG
GGCAGCAACG GAACCACCAA GTTCTTCCTG TGCCAGTTCC GCCCGGCGCA CAAGAAAAAC
ATGCAGCTGT GGGAGATGTA TGAGAAGAAC CGCCTGACGG CCATCCGCCA GTTGCACTAC
TCGCTGCACA ACGAAAAAAG CTTTGACATG GTGCTCTTCG TCAACGGCTT GCCGGTGGCC
ACCACAGAGC TCAAGACCGA CATGACGCAG TCCATCAAGG ACGCCATCGA CCAGTACAAG
AAGGACCGCC TGCCCCGCGA CCTCAAGTCC AAGGAGCTCG AACCCCTACT GCAGTTCAAG
ACCCGGGCCC TGGTGCACTT TGCGGTGTCG ACCGACGAGG TTTTCATGGC CACCAAGCTC
GAGGGCGACA AAACATATTT CCTGCCGTTC AACCTGGGCC GTCCGGACGG CTTTGGCGGC
GCTGGCGCTG GCAATCCCCC TGCGACCAAG GAACATGGCT ACCCGACTTG GTACCTGTGG
AAGCTGGTCT GGTCGCGCGA TGTCTGGCTC GACATCCTGG GGAACTTCCT GCACATCGAG
GTGAAGGAGG CGCAGGACAA GGCCGGCAAG AAAGTCACCA AGGAATCCTT GATTTTCCCG
CGCTTTCACC AGCTCGACGC CGTGCTGCAC CTGGTTCGCG GCGCCGGCGA CGAAGGGGTT
GGACAGGTCT ACCTGATTCA GCACTCTGCC GGCTCGGGCA AGTCCAACAC CATCGCCTGG
ACGGCGCACC GGCTCGCCAA CCTGCACAGC GCCCAAGATG ACAAGGTATT TGACACGGTC
ATCGTCATCA CGGACCGCCG GGTCCTGGAC CGCCAGCTGC AGGAAACGAT TTCGCAGTTC
GAGCACAAGG CCGGCGCCGT TCAAAAAATT GACCAGAACT CCGAGCAGCT GGCCAAGGCC
CTCAATGACG GCGTGCCGGT CATCATCACC ACGCTGCAAA AATTCCAGTT CATCCTGCAA
AAGGTCCAGG GCCTGAAGGA CAAGAAGTTC GCGCTCATCG TAGACGAGGC CCACTCCAGC
CAATCCGGGT CAGCCGCCCA GAAGCTACGC CGGGCTCTGA CGACGGACAC CAAGAAGGTA
GTGACCGTTG AGCTGGACGG CGCTCCGGCG GACGTCGATG TGGATGTGGA AATCGACCCG
GAAGACGTGA CCTCCGAGGA CATCATCAAC CAGGTGATGC TCTCGCGCCA GCGCCCGCCC
AATGTGAGCT ACTTCGCGTT CACGGCCACC CCCAAGTCCA AAACGCTTGA GCTGTTCGGC
CGGCCGGGCA CCGACGGGCT GCCCGTGCCG TTTCACGTTT ACTCGATGCG CCAGGCCATC
GAGGAAGGCT TCATCCTGGA CGTGCTCAAG CGGTACATGT CGTACAAGAC GTTCTACAAG
CTGGGCTCGA CGGCGGCCGA GAAGATGGTG CCGCAGATGA AGGCCAAGAA GACCCTGGGC
CGTTTTGCCG TCCTGCATGC CTATAACATT GCCCAAAAGA TTGTGGTCAT CGTGGAGCAC
TTTCGCGAGT TCATCGCGCC CAAGCTGGGG GGCCACGCCA AGGCCATGGT GGTGACTGAC
AGCCGGTTAG CCGCGGTGCG CTACAAGCTG GCCATGGACA AGTACCTCAA GGAGATGGGC
TACGACAAGG AGATGAAAGC GCTGGTGGCT TTTTCCGGAG AGGTGCAGGA CCCCGAGTCG
GGGCCGGATG ATTTCAGCGA ACGCAGTATG AATGCCGGCA TCAAGGGCCA GGAGCCATCC
GACGCCTTCA AGGAAGAAGT TTACCGCGTG CTGCTGGTGG CCAACAAATA CCAGACCGGG
TTCGACCAGC CGCTGCTGCA GGCGATGTAT GTCGACAAGC GCTTGTCTGG CGTGATGGCG
GTGCAGACGC TGTCCCGGCT CAACCGCATG GCGCCAGGCA AGGAAGACCC GTTCGTGCTG
GACTTCGTGA ACAAGCCTGA AGAAATTCTG GCCAGCTTCA AGCCTTACTT TCGCACGGCC
GAGGTGGAGG CCGTCACGGA CCCCAACATC GTTCACGAGC TCCAGGTTAA GTTGGACAAG
GCGAAGGTGT ACGTCCCCAA CGAGATTGAG CAATACGCCA AAGCCTTTTT CGACCCCAAG
TGCAAGCAGG CGAGCCTGAT GTATATCCTG AAACCTGCCA AGGACCGCTT CTACGACCTC
GAGGACGAAG ACGCCGAGCA GTTTCGCAAG GACCTGGGCA CCTTCTTGCG GCTGTACGAC
TTCCTGTCGC AAATCATTCC CTATGGGGAT GCGGACCTGG AAAAGCTCTA CAGCTTTGGC
AAAGGCTTGA TGCCCATGGT GGCGGCCAGG AATTCAGGCT CCAGCATCCT GGAGCTGGAC
TCGGACGTGC AGCTCACGCA CTATCGCATC CAGAAGCTGG GCGAGCAAAC GCTCAACCTC
GCCACGGGCG AGCTGGTCAA GCTCAAGCCG GCTTCCGACG CCGGCAGCGG CACCACCCAG
ACCGACGAAG AAAAGAAGCT GGCCGAAATC GTCGACAAGA TGAACGATTT GTTCTCCGGC
GAGCTGACCG AGGCCGACCT GGTGGGCTAC GTCACGACCA TCAAGGGCAA GCTGCTCGAG
AGCGAAACGC TGGCCGAGCA GGCCGTGAGC AACACCGAGC AGCAGTTCGG CATGGGCGAC
TTCAAGGACA TCATGATGGA CATCATCATT GATGGCCAGG AGAGCCACAA CAAGATTGCC
GGGCAGTTGC TCCAGGACGA CCGGACTTTT GCCATCATGC AGGGAATGCT GGCAAAAATG
GTGTTTGATG GTTTTAAGCG GGCAGCCGCG CAGGCTCAGG CCTCTTGA
 
Protein sequence
MTTIANVHTE RVFENELCEH LAANGWSVRT HLQDAKSYSR ELALFPEDLI AFVQETQPKE 
WAKFKQWHNG QSEAMFTKRV ADQLNRHGTL HLLRHGFKDS GSNGTTKFFL CQFRPAHKKN
MQLWEMYEKN RLTAIRQLHY SLHNEKSFDM VLFVNGLPVA TTELKTDMTQ SIKDAIDQYK
KDRLPRDLKS KELEPLLQFK TRALVHFAVS TDEVFMATKL EGDKTYFLPF NLGRPDGFGG
AGAGNPPATK EHGYPTWYLW KLVWSRDVWL DILGNFLHIE VKEAQDKAGK KVTKESLIFP
RFHQLDAVLH LVRGAGDEGV GQVYLIQHSA GSGKSNTIAW TAHRLANLHS AQDDKVFDTV
IVITDRRVLD RQLQETISQF EHKAGAVQKI DQNSEQLAKA LNDGVPVIIT TLQKFQFILQ
KVQGLKDKKF ALIVDEAHSS QSGSAAQKLR RALTTDTKKV VTVELDGAPA DVDVDVEIDP
EDVTSEDIIN QVMLSRQRPP NVSYFAFTAT PKSKTLELFG RPGTDGLPVP FHVYSMRQAI
EEGFILDVLK RYMSYKTFYK LGSTAAEKMV PQMKAKKTLG RFAVLHAYNI AQKIVVIVEH
FREFIAPKLG GHAKAMVVTD SRLAAVRYKL AMDKYLKEMG YDKEMKALVA FSGEVQDPES
GPDDFSERSM NAGIKGQEPS DAFKEEVYRV LLVANKYQTG FDQPLLQAMY VDKRLSGVMA
VQTLSRLNRM APGKEDPFVL DFVNKPEEIL ASFKPYFRTA EVEAVTDPNI VHELQVKLDK
AKVYVPNEIE QYAKAFFDPK CKQASLMYIL KPAKDRFYDL EDEDAEQFRK DLGTFLRLYD
FLSQIIPYGD ADLEKLYSFG KGLMPMVAAR NSGSSILELD SDVQLTHYRI QKLGEQTLNL
ATGELVKLKP ASDAGSGTTQ TDEEKKLAEI VDKMNDLFSG ELTEADLVGY VTTIKGKLLE
SETLAEQAVS NTEQQFGMGD FKDIMMDIII DGQESHNKIA GQLLQDDRTF AIMQGMLAKM
VFDGFKRAAA QAQAS