Gene Pnap_3053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3053 
Symbol 
ID4687294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3226937 
End bp3228868 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content67% 
IMG OID639836066 
Producthypothetical protein 
Protein accessionYP_983273 
Protein GI121605944 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAA ACGTATTGCG ATTCACCCAG GACATGGCCG CTGCCAGCCA GGAGGTCGCG 
CGACTGGGCG GCCGAATCGT CCAGCAGTTC AGCCCGACCG TGTTTGTGGC CGAACTGCCC
GATGGCACCG ATGAACGGGC GATGACGTCC TCGACGGATC AGCCGGCCCA GCCGCTGGAC
GCTGTTTCGC AGCTGGCCGC TGATGCCTGG ACCCAGGCCA GAGCAGCCAG GAGCGCCCGT
GCCGGCGCGG CGCGATCACC CACCGAGGGG CTCTCGTGGG ACGCGCCGGG CTACCAGCCT
CCGCGTGAGT TCGACACGCC TGAAGCCGGC GCGCGCGCCG CGCGCGAAGC CGAACTGGTG
GCCGAATCGA CCGGCACGCC GACCAGCCGC TACATGGTCG GTTCGGTGGC TGTCGGGGTC
ATCCTGGTGT CGCGCAATAC CGGGGCGGAA GTCCTGAGCG ATGCCGAGCG GGTCAAGATC
GTGCAGGAGG TTCAGGAAGG GCTGGGCTGG CTCGCCGGCG TCGAACCGCG GGCCAGGGTT
TCATTCGTGT ATGACATCCG CGTTGTCACG GTATCGTCCG CCACCGGGCC GTATGCCGGC
GTCACCGAGC CTTACGAGCG CTACGAGAGG GACTGGCGGG ACGCCGCGCT GGCCAGCATG
GGCTACGGGC CGGGGCGGGC AGGGTACCAG AAGTACGCCA ACGACCTGCG CACCAGCCGG
CATACCGACT GGGCCTACGT GGCCTTTTTC ACCAAGTACC CGCTGAACCA CTTTGCCTAC
GCCATCGTCG AGAAGGTGGT GATGAACTAT GCCAATGATG GCTGGGGGCC TGACAACATC
AACCGGGTGT TCGCCCATGA GTCCTGTCAC ATCTTCGGGG CGGCGGACGA ATACGGCTCG
TGCGCCTGCG GCACTACCAG CGGTCACCTC GCAGTGCCCA ACAGCAACTG CGTCAACTGC
TTCCCGCCAG GCACGCAACA GGCTTGCCTG ATGAACGCCA ACACCCTGTC GATGTGTGAT
TTCAGCCGGC GGCAAATCGG CTGGGACGAG CGGCTGTTCC CCAGGCCCAC GGGCTGGTCG
GGCTGGGTCG CGCTCGGCGC ACCTTCGACC GGATTTGCCG GTGCGCCAGC CGTGATCTCG
CGCAACGGCT CGGTCTGCAA CATCTACGTG CGCGGCGCCG ACAACGCGCT GTGGCAAAAG
GCCTGGTTCA ACAACGCCTG GCATGACTGG GGACGGCACA ACGACGGCGG CGTGCTCGCT
TCCGAGCCCG CGCTGGGCTC CATGGGGCCG GACCACGAGC ATGTCTTCGT GCGCGGCACC
GACAACCAGG TCTGGCAGAA ATTCTGGAAA GCCGCCAGCG GCTGGTCGGG CTGGTTTGCC
CTCGGCGCGC CTCCGGTCGG CTTCACCGGA GCGCCGGCAG TCATCTCGCG CAACGGCTCG
GTCTGCAATA TCTACGTGCG CGGCGCCGAC AACGCGCTGT GGCAAAAGGC CTGGTTCAAC
AACGCCTGGC ATGACTGGGG ACGGCACAAC GACGGCGGCG TGCTCGCTTC CGAGCCCGCG
CTGGGCTCCA TGGGGCCGGA CCACGAGCAT GTCTTCGTGC GCGGCACCGA CAACCAGGTC
TGGCAGAAAT GGTGGACAGG CGCAGGCGGC TGGTCAGGCT GGGTCGCCCT CGGCGCGCCT
GCGGGCGGAT TTGTCGGTGC GCCCTCCGTG ATCTCGCGCA ACGGCTCGGT CTGCAACATC
TACGTGCGCG GCACCGACAA CGCGCTGTGG CAGCGGGCTT ACTGGAATGG CGCCTGGCAC
GACTGGGGTC GGCACAACGA CGGCGGCGTG CTGGCTTCCG AGCCCGCGCT GGGCTCCATG
GGCCCCAACC ACGAGCATGT GTTCGTGCGC GGAACTGACA ACCAGGTCTG GCAGAAGTGG
TGGCAGGGTT GA
 
Protein sequence
MAKNVLRFTQ DMAAASQEVA RLGGRIVQQF SPTVFVAELP DGTDERAMTS STDQPAQPLD 
AVSQLAADAW TQARAARSAR AGAARSPTEG LSWDAPGYQP PREFDTPEAG ARAAREAELV
AESTGTPTSR YMVGSVAVGV ILVSRNTGAE VLSDAERVKI VQEVQEGLGW LAGVEPRARV
SFVYDIRVVT VSSATGPYAG VTEPYERYER DWRDAALASM GYGPGRAGYQ KYANDLRTSR
HTDWAYVAFF TKYPLNHFAY AIVEKVVMNY ANDGWGPDNI NRVFAHESCH IFGAADEYGS
CACGTTSGHL AVPNSNCVNC FPPGTQQACL MNANTLSMCD FSRRQIGWDE RLFPRPTGWS
GWVALGAPST GFAGAPAVIS RNGSVCNIYV RGADNALWQK AWFNNAWHDW GRHNDGGVLA
SEPALGSMGP DHEHVFVRGT DNQVWQKFWK AASGWSGWFA LGAPPVGFTG APAVISRNGS
VCNIYVRGAD NALWQKAWFN NAWHDWGRHN DGGVLASEPA LGSMGPDHEH VFVRGTDNQV
WQKWWTGAGG WSGWVALGAP AGGFVGAPSV ISRNGSVCNI YVRGTDNALW QRAYWNGAWH
DWGRHNDGGV LASEPALGSM GPNHEHVFVR GTDNQVWQKW WQG