Gene Pnap_0019 details

Gene Information

Locus tag: Pnap_0019
Symbol:
ID: 4687094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 16663
End bp: 18579
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 64%
IMG OID: 639833013
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_980266
Protein GI: 121602937
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
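
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(16663, 18579)
print(length_bp)                  # 1917
print(protein_length(length_bp))  # 638
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed in the record.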


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.691133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.432221
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAA CCAATCTTTC CGTCGATGCA GCGGACCTGA CCAGCCGCAT CACCCGCACG 
CCTTTTCCGG GTTCGCGCAA GATCTACATC GAAGGCTCGC GCCCCGACAT CCGCGTGCCG
TTTCGCGAAG TCACGCTCAC CGACACGCTG GTGGCCGAAG GCAGTGAAAC CCGCCGCGAG
GCCAATCCGC CGCTGCGCCT GTTCGACTCG TCGGGCGTGT ACACCGACCC GGCAGCAAGC
ATCGACATCA CGCGCGGCCT GTCTCCGCTG CGCGGCGCCT GGATCAACGA GCGCCAGGAC
ACCGAAGCCC TGCCCGGCAT CAGCAGCGCC TACGGCCGCG AGCGCCTGAA CGACCCGGCC
CTCAGCGCGC TGCGCATGGC CCACGCGCCC GTGCCGCGCC GCGCCAAGGC GGGCGCCAAC
GTGTCGCAGA TGCATTACGC GCGCCAGGGC ATCATCACGC CGGAGATGGA ATACATCGCG
ATCCGCGAAA ACCTGGTGCG CGCCCAGCTT GCCGAACGCC TGGCGACCGA GCGCGTGCCG
AAAACCGGCC ATTCGTTCGG CGCGTCGATT CCGAAAGACA TCACCGCCGA ATTCGTTCGC
GACGAAGTGG CGCGCGGCCG CGCCGTGATT CCGAACAACA TCAACCACCC CGAAACCGAG
CCGATGATCA TCGGCCGCAA CTTCCTGATC AAGGTCAACG CCAACATCGG CAACTCGGCC
GTCACCTCGT CGATTGAAGA GGAGGTGGAC AAGCTGGCCT GGTCGATCCG CTGGGGGGCC
GACACCGTGA TGGACCTCTC GACCGGCGAG AACATCCACG AAACCCGCGA ATGGATTCTG
CGCAATTCGC CGGTGCCGAT TGGCACGGTG CCGATTTACC AGGCGCTGGA AAAAGTCAAC
GGCAAGGCCG AAGACCTGAC CTGGGAAATC TTCCGCGACA CGTTGATCGA GCAGGCCGAG
CAGGGCGTGG ACTATTTCAC CATCCACGCC GGCGTGCGCC TGGCCTATGT GCCGCTGACC
GCGAACCGCC TGACCGGCAT CGTCTCGCGC GGCGGCTCGA TCATGGCGAA ATGGTGTTTG
TCGCACCACA AGGAAAGCTT TTTGTACGAG CATTTCGAGG AGATTTGCGA AATCATGAAG
GCCTACGACG TCTGCTTCTC GCTCGGCGAC GGCCTGCGCC CCGGCTCGAT TGCCGACGCC
AATGACGAAG CGCAGTTCGC CGAACTGCAC ACGCTGGGCG AACTCACGCA GATCGCCTGG
AAGCACGACG TTCAGGTGAT GATCGAAGGC CCCGGCCATG TGCCGCTGCA GCTGGTCAAG
GAAAACGTCG AGAAGCAACT CGAAGCCTGC TTTGAAGCGC CGTTCTACAC GCTTGGCCCC
TTGATCACCG ACATCTCGCC CGGCTACGAC CATATTTCGT CGGCGATGGG CGCGGCGAAT
ATCGGCTGGT ACGGCACGGC CATGCTGTGC TACGTGACGC CCAAGGAGCA TCTGGGCCTG
CCGAACCGCG ACGACGTGAA GCAGGGCCTG ATCGCCTACA AGATCGCCGC GCATGCGGGC
GACCTGGCCA AGGGCTACCC GGGCGCGCAG ATGTGGGACA ACGCGGTCAG CAAGGCGCGC
TTCGAGTTCC GCTGGGAAGA CCAGTTCCGC CTGGCGATTG ACCCCGACAC GGCGATGGCC
TACCACGATG AAACGCTGCC CAAGGAAAAC GCCAAGGTGG CGCATTTCTG CTCGATGTGC
GGGCCGAAGT TCTGCTCGAT GAAGATTTCG CAGGAAGTGC GCGAGTTTGC GCGGCTGAAT
CCGTCCACCA CGACGCTGGC CAAGGCGCCG GGCGTGATTC CGATCCAGCA GGTCAGCAGC
GGCTTCGAGG AAAAAGCCGA GGAGTTCCGC AAGGGCGGGA ACGAGATTTA CTCCTGA
 
Protein sequence
MAKTNLSVDA ADLTSRITRT PFPGSRKIYI EGSRPDIRVP FREVTLTDTL VAEGSETRRE 
ANPPLRLFDS SGVYTDPAAS IDITRGLSPL RGAWINERQD TEALPGISSA YGRERLNDPA
LSALRMAHAP VPRRAKAGAN VSQMHYARQG IITPEMEYIA IRENLVRAQL AERLATERVP
KTGHSFGASI PKDITAEFVR DEVARGRAVI PNNINHPETE PMIIGRNFLI KVNANIGNSA
VTSSIEEEVD KLAWSIRWGA DTVMDLSTGE NIHETREWIL RNSPVPIGTV PIYQALEKVN
GKAEDLTWEI FRDTLIEQAE QGVDYFTIHA GVRLAYVPLT ANRLTGIVSR GGSIMAKWCL
SHHKESFLYE HFEEICEIMK AYDVCFSLGD GLRPGSIADA NDEAQFAELH TLGELTQIAW
KHDVQVMIEG PGHVPLQLVK ENVEKQLEAC FEAPFYTLGP LITDISPGYD HISSAMGAAN
IGWYGTAMLC YVTPKEHLGL PNRDDVKQGL IAYKIAAHAG DLAKGYPGAQ MWDNAVSKAR
FEFRWEDQFR LAIDPDTAMA YHDETLPKEN AKVAHFCSMC GPKFCSMKIS QEVREFARLN
PSTTTLAKAP GVIPIQQVSS GFEEKAEEFR KGGNEIYS
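
The sequences above are displayed in blocks of ten characters separated by spaces and line breaks. A minimal sketch for stripping that layout and computing length and GC content follows; it assumes the only non-sequence characters are whitespace, and the helper names are hypothetical. The example input is just the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGGCCAAAA CCAATCTTTC")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 40.0
```

Applying the same two helpers to the full gene sequence gives values that can be compared against the Gene Length and GC content fields of the record.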