Gene Pnap_4030 details


Gene Information

Locus tag: Pnap_4030
Symbol:
ID: 4689442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 4296168
End bp: 4297205
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 69%
IMG OID: 639837044
Product: biotin synthase
Protein accession: YP_984243
Protein GI: 121606914
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
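
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4296168, 4297205)
print(length)                     # 1038
print(protein_length_aa(length))  # 345
```

Both values agree with the Gene Length and Protein Length fields listed for Pnap_4030.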


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.959642
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCCA TCACCACCAT TCCCCTGTCC ACGCTGCGTT CCTCCCTGCC GGCCCGCCCG 
GATGCCGCCG CCGCGCCGCA GCGCTGGCGC GTCGCCGACA TCGAAGCCCT GTACGCCTTG
CCCTTCATGG ACCTGCTGTT TCGCGCCCAG CAGGTGCACC GCGCGAACTT CGACGCCAAC
CAGGTGCAGC TCTCGACGCT GCTGTCGATC AAGACCGGCG GCTGCGCCGA GGACTGCGGC
TACTGCCCGC AATCGTCCCA TTTCGAAACC GAGGTGAAGG CCAGCAAGCT GATGGCGCTC
GACGAGGTGA TGGCCGCCGC GCAGGCCGCC AAGGACCAGG GCGCGACGCG CTTTTGCATG
GGCGCGGCCT GGAGCCGCCC GAAAGAGCGC GACATGGAGC GCGTCACCGA GATGGTGCGC
GAAGTGCGCG GCCTGGGGCT GGAAACCTGC ATGACGCTGG GCATGCTGGA GGCCGAGCAG
GCGCAGGCCT TGAAAGACGC GGGCCTCGAC TACTACAACC ACAACCTCGA CAGCTCGCCC
GAGTTCTACG GCAGCATCAT CAGCACCCGC ACCTACCAGG ACCGGCTCGA CACGCTGGAG
AATGTGCGCG GCGCGGGCAT CAACGTCTGC TGCGGCGGCA TTGTCGGCAT GGGCGAAAGC
CGTGCGCAGC GCGCCGGGCT GGTCGCGCAG CTGGCCAACC TGGAGCCGTA TCCGGAGTCG
GTGCCGATCA ACAACCTGGT GGCGGTCGAA GGCACGCCGC TGGCCGACAC GCCGCCGCTG
GACCCGTTCG AGTTCGTTCG CACGATTGCC GTGGCGCGCA TCACCATGCC GCGCACCATG
GTCCGGCTGT CGGCCGGGCG CGAGCAGATG GATGAAGCCC TGCAGGCGCT GTGCTTCATG
GCCGGCGCCA ACTCGATCTT CTACGGCGAC CGGCTGCTGA CCACCAGCAA CCCGCAGGCC
GACAAGGACC GCCAGCTGTT CGCGCGCCTG GGCCTGAAGG TGCAGGGCGA GCGCCCCGCC
GCCACGGTGC AAGGCTGA
 
Protein sequence
MTSITTIPLS TLRSSLPARP DAAAAPQRWR VADIEALYAL PFMDLLFRAQ QVHRANFDAN 
QVQLSTLLSI KTGGCAEDCG YCPQSSHFET EVKASKLMAL DEVMAAAQAA KDQGATRFCM
GAAWSRPKER DMERVTEMVR EVRGLGLETC MTLGMLEAEQ AQALKDAGLD YYNHNLDSSP
EFYGSIISTR TYQDRLDTLE NVRGAGINVC CGGIVGMGES RAQRAGLVAQ LANLEPYPES
VPINNLVAVE GTPLADTPPL DPFEFVRTIA VARITMPRTM VRLSAGREQM DEALQALCFM
AGANSIFYGD RLLTTSNPQA DKDRQLFARL GLKVQGERPA ATVQG