Gene Pden_5038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_5038 
Symbol 
ID4583599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008688 
Strand
Start bp550003 
End bp551244 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content70% 
IMG OID639772341 
Productcapsule polysaccharide export protein-like 
Protein accessionYP_918794 
Protein GI119387760 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID[TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAAC CCGCCGCGAA GCCGGCCCCG AAACCTGCCG CGAAGCCCGC CGCAAAGCCT 
CTGGCCCGGG TCCGGCCCCA GCAGCCGGCC GCCGCCCCCG ACCAGCCGGT GCGCCCGACC
GTCGGCATCG CCGAGCCGCG CCGCCGGCAC TGGCTGATCC TGGGCTCGTT CCTGGCGCTG
GTGCTGCTGC CCAGCCTGCT CTGGGCCGCC TATCTCTGGC TGCGCGCCGA GGACCAGTAT
GTCTCGACCG TCGGCTTCTC GGTGCGCAAG GAACAGTCCA CCCCCTCGAT CGACCTGCTG
GGCGGGCTGG CGCCGCTGGC CGGCGGCGCC AGCGCCTCGG ACACCGACAT CCTCTACGAA
TACATCCGCA GCCAGGACAT GGTCGAAAGG ATCGACGCCC GGCTCGACCT GCGCGCGCGC
TTCTCGGGGC CTTGGCCGCA TGACTTCGTC TTCGCCTTCG ACCCCGAGGG CCATGTCGAG
GACCTGACCG ACTACTGGCA GCGCCAGGTC AAGGTGCTCT ACGACAGCAC CACCCAGCTG
ATCACGCTGA AGATCAGCGC CTTCACCGCC CGGGACGCGC AGGAGATCGC CGCGGCGGTG
TTCCAGGAAA GCTCGGACAA GATCAACCAG CTCTCGACCA TCGCCCAGGA CGACGCCACC
CGGCTGGCCA AGGCCGAGCT GGACAAGGCC CGCGCCGAGC TGACCGAGAC CCGCCAGGCG
ATGACCGCCT TCCGCATGCG CTCGCAGATC GTCGATCCCG AGGCGGACCT GGCCGGGCAG
ATGGGGGTGC TGAGCGGGCT GCAGGCGCAG CTGGCCGAGG TGCTGGTCGC CCACGACCTG
CTTCTGGACA ACGCCCAGCC CACCGACCAC CGCGTCACCC AGTCGCAGCA GAAGATCGAC
GCCCTGCGCC GGCTGATCGA CGCCGAGCGC GCCAAGTTCG GCGCCGAGGG CAAGGGACCG
GCGGGCGAGA GCTATGCCCA GCTGATGGCC GAATACGAAA AGCTGGCCGT GGACCGCGAA
TTCGCCGAGG GCGCCTATCG CTCGGCCCGC ATCGCCCATG AGACGGCGCT GGCCGAGGCA
CAGCGCCAGG CGCGCTACCT GGCCGCCCAT ATCGAGCCCA AGGTGGCGCA AAGCGCGACC
GAGCCGAACC GGCCCTGGCT CTGGGCGATG ATCACCGGCA TGCTGCTGGC GGGCTGGTCG
ATCCTGGTGC TGATCTATTA CAGCGTCCGC GACCGCCGCT AG
 
Protein sequence
MAQPAAKPAP KPAAKPAAKP LARVRPQQPA AAPDQPVRPT VGIAEPRRRH WLILGSFLAL 
VLLPSLLWAA YLWLRAEDQY VSTVGFSVRK EQSTPSIDLL GGLAPLAGGA SASDTDILYE
YIRSQDMVER IDARLDLRAR FSGPWPHDFV FAFDPEGHVE DLTDYWQRQV KVLYDSTTQL
ITLKISAFTA RDAQEIAAAV FQESSDKINQ LSTIAQDDAT RLAKAELDKA RAELTETRQA
MTAFRMRSQI VDPEADLAGQ MGVLSGLQAQ LAEVLVAHDL LLDNAQPTDH RVTQSQQKID
ALRRLIDAER AKFGAEGKGP AGESYAQLMA EYEKLAVDRE FAEGAYRSAR IAHETALAEA
QRQARYLAAH IEPKVAQSAT EPNRPWLWAM ITGMLLAGWS ILVLIYYSVR DRR