Gene Pden_0239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_0239 
Symbol 
ID4580819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008686 
Strand
Start bp219738 
End bp221327 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content64% 
IMG OID639767554 
Productextracellular solute-binding protein 
Protein accessionYP_914049 
Protein GI119382993 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.884064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATCA ATCGTCGCAG AATCCTGCAC ATGTCCGTCG CGGCGGCAGC GGCAAGCCTT 
GCCGGTCCGG GACTGCTGCG CGCGCAGGGA TCGGACCGCC CGATCCGCAT CGCGCTTGCC
GCCACATCGC CGCGCGCGAT CGACCCGATC TTCAGCACGC TCGGGGCCGA CAACTGGGTC
AACCTCCAAG TCTATGAGCA TCTCGTCTCG CCGCCGAATG GCAGATTCGC GACCCAGGAC
GACGAATATC GCCCGATGCT TGCCGAAAGC TGGACCAAGT CCGACGATGC CCGGACATGG
ACCTTCAAGC TGCGCAGGGA TGTCGCGTTC CACGGCGGCC ATGGCACGCT GACCCCCGAG
GACGTGGTGT TCACCTTCGA ACGCGCCATG CGTGAAGGCA TCGCCACGGC ATCCTATGCC
AATGTGCGGG GCGTCGCCGC CTCGGGCCCC GACGAGGTGA CGTTCACGCT GAACGGGCCG
GATCCGTTTT TCCTGGGGGG CGTGATCTCG ATCCCCAGTT CGCTGATCGT GTGCAAGAGC
GCGGTCGAGG AGAAGGGCGA GAATTTCGGC AAGGAACCCG TCGGCACCGG CCCCTATGCG
GTCGAACGCA TGTCAAGCAG CGGCGTCAAT ACCCTGCGCT TCGACGGCTA TTGGGGCGAG
CCTCCGAAGA CCGCGCGGAT CGACTTCCTC TATACCTCGG ATACGACCTC GCGGACGCTT
TCGCTGATGT CGGGCGACGT GGACATGATC GAGGCGGTGC GCGCGCCCGG ATGGGTGCAG
CAGATCCGGC AGCAGAACGA CGCGCTGATC GTCGATCAGA CGCAGCCGGG ATCCTTCAAC
ACCCTTTTCT TCAACCTGAC CAAGGCACCC TTCGACAATC CGCTGGTGCG CAAGGCGGTC
GCCACCGCGA TCGACAGCGC GGTGGTCGCC CAGGCGCTGG CCCCGTTCGG CGCACAGACA
TGGACGCTCT CGCCGCCCGA CTATCCCTCT GGCTGGGCCG CGGAAGATCT GCCCGAGGAT
CTGCGCTATG ATTACGATCC CGATCGCGCG CGCGAATTGC TGGCCGAGGC CGGGCACGGA
AACGGGCTGA ACTTCACCGC CAGCATCAGC CAACGCGAGG ATTACCGCTC GATCATGCTG
ATCCTGCAAG AGCTGATGCG CCCGGCCGGG ATCAACATGA ACCTGAACAT CATGGATCAC
GCGGCCTTCC ACGGCGCCAA TCGGCAGGAT GCCAACTCGC TGGTGCTGTA TTCGCAAAGC
CTTCCCCCGG TGCCGCTGGA ATACATGTCG CGCTACCTGT CCTCTGCGGC GGTGGTGAAA
TCCGATGGGA CCGGCGGCGA CAACTTCAGC CATTACGGCA TCGCCATGCC CGGCGTGGAC
GACCGGATCG AGGCGATGCG CCAGGCCACC ACGGTCGAGG AATATTCCTC GATCGGACGC
GAGATCGAGA AGAAGGTGCA AGAGGATCTG CCCCTTATGG GGGTCGGCAA CCTTGGCTAT
GCCATCGTCC GCAATCCCGC AGTCGATATC GGCTACCAGG TCGAAAGCGG CTATGCGCGC
TGGCGCCTGG ATCTGGCGCA GCGCGCCTGA
 
Protein sequence
MLINRRRILH MSVAAAAASL AGPGLLRAQG SDRPIRIALA ATSPRAIDPI FSTLGADNWV 
NLQVYEHLVS PPNGRFATQD DEYRPMLAES WTKSDDARTW TFKLRRDVAF HGGHGTLTPE
DVVFTFERAM REGIATASYA NVRGVAASGP DEVTFTLNGP DPFFLGGVIS IPSSLIVCKS
AVEEKGENFG KEPVGTGPYA VERMSSSGVN TLRFDGYWGE PPKTARIDFL YTSDTTSRTL
SLMSGDVDMI EAVRAPGWVQ QIRQQNDALI VDQTQPGSFN TLFFNLTKAP FDNPLVRKAV
ATAIDSAVVA QALAPFGAQT WTLSPPDYPS GWAAEDLPED LRYDYDPDRA RELLAEAGHG
NGLNFTASIS QREDYRSIML ILQELMRPAG INMNLNIMDH AAFHGANRQD ANSLVLYSQS
LPPVPLEYMS RYLSSAAVVK SDGTGGDNFS HYGIAMPGVD DRIEAMRQAT TVEEYSSIGR
EIEKKVQEDL PLMGVGNLGY AIVRNPAVDI GYQVESGYAR WRLDLAQRA