Gene Pden_5072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_5072 
Symbol 
ID4583633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008688 
Strand
Start bp584405 
End bp585964 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content66% 
IMG OID639772375 
Productextracellular solute-binding protein 
Protein accessionYP_918828 
Protein GI119387794 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.236333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.758542 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTCA ATCCGACGGG GCAGGGGCTG ACCAGGCGGG GCTTGCTGGC CGGCGCGGCA 
GGCATGGCGG CGGCGGGGCT GATCCTGCCG CGCGGCGCGC GCGCGCAGGA GGCGCGGCGC
GGCGGCAGGC TGCGCATCGG CCATCTGGGC GGCGCGACCT CGGACACGCT GGACCCGGCG
ACCTATGCCG CGGGGCCGGT GGTGACGGCG ATGCTGGCGG TCTGCAACAA CCTGGTCGAG
ATCGACGCCA AGGGCCAGGC CGTGCCCGAA CTGGCCGAGT TCGAGCCCGA TGCCGAGGCC
CGCGTCTGGA CCTTCCGCCT GAAGGACGGC GTCACCTTCT CGGACGGGCG CAAGCTCACG
GCCAGGGACG TGATCGCGTC CTTCGACCAT CATCGCGGCG CCGATACCAA GTCCGGCGCC
AAGGGCTCGC TTGAGCAGGT CAAGGAAATC CGCGCCGATG GCGACAATGT CGTGGTCTTC
GAACTGACCT CGGGCAATGC CGATTTCGCC TATCTGACCT CGGACTATCA CTTCGTCATC
ATGCCGGCGA ACGAGGACGG CACGCTGGAC TGGCAGTCGG GTCTGGGCAC CGGCGGCTAT
GTGCTGGAGA ACTTCGAGCC GGGCGTGCGC ATCACGCTCA AGCGCCGCGA CGACTACTGG
AAGCCCGACC GCGCCTGGTT CGACGAGGCG GTGCTGCTGA CCATCAACGA TGCCACCGCC
CGGCAGAATG CGCTGATGAC CGGCGAGGTC GATGTCATCA ACTCGCCCGA CCTGGCTACC
CTGCACCTGT TGCAGCGCCG GCCGGGCCTG CAACTGGTCG AGGTGACGGG GACCGCGCAT
TACACCATGC CGATGTTCTG CGACCAGGCG CCCTTTACCG ATCCGAACCT GCGGTTGGCG
CTGAAATACG CCATCGACCG GCAGGAGGTG CTGGACAAGG TGCTGCGCGG CCATGGCCAG
ATCGCCAATG ACAGTCCCAT CGCGCCGGCG AACCGCTTCT TTGCCGCCGA CCTGCCGCAG
CGGGCCTATG ACCCGGACAA GGCGAAGCAT TACCTGAAAC AGGCCGGCAT GGAGGGGCTG
AAGGTCGAGA TTTCCGCCTC GGACGCGGCG TCGGTCGGGG CGCTGGACAT GGTGCAGCTG
TTCCAGCAAT CGGCCAAGGC CGCTGGAATC GACCTGACCG TCAAGCGCGA GCCGGACGAC
GGCTATTGGT CGAATGTCTG GCTGAAGAAG CCCTTTTGCG TCAGCTACTG GAACGGCCGC
CCGACCGAGG ACGACATGTT CAGCCTGGTC TATGCCCGGG GCGCCGAGTG GAACGAAAGC
CACTGGGACA ACGAGCGATT CAACGAACTG CTGCTGAAGG CGCGGGCCGA GCTGGACGAA
GGCCTGCGCG CCGAGATGTA TCGCGAGATG CAGGGGCTGG TTTCCGAGGA CGGCGGCACC
ATCATCCCGA TTTTCGTGAA CTATATCGAC GTGGCCAATG ACAAGGTGGC GCATGGCGAG
GTGGCGTCGA ACCGCTTCCT CGACGGCTGG AAGATCGTGG AACGGTGGTG GCAGGCATGA
 
Protein sequence
MSFNPTGQGL TRRGLLAGAA GMAAAGLILP RGARAQEARR GGRLRIGHLG GATSDTLDPA 
TYAAGPVVTA MLAVCNNLVE IDAKGQAVPE LAEFEPDAEA RVWTFRLKDG VTFSDGRKLT
ARDVIASFDH HRGADTKSGA KGSLEQVKEI RADGDNVVVF ELTSGNADFA YLTSDYHFVI
MPANEDGTLD WQSGLGTGGY VLENFEPGVR ITLKRRDDYW KPDRAWFDEA VLLTINDATA
RQNALMTGEV DVINSPDLAT LHLLQRRPGL QLVEVTGTAH YTMPMFCDQA PFTDPNLRLA
LKYAIDRQEV LDKVLRGHGQ IANDSPIAPA NRFFAADLPQ RAYDPDKAKH YLKQAGMEGL
KVEISASDAA SVGALDMVQL FQQSAKAAGI DLTVKREPDD GYWSNVWLKK PFCVSYWNGR
PTEDDMFSLV YARGAEWNES HWDNERFNEL LLKARAELDE GLRAEMYREM QGLVSEDGGT
IIPIFVNYID VANDKVAHGE VASNRFLDGW KIVERWWQA