Gene Pden_1047 details

Gene Information

Locus tag: Pden_1047
Symbol:
ID: 4578628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Paracoccus denitrificans PD1222
Kingdom: Bacteria
Replicon accession: NC_008686
Strand:
Start bp: 1025338
End bp: 1026639
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 64%
IMG OID: 639768369
Product: extracellular solute-binding protein
Protein accession: YP_914854
Protein GI: 119383798
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
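
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the usual GenBank convention, not stated on this page) and that the CDS is complete with a terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1025338
END_BP = 1026639
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count of a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span is 1302 bp, which is 434 codons, or 433 residues once the stop codon is excluded.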


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCGAT TCCAGTCCCG TTCCGACTTT ACCCGCCGGC ACATGCTCAA GACGCTGACG 
GCCCTGACGG CTGCCTTTGC GGCCGCGCCG CGCGGCGCCT TTGCGCAATC CGGGGCGGCG
CTCAACATTC TCAACAGCAA TACCGCATGG GCGGAGGCTC TGACCGCATC GGTCGCCGAT
GCCTATGACA GCGCCCGGAT CACCGGCGAG GCCAATCCCT ATGAGGCACA TTACGAAAAG
CTGCTGATCG AGCTTAGCCA GGGTTCCTCG ACCTTCGACC TGTTCACCAC CGACAATCTC
TGGATCCGCC AACCGATCCG CAACGGCTGG GCTGCGCCGC TGGACGAGAT CCGGGCGGGC
AACGCCGAAC TGCCCGAACT GCAACTGCAC AACTTCGCAC CCGCCTCGCT GACCTATACC
GAGTACGAGG GAAAACGCTG GGGCTTGCCG CTGGTGATGA CCACGCCGGT CTTCGTCTAT
CGCAAGGATC TGCTGGAAGC GGCCGGCATC GAGGTGCCGA AGAACTGGGA CGATTATCGC
GAGGCGGCGG CGAAGCTGCA TTCCTCGGAT GTCGCGGGCA ACGTGTTGCT GCTGGGAGGC
CAGGACGCGC ACATGAGCGG TGACTGGGGG TCGCGCCTGA TGGGCATGAC CAAGATCGCC
CCCAATGACG ACGGCGTGCT GGACGAAGCC AACAAGCCGG TTTTCAACAG CGAAGGGCAG
GGGGCCCGCG CCATCGAGCG GCTGCGCGAG GTGCTGCCCT ATACCCCGAA CGGGGTCGAG
GGGCTGGACT ATGCCGAGGG CTCGTCGTTG ATGCAGCAGG GCCGCGCCGC CATGATGATC
ACCTGGTCGG ACGTGATCGT CGGCATCGAG GATGGGCCGA TGAAGGGCCG TTTCGGCTAT
ACCGTCGCCC CGACCGAGCG ATATGAGCAG CAGATGGTTG GTGGCTGGTC GATCATGGCC
AACGCAGCCT CCGGGCAACT CGAGGATGCC TATCGCTTCC TCGCCTGGAT GAGCGAGGGC
AAGGCCTATG AGTTGTTCCG CGAGGGCGGG GAATCCTCGC TTTGCCTGCA GCGCGACATC
GACAACCCCC AGACGGTCGA GGCGATTCCC ATGTTGCAAG CCTTCCACGA CTTCGAGACC
CGCGGCACCG CGCCGATCTC GATCCCGCCC TATCGGTTGC AGAACGCCGT CGAGGTGCAG
CGCGTGCTTT ACGAAGAGAT CCTGGCGGGC GTGAACGGCC GCAAGACCCC CGAACAGGCG
ATGGCGGATG CCGAATCGCG CCTGGCGCGG GTCATTCGCT GA
 
Protein sequence
MTRFQSRSDF TRRHMLKTLT ALTAAFAAAP RGAFAQSGAA LNILNSNTAW AEALTASVAD 
AYDSARITGE ANPYEAHYEK LLIELSQGSS TFDLFTTDNL WIRQPIRNGW AAPLDEIRAG
NAELPELQLH NFAPASLTYT EYEGKRWGLP LVMTTPVFVY RKDLLEAAGI EVPKNWDDYR
EAAAKLHSSD VAGNVLLLGG QDAHMSGDWG SRLMGMTKIA PNDDGVLDEA NKPVFNSEGQ
GARAIERLRE VLPYTPNGVE GLDYAEGSSL MQQGRAAMMI TWSDVIVGIE DGPMKGRFGY
TVAPTERYEQ QMVGGWSIMA NAASGQLEDA YRFLAWMSEG KAYELFREGG ESSLCLQRDI
DNPQTVEAIP MLQAFHDFET RGTAPISIPP YRLQNAVEVQ RVLYEEILAG VNGRKTPEQA
MADAESRLAR VIR
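
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, assuming the sequence is read on the strand shown and in frame from its first base. The codon table is built from the standard NCBI amino-acid ordering, which table 11 shares with the standard code for elongation codons. The helper names are illustrative only. Alternative start codons, which table 11 allows, are not handled specially here because this gene begins with ATG.

```python
# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code; it differs
# only in which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence listed above.
first_residues = translate("ATGACTCGAT TCCAGTCCCG TTCCGACTTT")
last_residues = translate("GTCATTCGCT GA")

print(first_residues)  # MTRFQSRSDF
print(last_residues)   # VIR
```

The two fragments reproduce the first block (MTRFQSRSDF) and the final residues (VIR) of the protein sequence. Translating the full gene sequence the same way allows a complete comparison.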