Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pden_1047 |
Symbol | |
ID | 4578628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008686 |
Strand | + |
Start bp | 1025338 |
End bp | 1026639 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639768369 |
Product | extracellular solute-binding protein |
Protein accession | YP_914854 |
Protein GI | 119383798 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGAT TCCAGTCCCG TTCCGACTTT ACCCGCCGGC ACATGCTCAA GACGCTGACG GCCCTGACGG CTGCCTTTGC GGCCGCGCCG CGCGGCGCCT TTGCGCAATC CGGGGCGGCG CTCAACATTC TCAACAGCAA TACCGCATGG GCGGAGGCTC TGACCGCATC GGTCGCCGAT GCCTATGACA GCGCCCGGAT CACCGGCGAG GCCAATCCCT ATGAGGCACA TTACGAAAAG CTGCTGATCG AGCTTAGCCA GGGTTCCTCG ACCTTCGACC TGTTCACCAC CGACAATCTC TGGATCCGCC AACCGATCCG CAACGGCTGG GCTGCGCCGC TGGACGAGAT CCGGGCGGGC AACGCCGAAC TGCCCGAACT GCAACTGCAC AACTTCGCAC CCGCCTCGCT GACCTATACC GAGTACGAGG GAAAACGCTG GGGCTTGCCG CTGGTGATGA CCACGCCGGT CTTCGTCTAT CGCAAGGATC TGCTGGAAGC GGCCGGCATC GAGGTGCCGA AGAACTGGGA CGATTATCGC GAGGCGGCGG CGAAGCTGCA TTCCTCGGAT GTCGCGGGCA ACGTGTTGCT GCTGGGAGGC CAGGACGCGC ACATGAGCGG TGACTGGGGG TCGCGCCTGA TGGGCATGAC CAAGATCGCC CCCAATGACG ACGGCGTGCT GGACGAAGCC AACAAGCCGG TTTTCAACAG CGAAGGGCAG GGGGCCCGCG CCATCGAGCG GCTGCGCGAG GTGCTGCCCT ATACCCCGAA CGGGGTCGAG GGGCTGGACT ATGCCGAGGG CTCGTCGTTG ATGCAGCAGG GCCGCGCCGC CATGATGATC ACCTGGTCGG ACGTGATCGT CGGCATCGAG GATGGGCCGA TGAAGGGCCG TTTCGGCTAT ACCGTCGCCC CGACCGAGCG ATATGAGCAG CAGATGGTTG GTGGCTGGTC GATCATGGCC AACGCAGCCT CCGGGCAACT CGAGGATGCC TATCGCTTCC TCGCCTGGAT GAGCGAGGGC AAGGCCTATG AGTTGTTCCG CGAGGGCGGG GAATCCTCGC TTTGCCTGCA GCGCGACATC GACAACCCCC AGACGGTCGA GGCGATTCCC ATGTTGCAAG CCTTCCACGA CTTCGAGACC CGCGGCACCG CGCCGATCTC GATCCCGCCC TATCGGTTGC AGAACGCCGT CGAGGTGCAG CGCGTGCTTT ACGAAGAGAT CCTGGCGGGC GTGAACGGCC GCAAGACCCC CGAACAGGCG ATGGCGGATG CCGAATCGCG CCTGGCGCGG GTCATTCGCT GA
|
Protein sequence | MTRFQSRSDF TRRHMLKTLT ALTAAFAAAP RGAFAQSGAA LNILNSNTAW AEALTASVAD AYDSARITGE ANPYEAHYEK LLIELSQGSS TFDLFTTDNL WIRQPIRNGW AAPLDEIRAG NAELPELQLH NFAPASLTYT EYEGKRWGLP LVMTTPVFVY RKDLLEAAGI EVPKNWDDYR EAAAKLHSSD VAGNVLLLGG QDAHMSGDWG SRLMGMTKIA PNDDGVLDEA NKPVFNSEGQ GARAIERLRE VLPYTPNGVE GLDYAEGSSL MQQGRAAMMI TWSDVIVGIE DGPMKGRFGY TVAPTERYEQ QMVGGWSIMA NAASGQLEDA YRFLAWMSEG KAYELFREGG ESSLCLQRDI DNPQTVEAIP MLQAFHDFET RGTAPISIPP YRLQNAVEVQ RVLYEEILAG VNGRKTPEQA MADAESRLAR VIR
|
| |