Gene Information |
Locus tag | Pden_3800 |
Symbol | |
ID | 4582351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008687 |
Strand | + |
Start bp | 936241 |
End bp | 937833 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639771109 |
Product | extracellular solute-binding protein |
Protein accession | YP_917562 |
Protein GI | 119386507 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.129084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACGA TTTCGGGACT TCGCGCCCTT GGATGGGCCG CCATGGCGGC GGGTTTGCTC CACGCCGGGC CGGCTTCGGC CGAGACGGTG CTGAACGTGG TGATGCAGGC GCCGCTGCGC ACGCTGGATC CGCATCTGAG CACGGCGCAG ATCGTGCGGA CGCATGGTTT CATGGTCTTC GACACGCTTC TGGGCATGGA TGCGGATTAC AAGCCGCAGC CGCAGATGGC GGATTACCAG GTTTCGGACG ACGGGCTGAC CTATACCTTC ACCCTGCGCG ACGGGCTGAC ATGGCATGAC GGCAGCCCGG TCGCGGCCGA GGATTGCGTC GCCTCGCTGA AACGCTGGGC CGAGAACGAT CCGGCCGGGC GCAAGATGAT GGAATATGTC GCCTCGATCG AGACGACCTC GGACAAGGTG CTGGTGCTGA CGCTGGCCCG TCCCTTCGGC CATGTGCTGG ACCTGCTGGC CAAGCCTTCG CCGGTGCCGC CCTTCATGAT GCCCAAGCGC CTGGCCGAGA CGCCCTCGGG CGAGCAGGTG GCCGAGATGG TCGGCTCGGG CCCGTTCAAG TTCGTCGCCG AGGAATACCG GCCCGGCGAT CAGGCGGTCT ATGTCAGGAA CGAGGATTAC CAGCCGCGCC CCGAGCCGGC CAGCTGGACC GCCGGCGGCA AGGTGGTGAA TGTCGACAAG GTGGTGTGGA AGGCGATGCC GGACATGCAG ACCTCGATCA ACGCGCTGCA ATCGGGCGAT GTGGATCTGA TCGAGCAGGT GACCATCGAC CTGCTGCCCC TGCTGCAAAT GAACGACGAG GTGGAATACG GCGTCATCAA CCCGCTGGGC AGCCAGGTCA CGGGGCGCTT CAACCACCGA CTGCCGCCCT TTGACGATGC CGAGATCCGC CGCGCCGCCA TGTATGCGCT GGATCAGGAG CAGTTGATGC AGACCGCCAT CGGCGACCCG GAATACTACA CGCTCTGCGC CTCGGATTAC GGCTGCGAGG TGCCGCTGGC GTCGGATGCC GGCTCGGAAT ACCTGGCCGG CAGCGCCGAG GAGCGCATGG CCAAGGCGCA GGAGATCCTG AAGGCCGCAG GCTATGACGG AACCCCGGTG CTGATGATGC AGCCGACCGA CCTGACCATC CTGTCCACCC AGCCCATCGT CGCCGCCGAG CGGCTGCGCG AGGCCGGCTT TGCGGTCGAG GTCGCCTCGA TGGACTGGGC CACGCTGCAA TCGCGCAAGA ACGGCTGGCA GCCGGTGGCC GAGGGCGGCT GGAACATGCT CTTTACCTAT TGGGGCGTGA CCGGGATCTG GAACCCGCTG GTCCATGCGC TGCTGGATGG CTCGGGCAGC GACAACGCCT GGTCCGGCTG GCCGGTCAGC CCCGAGATCG AGGCGCTGCG CGAGCAGTAT CTGGTCGCCA CCGACCTGGA CGAGCAAAAG CGCATCGCCG CCGAGATCCA GCAGATCGCC TATGACCAGG GCTTCTACTA CAATGCCGGC GAGTTCCAGT CGGTCGCCGC CTGGCGGGAC GGGATCACCG AGCTGCAACC CGGTCCGATC ACCCTGTTCT GGGGCGTGAA GAAGCCCGAA TAA
|
Protein sequence | MMTISGLRAL GWAAMAAGLL HAGPASAETV LNVVMQAPLR TLDPHLSTAQ IVRTHGFMVF DTLLGMDADY KPQPQMADYQ VSDDGLTYTF TLRDGLTWHD GSPVAAEDCV ASLKRWAEND PAGRKMMEYV ASIETTSDKV LVLTLARPFG HVLDLLAKPS PVPPFMMPKR LAETPSGEQV AEMVGSGPFK FVAEEYRPGD QAVYVRNEDY QPRPEPASWT AGGKVVNVDK VVWKAMPDMQ TSINALQSGD VDLIEQVTID LLPLLQMNDE VEYGVINPLG SQVTGRFNHR LPPFDDAEIR RAAMYALDQE QLMQTAIGDP EYYTLCASDY GCEVPLASDA GSEYLAGSAE ERMAKAQEIL KAAGYDGTPV LMMQPTDLTI LSTQPIVAAE RLREAGFAVE VASMDWATLQ SRKNGWQPVA EGGWNMLFTY WGVTGIWNPL VHALLDGSGS DNAWSGWPVS PEIEALREQY LVATDLDEQK RIAAEIQQIA YDQGFYYNAG EFQSVAAWRD GITELQPGPI TLFWGVKKPE
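The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, the values are copied from the record, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the gene sequence above ends in TAA).

```python
# Values copied from the Gene Information table in this record.
start_bp = 936241
end_bp = 937833
gene_length_bp = 1593
protein_length_aa = 530

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS consists of whole codons and the final codon
# is a stop codon that is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1593 bp corresponds to 531 codons, of which 530 encode amino acids.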