Gene Information |
Locus tag | Pden_1788 |
Symbol | |
ID | 4580972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008686 |
Strand | + |
Start bp | 1780710 |
End bp | 1783367 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639769106 |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_915581 |
Protein GI | 119384525 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG0591] Na+/proline symporter; [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
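The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions are consistent with the values shown but are not stated by the record. The function names are hypothetical.

```python
# Values copied from the Gene Information table for Pden_1788.
START_BP = 1780710
END_BP = 1783367
GENE_LENGTH_BP = 2658
PROTEIN_LENGTH_AA = 885


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the stop codon is counted in the gene length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1783367 - 1780710 + 1 = 2658 bp, and 2658 / 3 - 1 = 885 aa, matching the Gene Length and Protein Length rows.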
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00597808 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCTG AAACCCTGGG CGCCGCCTCG GTCGTCTATG TCGCGCTGAT GTTCCTGCTG GCCCATGCCG CCGACCGGGC CGCCGCCGCC GGCCGCTGGC GCTGGATGGA CCGGCCGGTG ATCTATACGC TGTCGCTGTC GGTCTATTGC TCGGCCTGGA CCTTCTATGG CGCCGTGGGC TACGCCACCC GCTCGGGGCT GGAGTTCCTG ACCATCTACC TTGGCCCCGG CATCGTCTTT GCCGGGGCAT GGTGGGGCCT CAGGCGGCTG GTCCGGGTCG CGCGGCTGCA TCGCGTGACC TCGATCGCCG ACCTCATCTC CAGCCGCTTC GGCAAGTCCG CGGGGCTTGC GGCGCTGGTG ACGCTGATCG GCGTCCTGGC CTCGACCCCC TATATCGCGC TGCAACTGCA ATCGGTCTCG ATGGCGCTTT CGGCATTCGC CACCAGCGGC CACCCCCTGC CCGGCAGTAT CGCGCTGTGG GTGGCGGTCG GGCTTGCCGC CTTCGCCATC CTGTTCGGCA CCCGCAACCT GGCCGCGAAC GAGCGCCATC ACGGCGTCGT CATCGCCATC GCCGTCGAGG CAGTGGTCAA GCTTGCCGCC TTTCTGGCCG TCGGCGCCTA TGTGCTGTGG GGGATCGCGG ACGGGCCTGC CGACGTGCTG GCGCGTATCG ACCGCATGGC GGTCTCGGCC GGGGGCGAGG GCTGGCTGAT CCGGCCGGAT CGCTGGTTCA CGCTGATCGC GCTGTCCGGG GCGGCGGTGA TGGTGCTGCC GCGCATGTTC CACGTCCTGG TGGTCGAGGC CAGCGACGAG AATCGGCTGC GCCAGGCCGG CTGGGCTTTT CCCGCCTATC TGGCCGCCAT GTCCTTCCTG GTGCTGCCCA TCGCCGTGGT CGGCGCGGAC CTGCTGCCCA AGGGCTCGAA CCCCGACCTT TACGTGCTGG CCCTGCCGCT GTCGCAGGGG CGCGACACGC TGGCGGCGCT GGTGTTCCTG GGCGGCTTTT CCTCGGCCAT GTCGATGGTG GTGGTCAGCG CCATCGCGCT GTCGACGATG ATGGCGAACC ATTGGCTGGT GCCCTTGTGG CTCTGGCTGC GCCAGACGGC GCTGTCCGAA CCGGCGCAGG CCCGCGACGA TCTGGGCGCA TTGATGCTGA ACGCGCGGCG GCTGGCCATC GTCTCGGTGA TCGGCGCGGG CTGGCTCTAT CACCGCATGA CCGGGGGCAC CACCGCCCTG GCCGCCATGG GGACGGTCGC CTTTTCGGGC ATGGCGCAGG TGCTGCCGGC CATGATCGCG GGGCTGATCT GGCGCGGCGC GACCCGGCGG GGCGCCATCG CCGGGATCGT CGCGGGAGCG CTGATCTGGG GACGCGCGGT CTTCCTGCCC TCGCTGGGGC TGGCCGCTCC GCCGGCATTT CCTGCCGGAA TCGATGCTTT CGCGGGGGCG GTGCTGCTGG CGCTGGCGGT GAACCTGCTG CTTCTGGTGC TGACCTCGCT CATGGATTTC CCCGACCCGA CCGAGCGGTT GCAGGCGCTG TCCTTCGTCC ATGCCATCGC CCCCGACGCC GCCGCCAGCG AAACCGCCCC AGCGGTGCAG GCCGAGGCGC TGCTGACGCT GGCCGGGCGG ATCTGGGGCA ACGAACATGC GCTGGAGGTG TTCCGCAAGG CCGCGGAAGA TCAGGGAAAA TCCGGCTATC TTCCCGACCT GACCCCGCGC TTTCTCGCCG GGTTCGAGCG CCACCTGGCC GGCACGGTGG GCGCGGCGAC GGCCTCGGCG CTGATGGCGC GGGTCGGCGG GCGCGGCAGC 
GTCACCGTGG CCGACCTGAT GGAGGTCGCG GGCGAGGCCA GCCGCGCGCG CGAGGACAAC CGCCGGCTGG AGATCACCTC GGCGGAACTG GCCAGAACCG CCGCCATGCT GCGGGAAAGC AACGAGAAGC TGACCACGCT TTCCGCGCAG AAGGACGCCT TTCTCGGCCA GATCAGCCAT GAGCTGCGCA CGCCCATGAC CTCGATCCGC GCCTTCTCCG AGATCCTGAT GGAACCGGAC CTGCCCACCG GGGACCGTGC CCGCTTCGCC GGGATCATCC AGGACGAGGC CTGCCGCATG ACCCGGCTGC TGGACGACCT GCTGGACCTT TCGGTGCTGG AGGCGGGCCG GGCGCAACTG AAGCCCGGCG TCGTCAACCT GCACGACCTG ATCGGGCGCG CGCTGGCCGC CGCGGGGGCA AGTGCGACGG GGCGCGAATT CGCCATCCGC CGCAACCCGG TCGCCGAGCA TCTGCCGGTG ATCACCGACC CGGACCGGCT GTTGCAGGTG CTGATCAACG TCATCTCGAA CGCGCGGAAA TATTGCGACG CCAGCGAGCC CGCGATCCGC ATCGACACCC GCCGCAACCC CCAGGGCTGG ACCGAGATCG ACATCCACGA CAACGGCTCG GGCATCGGCC CGGAAAACCG CGCGCTGATC TTCGAGAAAT TCGCGCGGCT GGACGATCCC TCGCGCGCCG GGGGTGCCGG GCTGGGACTG GCGATCTGCA AGGAGATCAT GGATTTCCTG GGCGGCACGA TCGCCTATCT GCCAGGCCAG GGCGGCGCCT GCTTTCGCAT CGCGCTGCCG CCGCGCCCAC CACGCCGCGC GGAAAATACG GAAAATGCAG CGGGTTAA
|
Protein sequence | MTPETLGAAS VVYVALMFLL AHAADRAAAA GRWRWMDRPV IYTLSLSVYC SAWTFYGAVG YATRSGLEFL TIYLGPGIVF AGAWWGLRRL VRVARLHRVT SIADLISSRF GKSAGLAALV TLIGVLASTP YIALQLQSVS MALSAFATSG HPLPGSIALW VAVGLAAFAI LFGTRNLAAN ERHHGVVIAI AVEAVVKLAA FLAVGAYVLW GIADGPADVL ARIDRMAVSA GGEGWLIRPD RWFTLIALSG AAVMVLPRMF HVLVVEASDE NRLRQAGWAF PAYLAAMSFL VLPIAVVGAD LLPKGSNPDL YVLALPLSQG RDTLAALVFL GGFSSAMSMV VVSAIALSTM MANHWLVPLW LWLRQTALSE PAQARDDLGA LMLNARRLAI VSVIGAGWLY HRMTGGTTAL AAMGTVAFSG MAQVLPAMIA GLIWRGATRR GAIAGIVAGA LIWGRAVFLP SLGLAAPPAF PAGIDAFAGA VLLALAVNLL LLVLTSLMDF PDPTERLQAL SFVHAIAPDA AASETAPAVQ AEALLTLAGR IWGNEHALEV FRKAAEDQGK SGYLPDLTPR FLAGFERHLA GTVGAATASA LMARVGGRGS VTVADLMEVA GEASRAREDN RRLEITSAEL ARTAAMLRES NEKLTTLSAQ KDAFLGQISH ELRTPMTSIR AFSEILMEPD LPTGDRARFA GIIQDEACRM TRLLDDLLDL SVLEAGRAQL KPGVVNLHDL IGRALAAAGA SATGREFAIR RNPVAEHLPV ITDPDRLLQV LINVISNARK YCDASEPAIR IDTRRNPQGW TEIDIHDNGS GIGPENRALI FEKFARLDDP SRAGGAGLGL AICKEIMDFL GGTIAYLPGQ GGACFRIALP PRPPRRAENT ENAAG
|
| |
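The relationship between the gene sequence and the protein sequence above can be spot-checked by translating the first and last codons. The sketch below is illustrative only and is not the method used by the database. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical, and only short fragments of the gene sequence are embedded rather than the full 2658 bp.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base groups and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence, copied from the record.
GENE_PREFIX = "ATGACGCCTG AAACCCTGGG CGCCGCCTCG"
GENE_SUFFIX = "GAAAATGCAG CGGGTTAA"

if __name__ == "__main__":
    print(translate(GENE_PREFIX))  # MTPETLGAAS
    print(translate(GENE_SUFFIX))  # ENAAG
```

The two fragments translate to the first ten and last five residues of the protein sequence, and the final codon TAA is a stop codon, which is why the gene is one codon longer than the protein.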