Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1701 |
Symbol | |
ID | 8252803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2011667 |
End bp | 2014594 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644935353 |
Product | PAS sensor protein |
Protein accession | YP_003091974 |
Protein GI | 255531602 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.454558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00934608 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCACCACT CAAAAGACCC CCTTTTCCAG ATCTTGTTCA ATCAGCTTGC CGAAGCAAGG CTTATCCTTA AAGCTGAATT CCCAAATTTT ATTGTTATTA GCAGTAATTC TGCATGGCAG AACCAAAGCG GAACAGATAA ACTTTATCCC GGGTTAAATA TGGATGACCT GTTTTTGCAG GTTATTGAAA AGGGTGAAGC ATTGGTCTTA CAACCTGTTT TGGATAATCA TTCAGACAAA GATACCTGGT TTGAACTGGA GATCCTCCCG ATCAAGGATA TTTCAGGTGA TTCCACAACA TACCTGATGT GTACCTTATA TGACGTGACT GAACGGGTAA ATGGAGAACA TGCGTTGACC GCCTCGAAAG TCGCGAGTGC GGGCTTGCTG CTCAATAACC AGGAACTTAA TGAAGAACTT GCTACTTCTA ATGAAGAACT TGCTGCGGCA AATGAAGAGT TAACTGCAGT TAACGAGGAA CTATTAAGTG CTCAGGAAAG ATTACGGTTG TTAAATAATG ATCTGGAGCT CCTTGTTCAG CAGCGTACGC TAGATTTGCA GCATAGCGAA CAAAAAGCCA GGTTTATTGT GGAAGATGCA CCAGTAGCCA TTGGCGTGCT GGAAGGCCGG AACCTGATAA TTGATTCGGC CAATAAAAAA ATGCTGGAAA TATGGGGGAA AGACGAATCG GTTATTGGCA AAACTTTACG TGAAGGCCTG CCAGAACTTG AGGGGCAGGC ATTTTTGGGT ATTCTTGATG ATATATTCGT TACCGGGTCA GTTTTTTACG GAAATGAGGT AATGGCTGTG CTGGAGTATT CGGGAATACT TAAAGAAGGT TATTTCAATT TTGTTTATCA ACCTGTAAAG GATGAGCTTG GCCGTACATT AAGCATTATG GTTGTAGCTA CTGAAGTGAC TGAACAGGTA AAGGCAAGAA AGGTTATTGA AGAAAGTGCC CATCAGCTCA GAAGAATGGT TATGACTACT CCAATAGGTT TAACTATCCT TAAAGGAACC GAATTGTCCA TCGAAATAGC CAATCAGCCG ATGCTCGACA TCTGGGGGCG AAGGGATGAA GAGGTTATCG GGAGAAATTT AACCAACGTT TTTCCGGAAC TTGCAGATCA GCCCTTTCCA GCTTTGCTCA GAAATATTTT TGACACGGGA AAGCGGGTAG CTGAACCTTC GGCTAAGGCT ACTATTGTTT TATCTGACGG TACATTTAAA GAAATTTATG TCGATTTTTC TTACGACCCG CTCTTTGATC TTGACGGAAA CGTAGAGGCA ATACTGGCCA GTGTAAAGGA TGTAACCGAA CTTACCGAAG GAAGAAAAAT GTTGCAGCAA AGGCAGGAGG AACTGGAAAC GTTGAATGAA GAATTTTTAG CCGCAAATGA GGAATTGGTT GCTACAAATG ATGAGCTTTT TGAAACTCAG GAAGATTTAA AGCTGCTATT TGAGCGATTA AAGGATAATG AAACCAGATT TCGTAGTCTG TTTGAGCAAT CGCCGGTAGG CATGTGTTTC CTCAAAGGAG AGGAATTGAT TATTGAGCTG GCGAACGAGA ATATTTTAAA AATCTGGGGA CGAACAAGAG AAGAGGTAGT TGGTAAACCT CATGCACTTG CCCGGCCGGA ACTTCGTGGG CAGCCCATGA ATGAATGGCT TCGCGAAGTT TACGTTACTG GTATTCCACG TATAAATAAT GAGTTAAAAG TTAAATTGTA TGATAAAGGT GGACTTAGAG AGGCATTTGT TCATTCCTTA TATCATCCTT TAAAGGATGA GCAGGGAGTT GTAACTGGTT TACTGATCAT TCTTAGTGAT GTTACACCTT GGGTACAAAC CAGAAAACAA GTGGAGTGGG CACAGGAACA ATTGAGCCAG GCTATACAGT CGGCTGAATT GGGTACCTGG TATATCAAAA CAGAAACCAG GGAATTCATA CCTTCACAAC GACTCAAGGA ATTATTTGGT TTTCAGAAGG ATGAGGTAAT GACCTTTGCT GATGCGATCA AACTGATTAC TGACGATTAT CGAGAAAGTG TTGTTAAAGC AATTGAGGAT ACCATTGAAA ATGGCAGCAG ATTTGAATTA GAATACCCAA TCCATAGCTA TAGGGACGGG CAGCTCCGTT GGTTAAGATC TACTGGAAAG TTGTATCCTG CAGAATCTGG AACAGCTTCA CATTTTTCTG GAACAGTATT AGATATTACT GCTCATAAAC TAGAAGAAAT TCGAAAAAAT GATTTTATTG CCATTGTTAG TCATGAGTTA AAAACTCCAT TAACAAGTTT AAAAGGATAT TTACAGTTGA TGAGAGGTAG ATTTGATAGC TCGGCAGCTC ATTCATTTTT CAGCACAGTT TCTGAAAAAT CTTTAGCTCA AGTAGAAAAA ATGCATTCAT TGGTAAAAGG ATTTCTTGAC GTGGCAAGGT TAGAATCTGC AAAGTTGGTG CTGAATCTTC AGCCTATGCG TATAGATCAG CTTGTACTTG AATCAGCTGA AGAAGCCAGT CTGATGTACG ATCAGCACGA AATTATAGTG GAGTATTGCG AACCCGTTGA AGTTATGGCT GACCGTGACA AAATTACACA GGTATTGGGT AACTTGTTGA GCAATGCAAT TAAATATTCT CCCCGGGGTA AAATAGTTTC GATGAGTTGC AAGGTTATTG GTACTGAAGT GCTTGTTGAA GTTAAAGATC AGGGGATGGG TATCAAACAG CATGAAATAT CTAAGGTATT TGATCGCTTT TACCGTGTCG AGACCAAACA TACAACTACA ATTTCCGGAT TTGGGATCGG ACTGTATTTA TGTGCTGAGA TCATTAAATT GCATAATGGA AGGATTTGGG TTGAAAGCAA GATTGGTGTC GGATCTTCTT TCTTTTTTAG CCTGCCAATT GGTAAAGTTA GCCCCTGA
|
Protein sequence | MHHSKDPLFQ ILFNQLAEAR LILKAEFPNF IVISSNSAWQ NQSGTDKLYP GLNMDDLFLQ VIEKGEALVL QPVLDNHSDK DTWFELEILP IKDISGDSTT YLMCTLYDVT ERVNGEHALT ASKVASAGLL LNNQELNEEL ATSNEELAAA NEELTAVNEE LLSAQERLRL LNNDLELLVQ QRTLDLQHSE QKARFIVEDA PVAIGVLEGR NLIIDSANKK MLEIWGKDES VIGKTLREGL PELEGQAFLG ILDDIFVTGS VFYGNEVMAV LEYSGILKEG YFNFVYQPVK DELGRTLSIM VVATEVTEQV KARKVIEESA HQLRRMVMTT PIGLTILKGT ELSIEIANQP MLDIWGRRDE EVIGRNLTNV FPELADQPFP ALLRNIFDTG KRVAEPSAKA TIVLSDGTFK EIYVDFSYDP LFDLDGNVEA ILASVKDVTE LTEGRKMLQQ RQEELETLNE EFLAANEELV ATNDELFETQ EDLKLLFERL KDNETRFRSL FEQSPVGMCF LKGEELIIEL ANENILKIWG RTREEVVGKP HALARPELRG QPMNEWLREV YVTGIPRINN ELKVKLYDKG GLREAFVHSL YHPLKDEQGV VTGLLIILSD VTPWVQTRKQ VEWAQEQLSQ AIQSAELGTW YIKTETREFI PSQRLKELFG FQKDEVMTFA DAIKLITDDY RESVVKAIED TIENGSRFEL EYPIHSYRDG QLRWLRSTGK LYPAESGTAS HFSGTVLDIT AHKLEEIRKN DFIAIVSHEL KTPLTSLKGY LQLMRGRFDS SAAHSFFSTV SEKSLAQVEK MHSLVKGFLD VARLESAKLV LNLQPMRIDQ LVLESAEEAS LMYDQHEIIV EYCEPVEVMA DRDKITQVLG NLLSNAIKYS PRGKIVSMSC KVIGTEVLVE VKDQGMGIKQ HEISKVFDRF YRVETKHTTT ISGFGIGLYL CAEIIKLHNG RIWVESKIGV GSSFFFSLPI GKVSP
|
| |