Gene Phep_1701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1701 
Symbol 
ID8252803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2011667 
End bp2014594 
Gene Length2928 bp 
Protein Length975 aa 
Translation table11 
GC content41% 
IMG OID644935353 
ProductPAS sensor protein 
Protein accessionYP_003091974 
Protein GI255531602 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.454558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00934608 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACCACT CAAAAGACCC CCTTTTCCAG ATCTTGTTCA ATCAGCTTGC CGAAGCAAGG 
CTTATCCTTA AAGCTGAATT CCCAAATTTT ATTGTTATTA GCAGTAATTC TGCATGGCAG
AACCAAAGCG GAACAGATAA ACTTTATCCC GGGTTAAATA TGGATGACCT GTTTTTGCAG
GTTATTGAAA AGGGTGAAGC ATTGGTCTTA CAACCTGTTT TGGATAATCA TTCAGACAAA
GATACCTGGT TTGAACTGGA GATCCTCCCG ATCAAGGATA TTTCAGGTGA TTCCACAACA
TACCTGATGT GTACCTTATA TGACGTGACT GAACGGGTAA ATGGAGAACA TGCGTTGACC
GCCTCGAAAG TCGCGAGTGC GGGCTTGCTG CTCAATAACC AGGAACTTAA TGAAGAACTT
GCTACTTCTA ATGAAGAACT TGCTGCGGCA AATGAAGAGT TAACTGCAGT TAACGAGGAA
CTATTAAGTG CTCAGGAAAG ATTACGGTTG TTAAATAATG ATCTGGAGCT CCTTGTTCAG
CAGCGTACGC TAGATTTGCA GCATAGCGAA CAAAAAGCCA GGTTTATTGT GGAAGATGCA
CCAGTAGCCA TTGGCGTGCT GGAAGGCCGG AACCTGATAA TTGATTCGGC CAATAAAAAA
ATGCTGGAAA TATGGGGGAA AGACGAATCG GTTATTGGCA AAACTTTACG TGAAGGCCTG
CCAGAACTTG AGGGGCAGGC ATTTTTGGGT ATTCTTGATG ATATATTCGT TACCGGGTCA
GTTTTTTACG GAAATGAGGT AATGGCTGTG CTGGAGTATT CGGGAATACT TAAAGAAGGT
TATTTCAATT TTGTTTATCA ACCTGTAAAG GATGAGCTTG GCCGTACATT AAGCATTATG
GTTGTAGCTA CTGAAGTGAC TGAACAGGTA AAGGCAAGAA AGGTTATTGA AGAAAGTGCC
CATCAGCTCA GAAGAATGGT TATGACTACT CCAATAGGTT TAACTATCCT TAAAGGAACC
GAATTGTCCA TCGAAATAGC CAATCAGCCG ATGCTCGACA TCTGGGGGCG AAGGGATGAA
GAGGTTATCG GGAGAAATTT AACCAACGTT TTTCCGGAAC TTGCAGATCA GCCCTTTCCA
GCTTTGCTCA GAAATATTTT TGACACGGGA AAGCGGGTAG CTGAACCTTC GGCTAAGGCT
ACTATTGTTT TATCTGACGG TACATTTAAA GAAATTTATG TCGATTTTTC TTACGACCCG
CTCTTTGATC TTGACGGAAA CGTAGAGGCA ATACTGGCCA GTGTAAAGGA TGTAACCGAA
CTTACCGAAG GAAGAAAAAT GTTGCAGCAA AGGCAGGAGG AACTGGAAAC GTTGAATGAA
GAATTTTTAG CCGCAAATGA GGAATTGGTT GCTACAAATG ATGAGCTTTT TGAAACTCAG
GAAGATTTAA AGCTGCTATT TGAGCGATTA AAGGATAATG AAACCAGATT TCGTAGTCTG
TTTGAGCAAT CGCCGGTAGG CATGTGTTTC CTCAAAGGAG AGGAATTGAT TATTGAGCTG
GCGAACGAGA ATATTTTAAA AATCTGGGGA CGAACAAGAG AAGAGGTAGT TGGTAAACCT
CATGCACTTG CCCGGCCGGA ACTTCGTGGG CAGCCCATGA ATGAATGGCT TCGCGAAGTT
TACGTTACTG GTATTCCACG TATAAATAAT GAGTTAAAAG TTAAATTGTA TGATAAAGGT
GGACTTAGAG AGGCATTTGT TCATTCCTTA TATCATCCTT TAAAGGATGA GCAGGGAGTT
GTAACTGGTT TACTGATCAT TCTTAGTGAT GTTACACCTT GGGTACAAAC CAGAAAACAA
GTGGAGTGGG CACAGGAACA ATTGAGCCAG GCTATACAGT CGGCTGAATT GGGTACCTGG
TATATCAAAA CAGAAACCAG GGAATTCATA CCTTCACAAC GACTCAAGGA ATTATTTGGT
TTTCAGAAGG ATGAGGTAAT GACCTTTGCT GATGCGATCA AACTGATTAC TGACGATTAT
CGAGAAAGTG TTGTTAAAGC AATTGAGGAT ACCATTGAAA ATGGCAGCAG ATTTGAATTA
GAATACCCAA TCCATAGCTA TAGGGACGGG CAGCTCCGTT GGTTAAGATC TACTGGAAAG
TTGTATCCTG CAGAATCTGG AACAGCTTCA CATTTTTCTG GAACAGTATT AGATATTACT
GCTCATAAAC TAGAAGAAAT TCGAAAAAAT GATTTTATTG CCATTGTTAG TCATGAGTTA
AAAACTCCAT TAACAAGTTT AAAAGGATAT TTACAGTTGA TGAGAGGTAG ATTTGATAGC
TCGGCAGCTC ATTCATTTTT CAGCACAGTT TCTGAAAAAT CTTTAGCTCA AGTAGAAAAA
ATGCATTCAT TGGTAAAAGG ATTTCTTGAC GTGGCAAGGT TAGAATCTGC AAAGTTGGTG
CTGAATCTTC AGCCTATGCG TATAGATCAG CTTGTACTTG AATCAGCTGA AGAAGCCAGT
CTGATGTACG ATCAGCACGA AATTATAGTG GAGTATTGCG AACCCGTTGA AGTTATGGCT
GACCGTGACA AAATTACACA GGTATTGGGT AACTTGTTGA GCAATGCAAT TAAATATTCT
CCCCGGGGTA AAATAGTTTC GATGAGTTGC AAGGTTATTG GTACTGAAGT GCTTGTTGAA
GTTAAAGATC AGGGGATGGG TATCAAACAG CATGAAATAT CTAAGGTATT TGATCGCTTT
TACCGTGTCG AGACCAAACA TACAACTACA ATTTCCGGAT TTGGGATCGG ACTGTATTTA
TGTGCTGAGA TCATTAAATT GCATAATGGA AGGATTTGGG TTGAAAGCAA GATTGGTGTC
GGATCTTCTT TCTTTTTTAG CCTGCCAATT GGTAAAGTTA GCCCCTGA
 
Protein sequence
MHHSKDPLFQ ILFNQLAEAR LILKAEFPNF IVISSNSAWQ NQSGTDKLYP GLNMDDLFLQ 
VIEKGEALVL QPVLDNHSDK DTWFELEILP IKDISGDSTT YLMCTLYDVT ERVNGEHALT
ASKVASAGLL LNNQELNEEL ATSNEELAAA NEELTAVNEE LLSAQERLRL LNNDLELLVQ
QRTLDLQHSE QKARFIVEDA PVAIGVLEGR NLIIDSANKK MLEIWGKDES VIGKTLREGL
PELEGQAFLG ILDDIFVTGS VFYGNEVMAV LEYSGILKEG YFNFVYQPVK DELGRTLSIM
VVATEVTEQV KARKVIEESA HQLRRMVMTT PIGLTILKGT ELSIEIANQP MLDIWGRRDE
EVIGRNLTNV FPELADQPFP ALLRNIFDTG KRVAEPSAKA TIVLSDGTFK EIYVDFSYDP
LFDLDGNVEA ILASVKDVTE LTEGRKMLQQ RQEELETLNE EFLAANEELV ATNDELFETQ
EDLKLLFERL KDNETRFRSL FEQSPVGMCF LKGEELIIEL ANENILKIWG RTREEVVGKP
HALARPELRG QPMNEWLREV YVTGIPRINN ELKVKLYDKG GLREAFVHSL YHPLKDEQGV
VTGLLIILSD VTPWVQTRKQ VEWAQEQLSQ AIQSAELGTW YIKTETREFI PSQRLKELFG
FQKDEVMTFA DAIKLITDDY RESVVKAIED TIENGSRFEL EYPIHSYRDG QLRWLRSTGK
LYPAESGTAS HFSGTVLDIT AHKLEEIRKN DFIAIVSHEL KTPLTSLKGY LQLMRGRFDS
SAAHSFFSTV SEKSLAQVEK MHSLVKGFLD VARLESAKLV LNLQPMRIDQ LVLESAEEAS
LMYDQHEIIV EYCEPVEVMA DRDKITQVLG NLLSNAIKYS PRGKIVSMSC KVIGTEVLVE
VKDQGMGIKQ HEISKVFDRF YRVETKHTTT ISGFGIGLYL CAEIIKLHNG RIWVESKIGV
GSSFFFSLPI GKVSP