Gene Phep_3858 details


Gene Information

Locus tag: Phep_3858
Symbol:
ID: 8254992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4627531
End bp: 4629924
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 42%
IMG OID: 644937522
Product: PAS sensor protein
Protein accession: YP_003094111
Protein GI: 255533739
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
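
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4627531, 4629924)
print(length, protein_length(length))  # prints "2394 797"
```

Both values agree with the Gene Length and Protein Length fields in the record.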


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAACAG CACAGCAGGA TTCATCAACT CATGAAAGTA AACGCATCAA TGCGTTGCAT 
GGCTATAGTA TATTGAATAA ACCACAAGAG GACCAGTTTG ACCATATGGC TAAACTGGCC
TCGTTTTTAT GTGGCACACC CATAGCAATC ATTGCGTTGG TTGACGAAGA CCTGCTTTGG
TTCAAATCTA CCATAGGTAT CACCTATAAT GAAATGCCCC AGGGCATTTC ATTTTGTGAA
CATACTCTTA CGGGCGAGCA GACTTTTGAA GTATATGATG CCAGGTCTGA TGAAAGATTT
AAAGCGCATC CATCAGTTGT CGGCAAACCT CATATAAGGT TTTATGCAGG TACGGCTCTT
ATAGATGAGG ATGGCTCTAA GTTGGGTGCG CTCTGTGTAT TTGATACAGC ACCCCGAAAA
CTAAGCGATG CGCAAAAAGA GGGCCTGGAA ATTCTGGCCA GAACAGTTGT TGCCCATATT
TCCTTAAAAA AGAAAAGTGA AGCATTGATT GCGAATAACC ACCGCTATGA GGAACTGCTC
AACATCACTG CAGTTTCACC CGAAATACAT TGTATCCTGG ATTATACCGG TAAAGTGTTG
TTCATTAACA ATGCTATAAC GAATATTCTT GAATATACAG TTGAGGAAGC TATTGGCTTA
AATATCTGGA ATTTTTGCCA TAGGGAAGAT ATAGACCGTA TTGTAAAAAT CATAGAAGAG
GGGCTGAGGA ATAAGATCAA GGAGTTTGTA ATTGATTTCA GGATTGTCAG TAAAACAGGG
GTAATCCGCT GGCTGGGCTG GAGCATGGTA GCCAAAAACG GACGCTGGTA TGCTTACGGA
AGAGACATTA CAGAAAATAA GAAGGTAGAG CACGAATTGA TGAAGCTGTC TTTTGTGGCC
AGTAAGGTGA ACAATGCCAT CGTCATTAAT GATGCCAACA ACCATGTAAC CTGGGTAAAT
GAAGCTTTTG AAAAGATTAC GGGTTTTAAC CTGGATGATC TTAAGGGCAG GCGATTGGGC
GATCTTATTG CTGGTCCGGA AACAGATATG GCACTTATGG AAAGGGCAAG GGCACAGACC
AAACTGAACC AGTCCTTCAC GGTCGACCTC CTGGCTTACC GTAAAGATGG TCAACCCATC
TGGCTTTCCA TATACAATAC GGTAGTATTT AATGATCTGG GAAATGTGGA GATAGAAGTA
GAAATCATCA TTGACATTAC CGACAAAAAG AGAGTGGAAC GTGAAATGCT GGAGGCCAAA
GAACAGGCCC TGCAATTGAG CGAAGCCAAA GAGATGTTCC TCTCTGTAAT GAGCCATGAG
ATCAGAACGC CACTCAATGC TGTAATTGGC ATGACTCATC TGCTGATAGA AAATGATCCG
AGGGCCTCTC AGATCGAAGA CCTCAATATC CTTAAGTTTT CGGGTGAAAA CCTGCTGAAC
ATCATTAATG ATATCCTGGA TTTCACTAAA ATGGAAACCG GTAACCTTCA ACTTGAGGCA
CTTCCATTTA GCATGAAGGC GCTCACCACT GATATCATCA CCTCTTTATA TGTTGGTGCA
ACTAAAAAGG GGAATATTCT GGAGCTCTTG TACGATGACC AGATCCCGCA TCTTTCGGGA
GATAAAACAA GGTTATACCA GATACTGATG AATTTACTCG GTAATGCGAT TAAATTCACT
GATCAGGGAA AGATCATACT GAGTGTTAAA CTGCTTGAGC AGGATGCTGT AAATGCCAGC
CTGTATTTTG AGATCAGTGA TACAGGAATT GGCATACCTG AGGATAAACT TTCTTATATT
TTTGAGACCT TTACGCAGGC TAAAACGGAT ATTTCAAGGA AATATGGTGG AACAGGTCTG
GGCCTGGCCA TTACTAAAAA ACTGCTGGAA CTATACAATT CGGAAATTCA AGTAAAAAGC
AGGGAAGGAG CGGGTACTAC TTTTTCGTTT ACGCTCAGAT TTGCAAAGGC ATCTGCCGCT
GTGAGCGAAG TTAAGCCTGC GCTGCAACCA CTTGCACTAT CAGGTAAAAA AATACTGATC
GTAGATGACA ATGAGATTAA CCTGCTGATC GCTAAAAGAA TCCTGACAAA ATATAGTTTG
GAGATAGACT TTGCACTTAG CGGTGAAGAA GCCCTTGAAA TGGTACAAAA GAATATTTAT
GACCTGGTAT TTATGGATAT TAAAATGCCA GGCATAGATG GTTTTGAAAC TACGGTGCGT
ATCAGAAATC TTGGTGGCAA TTATTTTAGC ACGGTGCCTA TCATCGCTTT AACGGCATCC
ACATTAAACA ATGAAATTGT GAGGTTTAAG CAGTCGGGAA TGAATGGCCA TATTTTAAAA
CCATTTAAAC TTGAAGAGAT CAGAGATTTG CTATCAGCCT ATTTACAGCC CTGA
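
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be stripped before the sequence is measured or analysed. A minimal sketch, with illustrative helper names that are not part of the record, for normalising such a block and computing its GC percentage:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used only as sample input.
first_line = "ATGTCAACAG CACAGCAGGA TTCATCAACT CATGAAAGTA AACGCATCAA TGCGTTGCAT"
print(len(clean_sequence(first_line)))  # prints 60
print(round(gc_content(first_line), 1))
```

Applied to the complete gene sequence, the cleaned length can be compared with the 2394 bp gene length and the GC percentage with the 42% GC content given in the record.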
 
Protein sequence
MSTAQQDSST HESKRINALH GYSILNKPQE DQFDHMAKLA SFLCGTPIAI IALVDEDLLW 
FKSTIGITYN EMPQGISFCE HTLTGEQTFE VYDARSDERF KAHPSVVGKP HIRFYAGTAL
IDEDGSKLGA LCVFDTAPRK LSDAQKEGLE ILARTVVAHI SLKKKSEALI ANNHRYEELL
NITAVSPEIH CILDYTGKVL FINNAITNIL EYTVEEAIGL NIWNFCHRED IDRIVKIIEE
GLRNKIKEFV IDFRIVSKTG VIRWLGWSMV AKNGRWYAYG RDITENKKVE HELMKLSFVA
SKVNNAIVIN DANNHVTWVN EAFEKITGFN LDDLKGRRLG DLIAGPETDM ALMERARAQT
KLNQSFTVDL LAYRKDGQPI WLSIYNTVVF NDLGNVEIEV EIIIDITDKK RVEREMLEAK
EQALQLSEAK EMFLSVMSHE IRTPLNAVIG MTHLLIENDP RASQIEDLNI LKFSGENLLN
IINDILDFTK METGNLQLEA LPFSMKALTT DIITSLYVGA TKKGNILELL YDDQIPHLSG
DKTRLYQILM NLLGNAIKFT DQGKIILSVK LLEQDAVNAS LYFEISDTGI GIPEDKLSYI
FETFTQAKTD ISRKYGGTGL GLAITKKLLE LYNSEIQVKS REGAGTTFSF TLRFAKASAA
VSEVKPALQP LALSGKKILI VDDNEINLLI AKRILTKYSL EIDFALSGEE ALEMVQKNIY
DLVFMDIKMP GIDGFETTVR IRNLGGNYFS TVPIIALTAS TLNNEIVRFK QSGMNGHILK
PFKLEEIRDL LSAYLQP
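
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact base-order string published for NCBI translation table 11, whose amino acid assignments are the same as the standard code. Handling of alternative start codons is left out, which is an assumption that holds here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First displayed line of the gene sequence.
print(translate("ATGTCAACAG CACAGCAGGA TTCATCAACT CATGAAAGTA AACGCATCAA TGCGTTGCAT"))
# prints MSTAQQDSSTHESKRINALH
```

The output matches the first 20 residues of the protein sequence above. Translating the whole gene sequence the same way should give the full 797-residue protein.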