Gene Pfl01_3040 details


Gene Information

Locus tag: Pfl01_3040
Symbol:
ID: 3712724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas fluorescens Pf0-1
Kingdom: Bacteria
Replicon accession: NC_007492
Strand:
Start bp: 3479112
End bp: 3482273
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: histidine kinase
Protein accession: YP_348769
Protein GI: 77459262
COG category:
COG ID:
TIGRFAM ID:
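
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3479112
end_bp = 3482273
gene_length_bp = 3162
protein_length_aa = 1053

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not counted
# in the protein length.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)             # True
print(computed_protein_length == protein_length_aa)  # True
```

Under those assumptions the listed gene length (3162 bp) and protein length (1053 aa) agree with the coordinates.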


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0526469
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.364658
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCACG CCAATGCCCT GCTGGAAAAA CTTGCCAACA GCTCGCAGCG CCTGAACAAG 
GGCTTGCTGA TCCTCGGCGC GCTTGCGCTG CTGTTGCTGG GTATCAGTTA TCTGGGCGTG
CAGCGCATGG TGGAAGAACA GCGCGATACT CTGCAGTTTC ACTTCGCCCG CCTGATGGAA
AACGTTCGTG AGCAGGAGGC CTTTCTCGGC GACATCTCAC GGGCCGGCGC CAGCGGTGAA
TACTTGCCGG TTACCGTGGC GTCGCCGATG GTGCAAAAAC TGTTGCCCGA GGAAGGCCCG
AACATTTATC AAGGCCGCGG GTTGCCGTTT TCGTTACCGT TCAGCGTGAA GATTGATCCG
CAGCGAATTG CCCCCGACCA ATATCCCAAG GTGTTTGCGC TGGGCAGCTA TCTGGCGGCT
TATTACAGCG CCTTCTGGGC GGCATCGCAT TACCAGTCGC CGCAGGTGAT CCTGCTCAAC
GGCCCCGATA ATTTCGACAT CGCCGTGCCT GCCACCGGCC GTTTGCGCGG CGCCGGGCCA
ACGCAGGTAG GCCGGTTCGT CAAGCTTATG ACCCAAATGA ACATGCAGCG CCACTCACAG
ACTTCAGAGC AAGTGCTCTG GGCGCCCTAC CCGTTCGAGC AGGACAACGA TGGCACCCCG
AGCCTGCTGG CCTACGTGCG GATGAACCTG AGCAAACCGA CGCTGAGGAT TGAGGGGGCG
AACACCTGGG TGGTGCTCGG CTCGCTGTTG AAACTGTCGC AGGTCAACAA TATCGAGCGT
TTGATGGAGT GGTCGATCTA CGACAATTTC ACCCTGATCA CGCCCAACGG CACGGTGTTG
ATCGGTGCGC TCAAGCCAAA TCAAATGCTC GATGACGGCG CGAACCTGAC CCGCGACGGC
CTGGTGTTCA AGCTGACCAG CGCCGGCGAC CAACGATGGA CGGCGATCTA TGTAATCAGC
GTGCAGAGTT TTCTCGATTA CGCGCTGTGG CCACTGCTGT GCCTGCTGGC TCTGGTGCTG
GCCCTGCTTG GCAGTGGCCG CGCGCTCAAT CGCTGGTACA CCGACCGCGT TATCCTCCCC
GCGCAAAGTG CCCACGCTAG CATCGCCGAA AGCGAAGCCT TCTGCCGTGC GGTCATCGAC
ACCGCGCCGA CCGGCCTGTG CGTGATCAGC CGCCGCAATC ATCAGGTGTT GCTGGAAAAT
CAGCTCGCCC AGCAATGGCA CGACTCTGGC GAACTGATCA GCCTGCTTGA CCAGCAGGCG
GCGACTGGCC ATGGCCATGC CGAACTGGAG ATAGACAGCC GGCATTTGCA TGTCGCCTTC
GTCGCCACCC GTTATCAGGG CCAGGACGCC TGGCTGTGTG CGCTGCACGA CGTCACCCGC
CATGTTGAAG ACTCGGCGGC GCTGGAAGTC GCGCGTCAGG CCGCCGACTC GGCGAATCAG
GCCAAGAGCC GCTTTCTGGC GACCATTAGT CATGAAATCC GCACGCCGCT GTATGGCGTG
CTCGGCACTC TGGAGCTGCT TGGCCTGACC GCGCTCGCCC CGCGCCAACA GGAATATCTG
GAAACCATTC AGCGCTCGTC GGCCAGTCTG TTCAAGCTGA TCAGCGATGT GCTGGATGTG
TCGAAGATCG AGGCCGGGCA GATGAACCTC GAGCTTCAGC CATTCTGCCC GCTAGAGCTG
ACCGAAGACG TAGTGCGCAG CTACGGCGCG TTTGCCCGCG GCAAAGGGTT GCAATTGTAT
GCCTGCATCG ATGCAGCACT GCCGGATCAC CTGCTCGGTG ATGCGCAGCG CATTCGCCAG
ATCCTCAACA ACCTGCTGAG CAACGCGATC AAGTTCACCG ACAACGGCCG TGTGGTCGTG
CGCGTTCGCG TGCTGCAAAA CTCAGGCGGT CAAGCCCGGG TGCAGTGGCA GGTCAGCGAC
TCGGGCGTGG GCATTTCCCA GGCGCAGCAA CAGCAATTGT TCGACCCTTT CTATCAAGTC
AACGAGGCCG ACAGTCATGC CGGCGCGGGT CTCGGTCTGG CCATTTGCAA ATGGCTGTGC
GAGTTGATGC ACGGCCAGTT GAATGTGGTC AGTGAACCGG GACTGGGCAG CTGCTTCAAC
TTGCAACTGA TGCTCGAATG CGCCTCGGAC AGCCTCGCCG ATTGCCCGGC GTTCGGCGCC
GACAGCCCCG CTGTCTACGT GCGTGCGCCA GTCGCTGAAC TGGCGCAACA TCTGATGGCG
TGGCTCAACC GCTTTGGCCT CGACTGTCGA CTGGTGACCA ACGAGCTGCC GCCTCCTTCG
GCGCTGCTGG TCGATCTTGC ACCACTGGCC AGCGCTACAG CTTTCGCTGG CCAGCGAATC
GTCGCAATTG CCGCTGGCCC CAACCCGGCG CAGGTGAGCG GCAATGGTTG GCAGGTCGAT
GCCGACGACG TGCGCGCGAT CGGCTGGGCG ATTGCCCTCG CCGTGCACGG CGCCGGTCAG
CAGCGCCCGC CGTCACGGCA AGCAAGCACC CGAGCGCTAA ACCTACGCGT GCTGGTGGCC
GAGGACAATG CGATCAACGC GGCGATCATC AAGGAACAAC TGGAGGCGCT GGGCTGCTCG
GTAGTCGTCG CGGCCGATGG CGAACAGGCT TTGGCGCACT GGGCACCGGG GCGCTTCGAT
TTGCTGCTGA CCGACGTCAA CATGCCGGTC ATGAACGGCT ATCAACTGGC CGCAGCACTG
CGAGAGCAGG ATCCTACGCT GCCGATCATC GGCGTCACCG CCAATGCCCT GCGCGAAGAA
GGCGAGCGCT GCGCCGCCGT CGGCATGAAC GCGTGGATGG TCAAACCGTT GAACCTGGCG
ACGCTGCGTG CGCAACTGCA AAGCCATTGC CAGATCACCA TCGCACCGAT TGCCGATGCT
CCCCCGACGC TATCGCCGAA AATGCGCGAG CTGTTTGTCG TGACCCTGCG CCGCGACATT
CAGAGCACCC TCAGCGCACT GGACGCGGCC AACGCCGACA GCGTCGCGCA GCAACTGCAC
AGCATGGCCG GGGCACTGGG CGCGGTGCAA GTCGCGACAC TGGCCAGCGC ATTTGTCGAA
CTGGAATGTC GCCTGACCGG CATGGCCGTC ACCCCGGCGC TGGCCGTGGA GGTGCGTCAA
CAACTGGCGC GCCTGAGCGA CCTGCTCGAC GCCCTTGAAT AA
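
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and this is not necessarily how the database derived its 62% figure. Only the first displayed line of the sequence is used here; the full sequence would need to be pasted in to reproduce the gene-level value.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGCCGCACG CCAATGCCCT GCTGGAAAAA CTTGCCAACA GCTCGCAGCG CCTGAACAAG"

seq = clean_sequence(first_line)
print(len(seq))                   # number of bases after cleaning
print(f"{gc_fraction(seq):.2f}")  # GC fraction of this fragment only
```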
 
Protein sequence
MPHANALLEK LANSSQRLNK GLLILGALAL LLLGISYLGV QRMVEEQRDT LQFHFARLME 
NVREQEAFLG DISRAGASGE YLPVTVASPM VQKLLPEEGP NIYQGRGLPF SLPFSVKIDP
QRIAPDQYPK VFALGSYLAA YYSAFWAASH YQSPQVILLN GPDNFDIAVP ATGRLRGAGP
TQVGRFVKLM TQMNMQRHSQ TSEQVLWAPY PFEQDNDGTP SLLAYVRMNL SKPTLRIEGA
NTWVVLGSLL KLSQVNNIER LMEWSIYDNF TLITPNGTVL IGALKPNQML DDGANLTRDG
LVFKLTSAGD QRWTAIYVIS VQSFLDYALW PLLCLLALVL ALLGSGRALN RWYTDRVILP
AQSAHASIAE SEAFCRAVID TAPTGLCVIS RRNHQVLLEN QLAQQWHDSG ELISLLDQQA
ATGHGHAELE IDSRHLHVAF VATRYQGQDA WLCALHDVTR HVEDSAALEV ARQAADSANQ
AKSRFLATIS HEIRTPLYGV LGTLELLGLT ALAPRQQEYL ETIQRSSASL FKLISDVLDV
SKIEAGQMNL ELQPFCPLEL TEDVVRSYGA FARGKGLQLY ACIDAALPDH LLGDAQRIRQ
ILNNLLSNAI KFTDNGRVVV RVRVLQNSGG QARVQWQVSD SGVGISQAQQ QQLFDPFYQV
NEADSHAGAG LGLAICKWLC ELMHGQLNVV SEPGLGSCFN LQLMLECASD SLADCPAFGA
DSPAVYVRAP VAELAQHLMA WLNRFGLDCR LVTNELPPPS ALLVDLAPLA SATAFAGQRI
VAIAAGPNPA QVSGNGWQVD ADDVRAIGWA IALAVHGAGQ QRPPSRQAST RALNLRVLVA
EDNAINAAII KEQLEALGCS VVVAADGEQA LAHWAPGRFD LLLTDVNMPV MNGYQLAAAL
REQDPTLPII GVTANALREE GERCAAVGMN AWMVKPLNLA TLRAQLQSHC QITIAPIADA
PPTLSPKMRE LFVVTLRRDI QSTLSALDAA NADSVAQQLH SMAGALGAVQ VATLASAFVE
LECRLTGMAV TPALAVEVRQ QLARLSDLLD ALE