Gene Pfl01_1655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPfl01_1655 
Symbol 
ID3716789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas fluorescens Pf0-1 
KingdomBacteria 
Replicon accessionNC_007492 
Strand
Start bp1843757 
End bp1845028 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content58% 
IMG OID 
Productaromatic hydrocarbon degradation protein 
Protein accessionYP_347387 
Protein GI77457882 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0166252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00574559 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAG TCATGCTCAA AACCACCCTT AGCCTTGCCG TAACCGTGGC ATCCACCCAG 
ATCTTCGCGG CTGGCTTTGC CATCAACGAA CACAGCATCA GCGGGATGGG GACTGGGTAC
GCCGGGCGAT CTTCTTCTGC CGACGACGCA AGCACTGTTT ACGGCAACCC TGCCGGCATG
TCGCGCATCA CGCGCGAACA AGTCACCGGT GGCGTTGCAT TCCTCGATGC AAAAACCGAT
ATCAGCGACG CCAGCTCCAG CCCGAACGGC GGCAGCAACA AAGGCGACAT GGTGCCCTTC
ACCTCCGTAC CTATGGGTTA CTACGTCAAG CCGATCGACG AGCATTGGGC ATTCGGTCTG
GGTGTGTACG TACCCTTCGG CCTGATCACC GACTATGAAA ACGGCTTCGC CGGCCGTTAC
TTCGGCAGCA AGTCCGAAGT CAAGATCATT ACCTTCCAGC CAACCGTCAG CTACAAGTTC
AACGACGTCG TGTCGATCGG TTTCGGCCCG ACCATCAACC GTATCGACGG CTCGCTTGAA
TCTAACCTGT CGATCACTCA GGCTGCGCCG GACGGCAAGG TCAAGATCAA GGGTGACGAC
ACCGCACTGG GCTACAACAT CGGGGTGCTG GTACAGGCTA CCGACAGCAC TCGCCTGGGT
CTGACCTACC ACTCGAAAGT CGACTACAAG CTCGAAGGCA ACACCAAGGT CAACTACGGT
GTGCTGGGCG CGATCGGCCT GGGTGCGAAC CAGAAGTACG ACGCTTCGCT GAAGATCACC
ACGCCTGAAT CCGTGGACCT CTCGGTCACC CAGGCGATCA ACGATCGCTG GAACGTCTAC
GCCGGTACCA CCTGGACCCG CTGGAGCCAG CTGGAAAAGA TCACCGTCAA GAACTCCGGC
GTTCAGCCAC TGCTGGCTGG CCAGTTCGGC GAGATCACCG AAGAACAGAA CTGGCATGAC
ACCTGGGCGT ACGCCATCGG TACGTCCTAC CAACTGAACA AGGAATGGGT ACTGCGTACC
GGTCTGACGT TCGACCAGTC GCCGACCAAC AACGTCGACC GTTCGCCACG CATACCGACC
GGCGACCGGA CCATCTTCAG TATCGGTGCC GGCTGGAGCC CGACCGAAGA CCTGACCATC
GACGTTGCCT ACTCGTACCT GAAGGAAGAG AAGGTCAACA TTCGCAACAC AAACGATCGT
GGCCAGAGCT ACAACTCGCA GTATGAAAAC TCGGCAAACG GCTTCGGTGT CGGCGCAACC
TACCGCTTCT GA
 
Protein sequence
MKKVMLKTTL SLAVTVASTQ IFAAGFAINE HSISGMGTGY AGRSSSADDA STVYGNPAGM 
SRITREQVTG GVAFLDAKTD ISDASSSPNG GSNKGDMVPF TSVPMGYYVK PIDEHWAFGL
GVYVPFGLIT DYENGFAGRY FGSKSEVKII TFQPTVSYKF NDVVSIGFGP TINRIDGSLE
SNLSITQAAP DGKVKIKGDD TALGYNIGVL VQATDSTRLG LTYHSKVDYK LEGNTKVNYG
VLGAIGLGAN QKYDASLKIT TPESVDLSVT QAINDRWNVY AGTTWTRWSQ LEKITVKNSG
VQPLLAGQFG EITEEQNWHD TWAYAIGTSY QLNKEWVLRT GLTFDQSPTN NVDRSPRIPT
GDRTIFSIGA GWSPTEDLTI DVAYSYLKEE KVNIRNTNDR GQSYNSQYEN SANGFGVGAT
YRF