Gene Pfl01_4499 details

Gene Information

Locus tag: Pfl01_4499
Symbol:
ID: 3717758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas fluorescens Pf0-1
Kingdom: Bacteria
Replicon accession: NC_007492
Strand:
Start bp: 5074043
End bp: 5075992
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 60%
IMG OID:
Product: sulfatase
Protein accession: YP_350227
Protein GI: 77460720
COG category:
COG ID:
TIGRFAM ID:
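
The coordinates and lengths listed above are related by simple arithmetic, and the short Python sketch below checks them against each other. It assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(5074043, 5075992)
print(length_bp)                  # 1950
print(protein_length(length_bp))  # 649
```

Both results agree with the Gene Length and Protein Length fields of the record.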


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000368912
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGACTTCC TCAAGACGGC GCCTGTGCGC TTTTTGCTGC TGGTCACCGG CGCCTGGCTG 
GTGGTGTTTC TCCTGACCCG AAGCGTGCTG TTGCTGACTC ATCTGGACGA GGCCGGCGGC
GTCGCGCTTT CCGTGTTCGG CATCGGCTTG CTGTATGACC TGGGCTTTCT CGCCTATGCC
GCGCTGCCGA TGGGCCTGTA TTTACTGCTG TGTCCGCCGG CCTTGTGGCG CCGGCGCGGT
CATCGCTGGT TCCTTCAGGG ACTGTTGACG GTCAGTCTGT TCGCCATGCT GTTCACCTCC
GTGGCCGAAT GGCTGTTCTG GGACGAGTTC GGCGTGCGCT TCAATTTCAT CGCCGTCGAC
TATCTGGTGT ATTCCGATGA AGTCCTGAAC AACGTGCTGG AGTCGTATCC CATCGGCATG
TTGCTGAGCA TCCTTGCGCT GCTCGCCGTC GCATTGAGTT TCGCGTTGCG CAAACCTTTC
AACGCGGCGC TGGACGCCCC GCTGCCGCCA CTGCGCGGTC GCCTGCTCAA TGCCCTCGGT
CTGTTGGTGG TCGCGGGACT GAGCCTGCAA CTGCTCAGTC AGGACGCGCC GCGCGCTCAG
GGCGGCAATG CCTATCAGAA TGAACTGGCG AGCAATGGCC CGTATCAGTT TTTCGCCGCG
TTCCGTAACA ATGAACTGGA CTATGGCCAG TTCTACAACA GTCTGTCGCC GGAAAAAGTC
GCCGGCCAGA TTCGCGCTGA ATTGAGCGAA CCCAACGCGC GCTTCATCGG TCAGGATCCA
CAGGACATCC GCCGGTTGAT CGACAACCCG GGCACTGTGC GTAAACCGAA CATCGTGCTG
GTTACCATCG AAAGCCTGAG TGCCAAGTAC CTGGGCAGCA ATGGCGATGA ACGCAACCTG
ACACCGAATC TGGATGCCTT GCGCAAGCAG AGTCTGTACT TCAACAATTT CTACGCCACC
GGCACCCGCA CCGATCGCGG CCTGGAAGCC ATCACCCTGG CCATTCCGCC GACGCCCGGC
CGTTCGATCG TCAAGCGCAT CGGCCGCGAA AGCGGCTTCG CCAGCCTCGG CCAGCAACTC
AGCGCCGTCG GCTACGACAG CGTGTTCGTC TACGGCGGGC GCGGTTATTT CGACAACATG
AACGCGTTCT TCAGCGGCAA CGGCTACCGC GTCGTCGATC AGAGCAGCGT CGACGAATCG
GAAATTCATT TCAAGAATGC CTGGGGCATG GCCGACGAGG ATCTGTACAA GCAGACGCTG
AAACTGGCCG ATGCCGATCA CGCCAGACAG CAGCCATTCC TGTTGCAGCT GATGACCACG
TCCAACCACC GTCCTTATAC CTATCCGGAC AACCGGATCG ACATCAAGTC CGGCAACGGT
CGCGACGGCG CGGTGAAATA CACCGACTAC GCCATCGGCC AGTTCCTGGA GCAGGCGCGG
CAGAAACCGT GGTTCGACAA TACGATCTTC ATCTTCGTCG CCGACCACAC CGCCGGCAGC
GCGGGCAAGG AAGACTTGCC GATCAGCAAC TACCAGATCC CGCTGTTCAT CTATGCACCG
AAGTTGATCG AGCCACGGGA AAACGCGCAA CTGGCCAGCC AGATCGATCT AGCGCCAACC
CTGCTGGGGT TGCTGAACCT GGATTACCAA TCGACGTTCT TCGGTCGCAA CCTGCTGCAG
GACAACCCGC TGCCACCCCG GGTCGTGGTC GGCAACTATC AACATCTGGG ACTGTTCGAC
GGCAAGGATC TGGCGATCCT CAGCCCGCGC CAGGGCCTGC GTCGGCATGA CGATGCACTG
ACCGAAAGCC GCGAGTCCAA AGCCGCCAGC GACGACCCGC TGATCAGCCG CGCCATCACC
TATTACCAAA CCGCCAGTTA TGGCTTCAAG CAACAGCTGC TTGGCTGGAA AGCGCCCAAG
GAGGGCGCCG AGCAAGTCAG CGAACGTTAA
 
Protein sequence
MDFLKTAPVR FLLLVTGAWL VVFLLTRSVL LLTHLDEAGG VALSVFGIGL LYDLGFLAYA 
ALPMGLYLLL CPPALWRRRG HRWFLQGLLT VSLFAMLFTS VAEWLFWDEF GVRFNFIAVD
YLVYSDEVLN NVLESYPIGM LLSILALLAV ALSFALRKPF NAALDAPLPP LRGRLLNALG
LLVVAGLSLQ LLSQDAPRAQ GGNAYQNELA SNGPYQFFAA FRNNELDYGQ FYNSLSPEKV
AGQIRAELSE PNARFIGQDP QDIRRLIDNP GTVRKPNIVL VTIESLSAKY LGSNGDERNL
TPNLDALRKQ SLYFNNFYAT GTRTDRGLEA ITLAIPPTPG RSIVKRIGRE SGFASLGQQL
SAVGYDSVFV YGGRGYFDNM NAFFSGNGYR VVDQSSVDES EIHFKNAWGM ADEDLYKQTL
KLADADHARQ QPFLLQLMTT SNHRPYTYPD NRIDIKSGNG RDGAVKYTDY AIGQFLEQAR
QKPWFDNTIF IFVADHTAGS AGKEDLPISN YQIPLFIYAP KLIEPRENAQ LASQIDLAPT
LLGLLNLDYQ STFFGRNLLQ DNPLPPRVVV GNYQHLGLFD GKDLAILSPR QGLRRHDDAL
TESRESKAAS DDPLISRAIT YYQTASYGFK QQLLGWKAPK EGAEQVSER
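
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how they can be reused, the Python below strips the whitespace, computes GC content, and translates the DNA codon by codon. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs mainly in the start codons it permits, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
print(translate("ATGGACTTCC TCAAGACGGC GCCTGTGCGC"))  # MDFLKTAPVR
```

Pasting the complete gene sequence into `translate` should give a protein that can be compared with the listed protein sequence, and `gc_percent` can be compared with the GC content field.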