Gene Pmen_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPmen_2101 
Symbol 
ID5109835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas mendocina ymp 
KingdomBacteria 
Replicon accessionNC_009439 
Strand
Start bp2329908 
End bp2331752 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content66% 
IMG OID640503345 
Productsulfatase 
Protein accessionYP_001187594 
Protein GI146307129 
COG category[R] General function prediction only 
COG ID[COG3083] Predicted hydrolase of alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.102341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0551221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCAGCC ATCCATCGAC GCGCCGCGCC GACTCGCTGT ATTGGTTCTT CACCTGGCTC 
GCCGTCGCCG GGCATCTGTC CCGCGCCTTC GACCCGCCGG CGAGCCTCGC TCTGGGGCTG
TACCAGCTGG TGTTGCTCGG TAGCTACGCG CTGCTGTTCA TCGCTCCTCT GTGGTTGCTG
TGCCAGCTTG GCAGCCGGCT GCGCCGCAGT CTCGGCCTGA CGCTGTCGGT GCTGCTGGCA
GGCCTGCTGC AGTTGCTGAT CTACGCCGAC GGACTGCTTT GGCAGCTGTA TGGCTTTCAC
CTCAACGGTT TCGTCTGGAA CATCCTCACC ACGCCGGGCG GTATCGCGGC CCTGGGCTCT
TCCGAATCGA CCCAGAGCGG CTTCGCCCTG ATCGCCGCCG CGCTGTTGCT GGGCCAGGCG
CTGCTGCGCC TGCTGGCCAG TGCGCTGGCC AGATGGCAAC CGCGGCTACC CGCGCCGCGC
TGGCTGTGGG TGCTGCCGCT GTTTCTCCTG GCCACGCTGG GCGAGCGCGT CAGCTACGGC
GTCAGCCACT TCTACGGCTA CAGCCCCCTG CTCGAGACCG CTCAGCGCAT GCCCTTCTAT
CAGCCGCTGA CCATGCGCCG CTTCCTCGAG CAACAGCTCG GCCTGCAGCG ACCGCAGCGT
CTGGAGCTGG AAAACGTCGC ACTCAAGGGC CAGCTCAAGT ATCCGCAGGC GCCGCTGCGT
CTGACGCGGC CGGACAAACC GCTGAACCTG GTGTGGCTGG TGGCCGAATC CTGGCGTGCC
GACAGCCTCA ACCCACGGGT GATGCCGCAG ACCGACGCCT TCGCGGCGCG CGCGCAACGC
TTCGACAGTC ATTTCTCCGG TGGCAACGGC ACCCGTATCG GCATGTTCAG CCAGTTCTAC
GGCCTACCGG CCAACCTCTG GTTTCCGGTA CTGGATGCGC GTATCGGCAG CCCGCTGATC
GACGTGTTGC AACAGCAGGA CTACCAGATG CGCCTGTTCA CCAGCGCCAA GTTCAGCTAT
CCGGAGTTCG ACAAGACCCT GTTCGTCAAG GTGCCACCGG CGCAGATGCA ATCCTATGAC
CGCGGGCCGA GCTGGCAGCG CGACCGCAAG AACGTCGACG ACCTGCTGCA GTTCATCGAC
CAGCGCGACC GTGCGAAGCC TTTCATGACC TTCATGTTCT TCGAGTCGCC GCACGCCAAC
TACGACTTCC CGCCCGAGTC GGTGATCGAG CCGGACTACC TACCGGACTT CAGCTACGCC
AGCATGGACC TGGAGCGCGA CATCGACGGC ATCTACAAGC GCTACCTGAA CGCCGTGCAC
CACCTTGACG GGCAGATCGC CCGGGTCGTC GACCATCTCG AACAGCGCGG GCTGCTGGAC
GACACGCTGA TCGTGATCAC CGGCGATCAT GGCGAAGAGT TCATGGATAA TGGCCGCTGG
GGCCACAACT CCACCTTCGT CGATGCCCAG CTGCGCGTGC CGCTGGTGCT CTGGGTGCCG
GGCCGCGAGG CGCAGCGCAC CGAGCTGCGC ACCAGCCATG TCGACCTGCT GCCAACCCTG
CTGCCGCTGC TGGGAGTGAA CAACCCGGCG CATGACTACA GCATCGGCCA GAGCCTGTTC
AGCCCCAGTT CGCCGCGGCT GCTGGTGGCT GGCGACTGGG ACCGCCTGGC CTTCCTCGGC
GAACGGCACA AGGTGGTGCT GCCATTCACC AGCGGCAGTT TCACCGCCCT GCAGGCCAGC
CGAGCCGATG ATCGGCACCT GGCGAACGCC GCCAGCGTGC TGCAACAGGC TCTGCCACAG
ATCCGCAGCG AGCTGCAGGG CTTCAGACGC TTCCTCGCGC ACTGA
 
Protein sequence
MSSHPSTRRA DSLYWFFTWL AVAGHLSRAF DPPASLALGL YQLVLLGSYA LLFIAPLWLL 
CQLGSRLRRS LGLTLSVLLA GLLQLLIYAD GLLWQLYGFH LNGFVWNILT TPGGIAALGS
SESTQSGFAL IAAALLLGQA LLRLLASALA RWQPRLPAPR WLWVLPLFLL ATLGERVSYG
VSHFYGYSPL LETAQRMPFY QPLTMRRFLE QQLGLQRPQR LELENVALKG QLKYPQAPLR
LTRPDKPLNL VWLVAESWRA DSLNPRVMPQ TDAFAARAQR FDSHFSGGNG TRIGMFSQFY
GLPANLWFPV LDARIGSPLI DVLQQQDYQM RLFTSAKFSY PEFDKTLFVK VPPAQMQSYD
RGPSWQRDRK NVDDLLQFID QRDRAKPFMT FMFFESPHAN YDFPPESVIE PDYLPDFSYA
SMDLERDIDG IYKRYLNAVH HLDGQIARVV DHLEQRGLLD DTLIVITGDH GEEFMDNGRW
GHNSTFVDAQ LRVPLVLWVP GREAQRTELR TSHVDLLPTL LPLLGVNNPA HDYSIGQSLF
SPSSPRLLVA GDWDRLAFLG ERHKVVLPFT SGSFTALQAS RADDRHLANA ASVLQQALPQ
IRSELQGFRR FLAH