Gene Pmen_2047 details

Gene Information

Locus tag: Pmen_2047
Symbol:
ID: 5109781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas mendocina ymp
Kingdom: Bacteria
Replicon accession: NC_009439
Strand:
Start bp: 2261408
End bp: 2263150
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 64%
IMG OID: 640503288
Product: extracellular solute-binding protein
Protein accession: YP_001187540
Protein GI: 146307075
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
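
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record.
start_bp = 2261408
end_bp = 2263150

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should span a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1743 0 580
```

Both derived values agree with the Gene Length (1743 bp) and Protein Length (580 aa) fields in the record.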


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.120072
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.523187
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGACA ATAACAACAA GACCCGACAT AGCATGGCGT TGGCCGCCTT GCTGGCGTTG 
TCCGGTTTGA GCGGTGCGGC CTGGGCGGAT GCCTATGAAG AGGCCGCGAA GAAATGGATC
GCCGAGGAGT TCAACCCCTC GACCCTGACG CCCGAGCAGC AGCTCGAAGA GCTGAAATGG
TTCATCAAGG CCGCCGAGCC GTTTCGCGGC ATGAACATCA GCGTGGTGTC GGAAACCATC
ACCACCCACG AGTACGAGTC CAAGGTGCTG GCCAAGGCGT TCAGCGAGAT CACCGGGATC
AAGCTCAAGC ACGACTTGCT GCAGGAAGGC GACGTGGTGG AGAAGCTGCA GACGCAGATG
CAGTCGGACA AGAACATCTA CGACGGCTGG GTCAACGACT CCGACCTGAT CGGCACCCAC
GCGCGTTACG GCAAGGCGGT GGCGATCAGC GACATGATCG AGGGCGAGGG CAAGGACTTC
ACCTCGCCGA CCCTGGATCT CGAGGACTTT ATCGGTCTGT CCTTCACCAC CGGGCCGGAC
GGCAAGCTCT ATCAGCTGCC CGACCAGCAG TTCGCCAACC TCTACTGGTT CCGCGCCGAC
TGGTTCGAGC GCCCGGAGCT GAAAGCCAAG TTCAAGGAAA TCTACGGCTA CGAGCTGGGG
GTGCCGGTCA ACTGGTCGGC CTATGAGGAC ATCGCCGAGT TCTTCTCGGT GCACGTCAAG
GAAATCGACG GTCAGCGCGT CTACGGCCAC ATGGACTACG GCAAGAAGGA CCCGTCGCTG
GGCTGGCGCT TCACCGATGC CTGGTTCTCC ATGGCCGGTG GCGGCGACAA GGGCCTGCCA
AACGGGTTGC CGGTAGACGA GTGGGGCATT CGCGTCGACG GCTGCCGGCC GGTGGGCTCC
AGCGTTGCCC GCGGCGGCGA CACCAACGGT CCGGCGGCGG TGTTCGCGAC GCAGAAATAC
GTCGACTGGA TGCGCCAGTA CGCACCGCGT GAGGCCCAGG GCATGACCTT CTCCGAGGCG
GGCCCGGTGC CGGCGCAGGG TGCCATCGCC CAGCAGATCT TCTGGTACAC CGCCTTTACC
GCCGACATGA CCAAGCCGGG CCTGCCGGTG GTCAACGAGG ACGGCACGCC GAAATGGCGT
ATGGCGCCGT CGCCGAAAGG GCCGTACTGG GAGGAGGGCA TGAAGCTTGG TTATCAGGAT
GCCGGTTCCT GGACCTTCCT CAAGTCCACC CCGGAGAAGC AGCGCCTGGC CGCCTGGCTC
TATGCGCAGT TCGTCACTTC CAAGACCGTG TCGCTGAAGA AGACCCTGGT CGGCCTGACG
CCGATCCGCG AGTCGGACAT CAACTCACAG GCCATGACCG ACGCCGCGCC CAAGCTCGGC
GGACTGGTGG AGTTCTACCG CAGCCCGGCC CGCGTGCAGT GGACGCCGAC CGGTACCAAC
GTGCCGGATT ATCCGCGTCT GGCCCAGCTG TGGTGGCAGT TCATCGCCGA GGCCGCCAGT
GGCGACAAGA CCCCGCAACA GGCGCTCGAT GGCCTGGCCG CAGCGCAGGA CACCATGCTC
GGCCGTCTGG AGCGCTCCGG CGTGCTGGGT GAATGCGGGC CGAAGCTGAA CGAGCCGCGG
GATCCGCAGT ACTGGTTCGA TCAGCCCGGT GCGCCCAAGC CCAAGCTGGA GAACGAGAAG
CCGCAGGGCG AAACCATCGC CTACGACGAG CTGCTCAAGT CCTGGGAGCA GGCGCGCAAC
TGA
 
Protein sequence
MFDNNNKTRH SMALAALLAL SGLSGAAWAD AYEEAAKKWI AEEFNPSTLT PEQQLEELKW 
FIKAAEPFRG MNISVVSETI TTHEYESKVL AKAFSEITGI KLKHDLLQEG DVVEKLQTQM
QSDKNIYDGW VNDSDLIGTH ARYGKAVAIS DMIEGEGKDF TSPTLDLEDF IGLSFTTGPD
GKLYQLPDQQ FANLYWFRAD WFERPELKAK FKEIYGYELG VPVNWSAYED IAEFFSVHVK
EIDGQRVYGH MDYGKKDPSL GWRFTDAWFS MAGGGDKGLP NGLPVDEWGI RVDGCRPVGS
SVARGGDTNG PAAVFATQKY VDWMRQYAPR EAQGMTFSEA GPVPAQGAIA QQIFWYTAFT
ADMTKPGLPV VNEDGTPKWR MAPSPKGPYW EEGMKLGYQD AGSWTFLKST PEKQRLAAWL
YAQFVTSKTV SLKKTLVGLT PIRESDINSQ AMTDAAPKLG GLVEFYRSPA RVQWTPTGTN
VPDYPRLAQL WWQFIAEAAS GDKTPQQALD GLAAAQDTML GRLERSGVLG ECGPKLNEPR
DPQYWFDQPG APKPKLENEK PQGETIAYDE LLKSWEQARN
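
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is pasted in the grouped layout shown above (blocks of ten letters separated by spaces) and uses codon assignments that translation table 11 shares with the standard genetic code. Table 11 differs mainly in which codons may act as starts, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record.
first_line = "ATGTTCGACA ATAACAACAA GACCCGACAT AGCATGGCGT TGGCCGCCTT GCTGGCGTTG"
print(translate(first_line))  # MFDNNNKTRHSMALAALLAL
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same kind of check against the Protein Length and GC content fields.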