Gene Pecwa_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPecwa_4044 
Symbol 
ID8532477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePectobacterium wasabiae WPP163 
KingdomBacteria 
Replicon accessionNC_013421 
Strand
Start bp4450750 
End bp4452360 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content53% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003261380 
Protein GI261823274 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCAC GAGCAATACT CCGTACCCTG CTGTTAGGTT CACTCTGCGT GGCCGCTACG 
CCGGCCATTC TATCGGCAAA AACCCCTGAC GATCAGCTGA TCGTGGGGAT GAACATGAAC
AACATGCTCT CACTCGATCC GGCCGCTATG ACCGGTAACG AAGTGGTGGG CATTATCGTC
AACCTCTATG ACTCGCTGGT GGTGCTCGAC CCGGCTAACC TGAGCAACAT CCTGCCTTCT
CTGGCGAAAA GCTGGAGCGT CAGTGATGAT GGTAAGGTCA TTACCTTCAA TCTGGTGGAT
AACGCCAAAT TCCATTCCGG TAACCCCGTA ACGGCGCAAG ACTTCGCCTG GTCGATGAAG
CGTCTGCTTA ACCTCAACAT GGCTCAGGCC ACGACGTGGA AATCCTACGG CTTCACCGCG
GAAAATGTGG AGAAAATGAT CCGTGCCAAA GATGCTCACA CCGTTGAAAT CGAACTGCCA
AAACCGAACG ACCCCAAGCT GGTGATCTAC TCGCTGGCTA CGTTGGGTAG CGGCTCGGTG
CTGGACAGTA AAACCGTCAT GCAACATGAG AAAAACGGCG ACTGGGGCAA TGGCTGGCTA
ACCACCAATG AAGCCGGTTC TGGCCCGTTC AAGCTGGACG TGTGGCAGGC GAAAGACGTG
CTGCGCATCA GCAAAGTTGA GAATAACTGG CAGGGCGACG CGAAAATGCG GCGCGTGATT
TTCCGCCATA TGACCGAATC ACAGGCGCTG CGTCTGATGA TTGAAAAAGG CGACATTGAC
GTGGCCTCTG GGATGTCGGT GCCAGATATC AATGCGTTGA AACAAGACAA AGATGTTGTG
GTTGATGAGG TGAAAAAAGG CACGCTGTAC TACGTTGCAA TGAGCCTGAA AAACGAATAC
TTCGCTAAAC CGAAAGTGCG CGAAGCGGTT CGTTACCTGA TTGATTACGA TGGCGTCAAT
AAGACGGTGA TGCCTGGCTA CGGCTTCTAT CATCAGCGCC CCATCCAAAA AGGGATGGAT
GCGACGTTGC CGGACCCTGG CTACAAGCTG GATGTGCCAC GCGCTAAAGC GCTGTTGGCC
GAAGCGGGCT ACCCGAATGG ATTTGAAACC ACACTGCGTG TTCTTTCCGA TCAGCCGTTC
CTGAATCTGG CGACGTCGGT ACAGTCCACG CTGGCACAGG CGGGCATCAA AGCCAAAATC
ATCTCCGGCA CTGGTAATCA GGTTTACGGC GCGATGCGCG ATCGTAACTT CGATATGCTG
GTTGGCCGCG GTGGTGGTGG CGTCGATCCG CATCCTCACT CCAGCTTGCG TTCTGTTGTC
TATAACCCGG ATAACAGCGA CGAAGCCAAA CTGACCAACT TCCAAGGCTG GCGCACCTCT
TTCTACGACA AACCCTTGAA CGACATGATC GATCAGGCGT TGTTGGAGAA AGATCCTCAG
AAACAGAAGC AGATGTATAT CAACGTACAG AATCGCTATG AAGAACTGTT CCCGGCCATT
ATTCCGGTGT CGCAGATGAT CGATTCTGTG GTGTTGCGTA AGGACGTAAA AGGCTATGTG
CCGCATCCGT CTTCCACGAC GCATCTGCGC GAGGTTTATA AGCAGCGCTA G
 
Protein sequence
MKARAILRTL LLGSLCVAAT PAILSAKTPD DQLIVGMNMN NMLSLDPAAM TGNEVVGIIV 
NLYDSLVVLD PANLSNILPS LAKSWSVSDD GKVITFNLVD NAKFHSGNPV TAQDFAWSMK
RLLNLNMAQA TTWKSYGFTA ENVEKMIRAK DAHTVEIELP KPNDPKLVIY SLATLGSGSV
LDSKTVMQHE KNGDWGNGWL TTNEAGSGPF KLDVWQAKDV LRISKVENNW QGDAKMRRVI
FRHMTESQAL RLMIEKGDID VASGMSVPDI NALKQDKDVV VDEVKKGTLY YVAMSLKNEY
FAKPKVREAV RYLIDYDGVN KTVMPGYGFY HQRPIQKGMD ATLPDPGYKL DVPRAKALLA
EAGYPNGFET TLRVLSDQPF LNLATSVQST LAQAGIKAKI ISGTGNQVYG AMRDRNFDML
VGRGGGGVDP HPHSSLRSVV YNPDNSDEAK LTNFQGWRTS FYDKPLNDMI DQALLEKDPQ
KQKQMYINVQ NRYEELFPAI IPVSQMIDSV VLRKDVKGYV PHPSSTTHLR EVYKQR