Gene YPK_0972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_0972 
Symbol 
ID6090745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp1089409 
End bp1090611 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content46% 
IMG OID641596035 
Productarabinogalactan endo-1,4-beta-galactosidase 
Protein accessionYP_001719726 
Protein GI170023221 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3867] Arabinogalactan endo-1,4-beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.396202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTCT TTAAACCGGC ATTACTGACT GTTTGTTTAT CGCTAAGCCT CATGGTTGGG 
GCTAATGCGG CAGAACCCTT TACCATTGCG CCATTAAAAA ATGTCCCAGC AGATTTTATT
AAAGGGGCTG ATATTTCCAC GTTAGCGGAA GTGGAACGAC AAGGTGGCAA ATTTTTTAAT
GAACAAAATG TTCAACAAGA TGCAATGGCT ATCCTGAAGG CTAACGGCGT GAATTATGTG
CGTCTGCGTT TGTGGGTCGA TCCCAAAGAC AGCGATGGGC AGAGTTACGG CGGTGGCAGT
AACGATTTGG CGACCACTTT GGCGCTGGCT AAACGTGCGA AAGCCCAGGG TTTAAAGGTA
TTGCTTGATT TCCATTACAG CGATTTCTGG ACCGATCCAG GTAAGCAATT TAAGCCGAAG
GCTTGGCAGA AAATGAATTA CGACCAGCTT AAAGTCGCCA TTCATGACTA TACCCGCGAT
ACCATTGCCA CCTTCAAAAA AGAGGGTGTC TTGCCTGATA TGGTGCAAAT CGGCAATGAA
TCCAATGGCG GTCTTCTCTG GCCAGAAGGA AAAAGCTGGG GGGAAGGCGG TGGTGAGTTT
GATCGGCTGG CGGGTTTGCT GAATGCGGCC ATCGGCGGTT TACGTGAGAA CCTCAGTTCC
CCTTCAGATG TGAAAATCAT GCTGCACCTC GCTGAAGGCA CCAAGAATGA CACCTTCCAT
TGGTGGTTTG ATGAAATAAC CAAACGTAAT GTGCCGTTCG ATATTATTGG TCTGTCGATG
TACACCTACT GGGACGGCCC GATTAGCGCC TTGCAAACCA ACATGGATGA TATCAGCCAG
CGTTACCAAA AAGATGTCAT CGTCGTGGAA GCCGCTTATG GCTATACCTT GGAAAATTGT
GATAACGCCG AAAATAGCTT TACCGCTAAA GAAGAGAAAG ATGGGGGTTA TCCCGGAACG
GTTCAAGGAC AAGCGAATTT CATTCATGAT CTGATGCAGA GTGTTATTAA TGTCCCCGAT
GGCAGAGGGA AGGGGATATT TTACTGGGAG CCTACCTGGA TTTCTGTTCC GGGAAATACT
TGGGCAACAC CGGCTGGAAT GAAATATATC AATGATAATT GGAAAGAAGG TAATGCACGT
GAAAATCAGG CGTTATTTGA TTGCCAAGGA AAAGTATTGC CTTCGATGAA AGTTTTTAAT
TAA
 
Protein sequence
MKFFKPALLT VCLSLSLMVG ANAAEPFTIA PLKNVPADFI KGADISTLAE VERQGGKFFN 
EQNVQQDAMA ILKANGVNYV RLRLWVDPKD SDGQSYGGGS NDLATTLALA KRAKAQGLKV
LLDFHYSDFW TDPGKQFKPK AWQKMNYDQL KVAIHDYTRD TIATFKKEGV LPDMVQIGNE
SNGGLLWPEG KSWGEGGGEF DRLAGLLNAA IGGLRENLSS PSDVKIMLHL AEGTKNDTFH
WWFDEITKRN VPFDIIGLSM YTYWDGPISA LQTNMDDISQ RYQKDVIVVE AAYGYTLENC
DNAENSFTAK EEKDGGYPGT VQGQANFIHD LMQSVINVPD GRGKGIFYWE PTWISVPGNT
WATPAGMKYI NDNWKEGNAR ENQALFDCQG KVLPSMKVFN