Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0917 |
Symbol | |
ID | 5385812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 1100384 |
End bp | 1101586 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640863883 |
Product | glycosy hydrolase family protein |
Protein accession | YP_001399901 |
Protein GI | 153950471 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 0.671223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTCT TTAAACCGGC ATTACTGACT GTTTGTTTAT CGCTAAGCCT CATGGTTGGG GCTAATGCGG CAGAACCCTT TACCATTGCG CCATTAAAAA ATGTCCCAGC AGATTTTATT AAAGGGGCTG ATATTTCCAC GTTAGCGGAA GTGGAACGAC AAGGTGGCAA ATTTTTTAAT GAACAAAATG TTCAACAAGA TGCAATGGCT ATCCTGAAGG CTAACGGCGT GAATTATGTG CGTCTGCGTT TGTGGGTCGA TCCCAAAGAC AGCGATGGGC AGAGTTACGG CGGTGGCAGT AACGATTTGG CGACCACTTT GGCGCTGGCT AAACGTGCGA AAGCCCAGGG TTTAAAGGTA TTGCTTGATT TCCATTACAG CGATTTCTGG ACCGATCCAG GTAAGCAATT TAAGCCGAAG GCTTGGCAGA AAATGAATTA CGACCAGCTT AAAGTCGCCA TTCATGACTA TACCCGCGAT ACCATTGCCA CCTTCAAAAA AGAGGGTGTC TTGCCTGATA TGGTGCAAAT CGGCAATGAA TCCAATGGCG GTCTTCTCTG GCCAGAAGGA AAAAGCTGGG GGGAAGGCGG TGGTGAGTTT GATCGGCTGG CGGGTTTGCT GAATGCGGCC ATCGGCGGTT TACGTGAGAA CCTCAGTTCC CCTTCAGATG TGAAAATCAT GCTGCACCTC GCTGAAGGCA CCAAGAATGA CACCTTCCAT TGGTGGTTTG ATGAAATAAC CAAACGTAAT GTGCCGTTCG ATATTATTGG TCTGTCGATG TACACCTACT GGGACGGCCC GATTAGCGCC TTGCAAACCA ACATGGATGA TATCAGCCAG CGTTACCAAA AAGATGTCAT CGTCGTGGAA GCCGCTTATG GCTATACCTT GGAAAATTGT GATAACGCCG AAAATAGCTT TACCGCTAAA GAAGAGAAAG ATGGGGGTTA TCCCGGAACG GTTCAAGGAC AAGCGAATTT CATTCATGAT CTGATGCAGA GTGTTATTAA TGTCCCCGAT GGCAGAGGGA AGGGGATATT TTACTGGGAG CCTACCTGGA TTTCTGTTCC GGGAAATACT TGGGCAACAC CGGCTGGAAT GAAATATATC AATGATAATT GGAAAGAAGG TAATGCACGT GAAAATCAGG CGTTATTTGA TTGCCAAGGA AAAGTATTGC CTTCGATGAA AGTTTTTAAT TAA
|
Protein sequence | MKFFKPALLT VCLSLSLMVG ANAAEPFTIA PLKNVPADFI KGADISTLAE VERQGGKFFN EQNVQQDAMA ILKANGVNYV RLRLWVDPKD SDGQSYGGGS NDLATTLALA KRAKAQGLKV LLDFHYSDFW TDPGKQFKPK AWQKMNYDQL KVAIHDYTRD TIATFKKEGV LPDMVQIGNE SNGGLLWPEG KSWGEGGGEF DRLAGLLNAA IGGLRENLSS PSDVKIMLHL AEGTKNDTFH WWFDEITKRN VPFDIIGLSM YTYWDGPISA LQTNMDDISQ RYQKDVIVVE AAYGYTLENC DNAENSFTAK EEKDGGYPGT VQGQANFIHD LMQSVINVPD GRGKGIFYWE PTWISVPGNT WATPAGMKYI NDNWKEGNAR ENQALFDCQG KVLPSMKVFN
|
| |