Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0479 |
Symbol | |
ID | 7978628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 531920 |
End bp | 533527 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644797456 |
Product | 4-phytase |
Protein accession | YP_002948656 |
Protein GI | 239826032 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0187568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAGGA AAAAAATATG GATAACATTG CTTGCATTGA TGCTTGTATT GTCCATGGCA CTAGTTGGCT GTGGAAAATC AGAAAAGACA AGCAGCGGTG AAAAAGGCAA ATCATCATCT CAAGACACGC TCGTATATGG ACGCGGCGAT GATTCCGTCT CGCTCGATCC AGCTACGGTC ACCGATGGCG AATCGTTAAA AGTGACGAAA AACATTTTCG ATACACTGCT AGATTATAAC GATGGCGATA CAACGGTAAA ACCGGCGTTA GCAACAAAAT GGACGATTTC CGATGATGGA TTGACGTACA CATTTGAGCT CCGCAAAGGC GTGAAGTTCC ATGATGGAAC GGATTTCAAC GCGGAAGCAG TCGTATTTAA CTTCGAGCGT TGGGCGAACG GCAACGCCGA AAAATTCCCG TATTACGGCT CGATGTTTGG CGGCTATAAA AACGATGAAA GCCATGTCAT CAAAGAAGTA AAAGCAGTGG ACGATTACAC GGTTCAATTT GTGTTGAAAC GTCCGCAGGC TCCGTTTTTG AAAAACATTG CAATGCCGCC GTTTGCGATT GCCAGCCCGG CAGCGATTAA AAAATCTGGA GACAAATTTG GCGAAAATCC GGTTGGAACT GGTCCATTCG TCTTTAAAGA ATGGAAACGA AACGAGCGCA TTGTCCTTGA AAAAAATAAA GATTATTGGG AAAAAGGCTA TCCAAAGCTT AATCAATTAA TCTTTGTTTC GATTCCGGAT AACTCTGCTC GCCTAAATGC GCTATTAAAA GGCGAAATTG ATTTAATGGA AGATGTAAAT CCGAACGATT TGAAACAAAT TGAAGGAAAT AAAGATTTGC AAGTTTTTAA ACGTCCATCG ATGAATGTTG GTTACGTCGG ATTGACGACG ACGAGAGGAC CGTTGAAAAA CAAATTAGTC CGTCAAGCAT TGAACTATGC GGTGGACAAA AAAGCGATCA TTGACGCATT TTATGCCGGC CAGGCGGAAT CGGCGAAAAA CCCGATGCCG CCGAGCATTC CTGGATATAA CGATGAAATT CAAGATTATC CGTTTGATTT GAACAAGGCG AAAGAATTAC TTGCAAAAGC AGGCTATCCG AACGGTTTTG AAATGGAATT ATGGGCAATG CCAGTGCCTC GTCCATATAT GCCGGATGGC AAAAAAATAG CGGAAGCACT TCAAGAAAAT TTCGCGAAAA TTGGTGTAAA AGCGAAAATT GTTACGTATG AATGGGCAAC GTATTTAGAA AAAGCAGCAA AAGGGGAAGC AGATGCGTTC TTGCTTGGTT GGACAGGAGA TAACGGTGAC GCGGACAACT TCTTGTACGC GCTGCTCGAC AAGGACAGCA TCGGCAGCAA CAACTATACG TATTACTCAA ATGACGAGCT TCATAAAATT CTAGTCGAAG CGCAAACGAT CAGCGATGAA AATAAACGGA ATGACCTTTA CAAAAAAGCG CAAGAAATTA TCAAAGAGGA TGCACCATGG ATTCCGCTCG TCCACTCGAC GCCACTGCTT GCTGGGAAAG CAAACATTAA AGGCTATAAC CCGCATCCGA CCGGTTTGGA TAAATTTACA AAAGTTGAAT TCGAATAA
|
Protein sequence | MVRKKIWITL LALMLVLSMA LVGCGKSEKT SSGEKGKSSS QDTLVYGRGD DSVSLDPATV TDGESLKVTK NIFDTLLDYN DGDTTVKPAL ATKWTISDDG LTYTFELRKG VKFHDGTDFN AEAVVFNFER WANGNAEKFP YYGSMFGGYK NDESHVIKEV KAVDDYTVQF VLKRPQAPFL KNIAMPPFAI ASPAAIKKSG DKFGENPVGT GPFVFKEWKR NERIVLEKNK DYWEKGYPKL NQLIFVSIPD NSARLNALLK GEIDLMEDVN PNDLKQIEGN KDLQVFKRPS MNVGYVGLTT TRGPLKNKLV RQALNYAVDK KAIIDAFYAG QAESAKNPMP PSIPGYNDEI QDYPFDLNKA KELLAKAGYP NGFEMELWAM PVPRPYMPDG KKIAEALQEN FAKIGVKAKI VTYEWATYLE KAAKGEADAF LLGWTGDNGD ADNFLYALLD KDSIGSNNYT YYSNDELHKI LVEAQTISDE NKRNDLYKKA QEIIKEDAPW IPLVHSTPLL AGKANIKGYN PHPTGLDKFT KVEFE
|
| |