Gene GWCH70_0479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0479 
Symbol 
ID7978628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp531920 
End bp533527 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content43% 
IMG OID644797456 
Product4-phytase 
Protein accessionYP_002948656 
Protein GI239826032 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0187568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAGGA AAAAAATATG GATAACATTG CTTGCATTGA TGCTTGTATT GTCCATGGCA 
CTAGTTGGCT GTGGAAAATC AGAAAAGACA AGCAGCGGTG AAAAAGGCAA ATCATCATCT
CAAGACACGC TCGTATATGG ACGCGGCGAT GATTCCGTCT CGCTCGATCC AGCTACGGTC
ACCGATGGCG AATCGTTAAA AGTGACGAAA AACATTTTCG ATACACTGCT AGATTATAAC
GATGGCGATA CAACGGTAAA ACCGGCGTTA GCAACAAAAT GGACGATTTC CGATGATGGA
TTGACGTACA CATTTGAGCT CCGCAAAGGC GTGAAGTTCC ATGATGGAAC GGATTTCAAC
GCGGAAGCAG TCGTATTTAA CTTCGAGCGT TGGGCGAACG GCAACGCCGA AAAATTCCCG
TATTACGGCT CGATGTTTGG CGGCTATAAA AACGATGAAA GCCATGTCAT CAAAGAAGTA
AAAGCAGTGG ACGATTACAC GGTTCAATTT GTGTTGAAAC GTCCGCAGGC TCCGTTTTTG
AAAAACATTG CAATGCCGCC GTTTGCGATT GCCAGCCCGG CAGCGATTAA AAAATCTGGA
GACAAATTTG GCGAAAATCC GGTTGGAACT GGTCCATTCG TCTTTAAAGA ATGGAAACGA
AACGAGCGCA TTGTCCTTGA AAAAAATAAA GATTATTGGG AAAAAGGCTA TCCAAAGCTT
AATCAATTAA TCTTTGTTTC GATTCCGGAT AACTCTGCTC GCCTAAATGC GCTATTAAAA
GGCGAAATTG ATTTAATGGA AGATGTAAAT CCGAACGATT TGAAACAAAT TGAAGGAAAT
AAAGATTTGC AAGTTTTTAA ACGTCCATCG ATGAATGTTG GTTACGTCGG ATTGACGACG
ACGAGAGGAC CGTTGAAAAA CAAATTAGTC CGTCAAGCAT TGAACTATGC GGTGGACAAA
AAAGCGATCA TTGACGCATT TTATGCCGGC CAGGCGGAAT CGGCGAAAAA CCCGATGCCG
CCGAGCATTC CTGGATATAA CGATGAAATT CAAGATTATC CGTTTGATTT GAACAAGGCG
AAAGAATTAC TTGCAAAAGC AGGCTATCCG AACGGTTTTG AAATGGAATT ATGGGCAATG
CCAGTGCCTC GTCCATATAT GCCGGATGGC AAAAAAATAG CGGAAGCACT TCAAGAAAAT
TTCGCGAAAA TTGGTGTAAA AGCGAAAATT GTTACGTATG AATGGGCAAC GTATTTAGAA
AAAGCAGCAA AAGGGGAAGC AGATGCGTTC TTGCTTGGTT GGACAGGAGA TAACGGTGAC
GCGGACAACT TCTTGTACGC GCTGCTCGAC AAGGACAGCA TCGGCAGCAA CAACTATACG
TATTACTCAA ATGACGAGCT TCATAAAATT CTAGTCGAAG CGCAAACGAT CAGCGATGAA
AATAAACGGA ATGACCTTTA CAAAAAAGCG CAAGAAATTA TCAAAGAGGA TGCACCATGG
ATTCCGCTCG TCCACTCGAC GCCACTGCTT GCTGGGAAAG CAAACATTAA AGGCTATAAC
CCGCATCCGA CCGGTTTGGA TAAATTTACA AAAGTTGAAT TCGAATAA
 
Protein sequence
MVRKKIWITL LALMLVLSMA LVGCGKSEKT SSGEKGKSSS QDTLVYGRGD DSVSLDPATV 
TDGESLKVTK NIFDTLLDYN DGDTTVKPAL ATKWTISDDG LTYTFELRKG VKFHDGTDFN
AEAVVFNFER WANGNAEKFP YYGSMFGGYK NDESHVIKEV KAVDDYTVQF VLKRPQAPFL
KNIAMPPFAI ASPAAIKKSG DKFGENPVGT GPFVFKEWKR NERIVLEKNK DYWEKGYPKL
NQLIFVSIPD NSARLNALLK GEIDLMEDVN PNDLKQIEGN KDLQVFKRPS MNVGYVGLTT
TRGPLKNKLV RQALNYAVDK KAIIDAFYAG QAESAKNPMP PSIPGYNDEI QDYPFDLNKA
KELLAKAGYP NGFEMELWAM PVPRPYMPDG KKIAEALQEN FAKIGVKAKI VTYEWATYLE
KAAKGEADAF LLGWTGDNGD ADNFLYALLD KDSIGSNNYT YYSNDELHKI LVEAQTISDE
NKRNDLYKKA QEIIKEDAPW IPLVHSTPLL AGKANIKGYN PHPTGLDKFT KVEFE