Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2528 |
Symbol | |
ID | 7976297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2556716 |
End bp | 2558014 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644799329 |
Product | spore coat assembly protein SafA |
Protein accession | YP_002950489 |
Protein GI | 239827865 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02899] spore coat assembly protein SafA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAATCC ATATTGTCCA AAAGGGCGAT ACGCTTTGGA AAATTGCGCA AAAGTATGGG GTCGATTTTG AGCAGCTAAA GAAAATGAAC GGTCATTTGA GCAATCCAGA TATGATTATG CCGGGAATGA AAATTAAAGT ACCGACTGCG GGAGTTCCGG TAAAAAAAGA AGCGCCGAAA AAAGAAACAA AAATTAATCT GGCTCCGAAA AAAGAAGAAA ATATAGAGCA CCCGTATGCC CATACAAAGC CGTTTGCATC GGTGAACATT GAGGCGGAGT TTTCGGAGCC TATAAAAGAA GAAAAGGTAA ACAAGGCTCC CGAGGCAGTG GTGAACGAAG CGCCGAAAGC GCCGATCAAG GAAGCGCCAA AGGCACCGAT CAAGGAAGCG CCAAAAGCGC CGATCGAGGA AGCGCCAAAG GCACCGATCA AGGAAACGCC AAAAGCGCCG GTGAACGAAG GGACGAAATC AGATAATGCC GCACTAGGTT CTGAGGAAAA GCAGTTCAAC ATTCCACCGT TATCGCATAC GATTCCACCA GTAACGCCAA ACGTAAATAT TAACTTTTCT AATGCGATTT CTAATGTTCC GCCGATTCCG CCAAAACCAG AAAATATTTT ACCAGGAATC ATGAAACAAG AATTAGAAAG CCCTGCTGAG GCGGTAGAAG AAAAAGAATT AGCAGCGGAT GATGATACAC CACCAGAACT TCCAAAAGCT CCATACGTTC CGATGATGCA GCAGCCATAT GCGATGGGAG GAGCGCCTGT TGCACCAATG CCACCACAGC CGTGTGGCCC GGTAACTCCT ATTTTACCAG GTGCGGGATA TTATTTTCCG CCTATGCCAA CCATGCCAGT TAACTATCCA ACATATCTAC AACCATCAAC ATACGGGGAT GCGGAAAGTG GTTCTCATCC ATTTCCTGGT ATTGAAGAAA GCGGCGGTGA ATCAGGTGAA GTGCCACTAA TGCCAGGGCA TACATCACCA ACAGTAAATC CGGCATCGGC AAATATGCCA CCATTTCCTT CATATTCACC TGCTTCTTCT TATCCGATTA TGCCATGTGC ACCGATTTTA CCAGTAATGC AAGGATATGG ATGGCATCCA GCGTTTTATC CATACATGCC GCCGGCGTCG TACGGCTACT ATCCGCCAGC TGCACCTGCT CCTTATCCAT ATCCGGCGGC AGGGACGGCA TATCCATTCA CACAAGCACC AACCACTTCA GCGTTTCCTC GTACGGAAGA ACAATTATTT ACAGAACCGC ATGACGAGGA GAGTAACGAT TATTGGTGA
|
Protein sequence | MKIHIVQKGD TLWKIAQKYG VDFEQLKKMN GHLSNPDMIM PGMKIKVPTA GVPVKKEAPK KETKINLAPK KEENIEHPYA HTKPFASVNI EAEFSEPIKE EKVNKAPEAV VNEAPKAPIK EAPKAPIKEA PKAPIEEAPK APIKETPKAP VNEGTKSDNA ALGSEEKQFN IPPLSHTIPP VTPNVNINFS NAISNVPPIP PKPENILPGI MKQELESPAE AVEEKELAAD DDTPPELPKA PYVPMMQQPY AMGGAPVAPM PPQPCGPVTP ILPGAGYYFP PMPTMPVNYP TYLQPSTYGD AESGSHPFPG IEESGGESGE VPLMPGHTSP TVNPASANMP PFPSYSPASS YPIMPCAPIL PVMQGYGWHP AFYPYMPPAS YGYYPPAAPA PYPYPAAGTA YPFTQAPTTS AFPRTEEQLF TEPHDEESND YW
|
| |