Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_0881 |
Symbol | |
ID | 8524704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 879909 |
End bp | 883142 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003252030 |
Protein GI | 261418348 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGGT GGCGCGGCTG GCTTTTATGG GGAATGTTGT TTTGGCTGAT GGCAAGCCTA TGGCCGATGA AAGCAGGTGT TGTGTATGCC GATCAAACAG AGGGGGCCAT TACGTTCGTT TCGGATGAGA AGATTGAGGG ATGGGCGACC GCACCAGGGA GCGGCAACGG GCAAGGGAAC AGCATTCGCT ACTTTATGAT TCGCATTCAG CCGAAGGAAC AAAGCGGCAC GGCAGTCAGC TACCGAGCCG AAGCGGTGGA AGCGCCGGCG GACCGAAACG GTGAGAAAAA GTATACGTTT TCTCTTGATA TGCGCGGCAA GTGGCCGTCG GCTCCGGGAA CGACGGTGAC TTACCAAATC ACCGTCGATG CGTATCGCGT GTTAGGAAAT GGAAAAGAGG ATGTGTATTT TTCATTCCCA CAGACGCCTT ATCAATATAC GAGGCAAACT GGGGTGAGCA CGGCTAAATT GGATTTTTCC CTCTCATTTT CACAGCCGGA ATACGCCAAG CCGCCGAACG GCGATGCCCA AGGCCGGCTC GATGTGACGC TCGTTCCACA AGGCGCTGTT TCAGGAATCA TCCGCCCGCC GATCGATGTC GTGTTTGTCA TGGACGTGTC GGGGTCGATG ACGGCAATGA AGCTGCAAAG CGCCAAATCT GCTCTGCAAG CGGCGGTCAA CTACTTTAAG TCCAATTACA ACCAAAATGA CCGCTTTGCT CTTATTCCGT TTTCCGATGG AGTGCGGGAA GCAAGTGTGG TTCCGTTTGG GAAGTACTCG AATGTTGCAA GTCAGCTTGA TGCCATTCTC AATACCGGCA ACAGCTTGAC CGCCGGCGGC GGAACGAACT ATTCGGCTGC TTTGTCTCTT GCTAAATCGT ATTTCACGGA TCCAACGAGG AAAAAGTATA TCATTTTCTT AACGGATGGC ATGCCGACGG TGTTGAATGC AGTCGATACG ATTACGTATC GGGAGGTCAA GCAGAATTTT TGGGGTGGAT ATTCCTATAC AGGGAATAGA GTAACCGATT CTCTTTCGGT CACATACGAA TTGTACAGCG ATGGACGGAC TGCCGGTATC CGTTTTACCG ACAACAAAGG ATATTCTCGC CGATTTTACA GCGACGGCCA AGATTATGTT AATGGATGGC GGGTGTCTTG GGATAACGGC TATTCGTTCA CTTACAGCAG CATTGAAGGA AAAATCCGAG CCGATGCCAC AGCGGTGGCA AAAACGCTCG GGATGAACAA TATCACCCTT TACTCAATCG GCTTCGGCGA CAATGATGAA GTCGATATGG ACTATTTACG GTCGCTTTCA GCGACTGCCG GCGGCGAGGC GCGGCAAGGA ACGACGCAAA ATTTGACCGC TTTGTTCCAA CAGTTTTCCC AGCTTGCCAC CACGCCAGCG ATCACAGGAA CGATCCGCAT CCCGCTTTCG TCGTTTGGCG GCAATGTCGC CGTTGCGGAA AACGGCCAAG TATGGCTGGA TGAAAACAAG CAAAACGCGT ATATTTCGTT TTCCATCCCA TACCAAGTTG GTCAAGGCAC TCCGGCGCCG GTGACGATTC CGATTCCCGT GTCATTTAAA GCAAAAGGGA CATATACGTT CACTGCTGAA CTGACGTATC GCGATGTGTA TGGCCAACCA CAGCCGGCTG TGACGAAAAC GGTGACCGTC ACGGTCAAAG ACGAAGCGCC GCCGTCGTTT ACTGGAACGG TGAAGCTGCA AGGGCTGACC AATGATGTCA CAAGCTTAAT CAAACGGGGG GCAACCGATG GCGACGACAA CCGCTTCCGG GCCACGTATT CGTTGTCGCT TGTCGGGTAT GTCGGCAATG CGACCGGGAC GATCAGCAAT GTGAGCATTG TCCAAGAACT GCCCGATGGC ATCTCCGTTG TTCCGGATGG CAAAGTCACA ACCTATGTGG AGAATGGCAA GCGTTATGCC AAATGGTCTT TCACCAATCA AATGTTCGAC TACAGCCAGC TCAACAAACT GACGCTCTCG GCGCAATGGG TGATGCAGGC TGACTTTGCC ATGAACAGTG TTCAATTGCC GCCAGCTTCG GTGACATTCA CCGACAGCCG TTATGGCGCC AAAACATCGA CGCTCGTTCC GCCGGCGGAA CGAATCGGCA TGACAGTTCG CTTGGATGAT TTTCCAAACT TTTACTATGA AGGGAATGGA AAGGGTACTA TTACAAAACA CAAATTTTCT GCTTTATCAG ACGACATTGT CGGACAGGCG AATCCTAATT CATATGGATT ATTGCCGTTG CCTGTTAAAG CGCTGGAGTA TGACCCGAAT GATGATGGCG TGCTGCTTGT CACGTACAGC AACGGCCAGA CGGTATCGGT GTACATTAAG CCTCATCTTT CCGTTGTGAC AGCCAATGGC GTCTTGCCAA ACGGAGCGAC GACTTATGAG GCTCCGACCG TTAAAATAAC CGGGTTGGTA GCCGGTGAAG GGGTGTCATA TCAGTACCAA GCAGTGCGAA ATGGGGTGGC ATCCGCTTGG ACGCCGCTCT CGTCGCCGTA TGTTGTTGCC ATTTCCAATG ACGGTTCCTA CACGGTGAAT GTGCAGGCGT CAGGAGGATT GACGAGAGGA ACGGGTTTAA CGAGCACGGC GTTTACGTAT ACAAAGCGGA TTACGGACCT GCAGCTTGGT TCATATCAGC CGACGATGAA TGTCGGAGAT ACGCAGAGGA TTCCAGTGAC GATCGAGCCG AGCGACGCGA CGAACCCAAC GCTTGCGTGG ACATCAAGCG ACCCGAATGT CGCATCCGTA GACCAAAACG GCATGGTGAC AGCGCGAAAG CCTGGAACGG TGACGATCAC CGTCCGCGCG ACTGACGGCA GCAACGCGTC GGCATCGGCG GCCATCAAGG TGATGAATCC GTACGTGGCG CTTCAAGGTA TGGAGTTCCG CCGTCCGGTG TTGTATATGA AAGTCGGCGA ACGGCTTGAC GCAGCACCTG AGCTTCGTTT TGTTCCGTCC AATGCAAGCG ATCGAACGCT CCGCTCTGTG ACATCGTCTG ACTCCCGGTT TGTCGACGTT GTCCAAGAAA ATGGCCGATG GTATTTGGAA GCAGTGGATG TCGGCTATGC GTATGTGACG GCGACCGCCA ATGCGAGAAC GCCGGATGGC AGGCCAATTC AAGCCTCAGT GTGGGTCTTT GTCCAACCGC GTGAAGATGA GGGGGGAAGC GGCAGCAGTG GTGGGCGTTG GTAA
|
Protein sequence | MKRWRGWLLW GMLFWLMASL WPMKAGVVYA DQTEGAITFV SDEKIEGWAT APGSGNGQGN SIRYFMIRIQ PKEQSGTAVS YRAEAVEAPA DRNGEKKYTF SLDMRGKWPS APGTTVTYQI TVDAYRVLGN GKEDVYFSFP QTPYQYTRQT GVSTAKLDFS LSFSQPEYAK PPNGDAQGRL DVTLVPQGAV SGIIRPPIDV VFVMDVSGSM TAMKLQSAKS ALQAAVNYFK SNYNQNDRFA LIPFSDGVRE ASVVPFGKYS NVASQLDAIL NTGNSLTAGG GTNYSAALSL AKSYFTDPTR KKYIIFLTDG MPTVLNAVDT ITYREVKQNF WGGYSYTGNR VTDSLSVTYE LYSDGRTAGI RFTDNKGYSR RFYSDGQDYV NGWRVSWDNG YSFTYSSIEG KIRADATAVA KTLGMNNITL YSIGFGDNDE VDMDYLRSLS ATAGGEARQG TTQNLTALFQ QFSQLATTPA ITGTIRIPLS SFGGNVAVAE NGQVWLDENK QNAYISFSIP YQVGQGTPAP VTIPIPVSFK AKGTYTFTAE LTYRDVYGQP QPAVTKTVTV TVKDEAPPSF TGTVKLQGLT NDVTSLIKRG ATDGDDNRFR ATYSLSLVGY VGNATGTISN VSIVQELPDG ISVVPDGKVT TYVENGKRYA KWSFTNQMFD YSQLNKLTLS AQWVMQADFA MNSVQLPPAS VTFTDSRYGA KTSTLVPPAE RIGMTVRLDD FPNFYYEGNG KGTITKHKFS ALSDDIVGQA NPNSYGLLPL PVKALEYDPN DDGVLLVTYS NGQTVSVYIK PHLSVVTANG VLPNGATTYE APTVKITGLV AGEGVSYQYQ AVRNGVASAW TPLSSPYVVA ISNDGSYTVN VQASGGLTRG TGLTSTAFTY TKRITDLQLG SYQPTMNVGD TQRIPVTIEP SDATNPTLAW TSSDPNVASV DQNGMVTARK PGTVTITVRA TDGSNASASA AIKVMNPYVA LQGMEFRRPV LYMKVGERLD AAPELRFVPS NASDRTLRSV TSSDSRFVDV VQENGRWYLE AVDVGYAYVT ATANARTPDG RPIQASVWVF VQPREDEGGS GSSGGRW
|
| |