Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1634 |
Symbol | |
ID | 7976280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1709203 |
End bp | 1711119 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644798517 |
Product | von Willebrand factor type A |
Protein accession | YP_002949689 |
Protein GI | 239827065 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4548] Nitric oxide reductase activation protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.233072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCGGT TTATCCAATT TAACGATAAA AAGATTGATT CCTTTTTGTT TATGCAGCTG TCCGATTTAG TGAAAACGTT AGCGAAACAT AGCGATTGGG AAGTAGAGTT CGGCTTTCAG TCTTATGTTG ACTTTCCAAA CCGAAAATTA TATGTCAGTC ATTTTTGGGA CAATCGGCCG AAAGAAGAAA AAGAAAACGG ATTAAAAAGC GATGTCTGTT TGCGAGCCGT CGGAACGCTT TTTCATACTG ATTTTTCTGA AGTCACCGCA TTTCTTAACA AAACGAAAAA CATATCGATC CCTAGTTTTG CAAAGCAACT ATTTACACTA GCAGAAGATT TGCGTCTTGA GGAAATTTGT AAAAAAGAGC GTCCTGGCAC GAAAAAATGG TTTCGCATCC GCCGCGATGT GTACCGCCGT TATTTTTACA GTCAAACGAA CGCCAATTTG ACAAGAAGCG TATATACGGA TGCATTATTT TCTGTTATCT ATTTGTTATT AACATCCGAT TCGCCGCTAG AGGACGTTCC AGTTATTCAT GAGCCAATTG ACCGAATGAT GCCTTGGCTT CGCCAGACGC TTCCCCAATT TTTTGACGCT GCTTCGACGA AAGAAGTGGC CCGTATCACG TGGATGATTA CAGAAGCATT CGATGACGTA TTGGAAAATG ATATGTTAAA CACGTATTTT TATTTACCAG AACAAAGTTA TAATGAAGAG ATGGGACTGA CGCTAGAAGA CTTAAAACGG ATCGACCCGC TCAATAACTG TGACATCTTG GATAAAGAAA AATGTGGGGA CGAAGATGTC CATGATCAAG AACTGCCAAC GTGGCACCGT GAAACGAGTG ACATGACGAA AAGTTTCTTG CGTTTTGAAC TTGAACAAGG CTCGCGAACC GATTTATTGG GAGATGGAGC GCGTGAAGGC GAAGACGGGG ATCAGGCACT TGCGATGGTG CAAGGATCTG CGCGAAAATC GAATCGAAAC GATTATAGGA AGCTAAATGC CTATGAACAG AAACGGGAAA GCAAACAAGC AGGTAAAGGG GATCGCTACG GGAAAGATAA CCGGTATGCT GAAGCAATTT TTCTTTTTCC AACTTCTCCT TCGTCTGAAC AAATAGCACA ATATGAACAA AAGAAAATCG ATATTTTACC ATATCAAAAA AAATTAAAAC AAATGATGGA AAAAACATTA GAACATAAAA AGACACTTCC ACGCACCGAT TTACATTTTG GCCGGCTGCA TAAAAAACTG CTTCGTCTAT GGACAGACGA ACAGCCGCGC TTGTTTTATA AAAAACATCA GCCATCTTCT CGTATTGATG CCGTTTTTAC GCTGCTAGTC GATTGTTCAG CGTCGATGTA TGACAAAATG GAAGAAACGA AACGCGGTAT TATTTTGTTC CATGAAGCAT TAAAATCGTT GCTTGTGCCC CATCAGATCG TTGGATTTTG GGAAGATCCA AATGAGGTGA CGGAAACGAA GCAGCCAAAC TATTTTCAAA CGGTAATTTC ATTTGCACAA TCACATAAAA AAGAAAGCGG CCCTGCGATT ATGCAGCTGG AGGCCCAGGA AGATAATCGC GACGGATTTG CGATTCGCAT CATAACGGAA CAACTTTTAA AGCGGCCGGA AAAACAAAAA TTTTTATTGG TGTTTTCCGA CGGAGAACCA GCGGCGTTTG GTTATGAACA AAACGGTATT ATCGATACGC ATGAAGCGGT GTTAGAAGCG CGCAAACATC ATATCGAAGT GATTAACGTC TTTTTGGCCA ATGGTGAAAT TGACGAAGGA CAGAGAGAGA CGATTCGAAA TATTTATGGA AAACATAGCA TTGTTGTTCC AAACGTGGAG CAACTTCCAG ATTTCTTATT CCCGCTATTG AAAAAACTAT TGTATAAAAG TTTATAA
|
Protein sequence | MERFIQFNDK KIDSFLFMQL SDLVKTLAKH SDWEVEFGFQ SYVDFPNRKL YVSHFWDNRP KEEKENGLKS DVCLRAVGTL FHTDFSEVTA FLNKTKNISI PSFAKQLFTL AEDLRLEEIC KKERPGTKKW FRIRRDVYRR YFYSQTNANL TRSVYTDALF SVIYLLLTSD SPLEDVPVIH EPIDRMMPWL RQTLPQFFDA ASTKEVARIT WMITEAFDDV LENDMLNTYF YLPEQSYNEE MGLTLEDLKR IDPLNNCDIL DKEKCGDEDV HDQELPTWHR ETSDMTKSFL RFELEQGSRT DLLGDGAREG EDGDQALAMV QGSARKSNRN DYRKLNAYEQ KRESKQAGKG DRYGKDNRYA EAIFLFPTSP SSEQIAQYEQ KKIDILPYQK KLKQMMEKTL EHKKTLPRTD LHFGRLHKKL LRLWTDEQPR LFYKKHQPSS RIDAVFTLLV DCSASMYDKM EETKRGIILF HEALKSLLVP HQIVGFWEDP NEVTETKQPN YFQTVISFAQ SHKKESGPAI MQLEAQEDNR DGFAIRIITE QLLKRPEKQK FLLVFSDGEP AAFGYEQNGI IDTHEAVLEA RKHHIEVINV FLANGEIDEG QRETIRNIYG KHSIVVPNVE QLPDFLFPLL KKLLYKSL
|
| |