Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0521 |
Symbol | |
ID | 7978236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 591286 |
End bp | 592632 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644797522 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002948696 |
Protein GI | 239826072 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGACATAG TTAACTTGTT AGTGGTAGCA TTGCTTATTG CATGTACCGC TTTTTTCGTA GCTTCGGAAT TTGCGATTGT CAAAGTTCGC AGCTCGCGCA TTGACCAATT AGTTAGCGAA GGAAATAAAC GGGCGATTGC TGCCAAAAAG GTGATCTCCA ACCTTGATGG CTATTTGTCG GCGAACCAAT TAGGCATTAC GATTACATCG TTAGGACTTG GTTGGCTTGG TGAACCGACG GTTGAACGGA TGCTAACACC GCTTTTCGAA CGCATCCATT TATTAGAATC GGTTTCGCAC GTTTTATCCT TCGTTATTGC GTTTTCGACG ATTACATTTC TTCACGTCGT TGTCGGCGAG CTGGCTCCAA AAACGTTTGC CATCCACAAA GCGGAGGCGA TTACGCTGCT TACAGCCCAA CCGCTTATTT TATTTTATAA AGTGATGTAT CCGTTTATTT GGGCGCTCAA CAATTCCGCG CGCCTCGTCG CCAGAATGTT TGGGTTAAAG CCAGCGGCAG AACATGAAAT TGCCCATTCT GAAGAAGAGT TGCGCCTTAT TTTATCAGAA AGTTACAAAA GTGGAGAGAT TAACCAATCA GAATATCGAT ATGTGAACAA TATTTTCCGA TTTGATGATC GGGTTGCAAA AGAAATTATG GTACCGCGCA AAGAAATTGT TGCGCTCGAT ATTAATCGAA GCGTGAAAGA GAATTTGGAA ATTATTAAAG AGGAAAAATA TACTCGTTAT CCAGTTATTG ATGGCGATAA AGATCACGTT CTTGGACTTA TTAATGTGAA AGAAGTGTTT ACCGATCTTG TGACAAATCC ATCCGAAGAA AAACAAATGA AAGATTATAT CCGCCCAATC ATTCAAGTGA TTGAATCGAT CGCTATTCAT GATTTGCTTG TGAAAATGCA GAAAGAACGC ATCCACATGG CCATTTTAGT CGACGAATAT GGCGGAACAT CGGGGCTTGT TACCGTCGAG GATATTTTAG AAGAAATCGT TGGAGAAATT CAAGACGAGT TTGACGTAGA TGAAATCCCG TTGATTCAAA AAGTTGATGA AACACGTACA ATTATCGACG GGAAAGTGCT GATTAGCGAA GTGAACGATT TGTTTGGCCT TTCCATTGAT GATGAGGATG TCGATACGAT TGGAGGATGG ATTTTAACGA AGCATTATGA TATTAAAGTC GGCGATAGCG TCGAAATCGA TAATTACTTG TTTACGGTGA AGGAGATGGA TGGTCACCAC GTGAAGACGA TAGAAGTAGT AAAACAGGAG AAAGAAGAGA AAGCGGCAGA TCATGAACTC GGTGAAAAAG AGGAATTGCA TTTATGA
|
Protein sequence | MDIVNLLVVA LLIACTAFFV ASEFAIVKVR SSRIDQLVSE GNKRAIAAKK VISNLDGYLS ANQLGITITS LGLGWLGEPT VERMLTPLFE RIHLLESVSH VLSFVIAFST ITFLHVVVGE LAPKTFAIHK AEAITLLTAQ PLILFYKVMY PFIWALNNSA RLVARMFGLK PAAEHEIAHS EEELRLILSE SYKSGEINQS EYRYVNNIFR FDDRVAKEIM VPRKEIVALD INRSVKENLE IIKEEKYTRY PVIDGDKDHV LGLINVKEVF TDLVTNPSEE KQMKDYIRPI IQVIESIAIH DLLVKMQKER IHMAILVDEY GGTSGLVTVE DILEEIVGEI QDEFDVDEIP LIQKVDETRT IIDGKVLISE VNDLFGLSID DEDVDTIGGW ILTKHYDIKV GDSVEIDNYL FTVKEMDGHH VKTIEVVKQE KEEKAADHEL GEKEELHL
|
| |