Gene Information |
Locus tag | GWCH70_2618 |
Symbol | |
ID | 7978281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2653334 |
End bp | 2654524 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644799419 |
Product | hypothetical protein |
Protein accession | YP_002950578 |
Protein GI | 239827954 |
COG category | |
COG ID | |
TIGRFAM ID | |
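The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2653334
end_bp = 2654524

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced reading frame whose last codon is a stop,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1191
print(protein_length)   # 396
```

Both results agree with the Gene Length (1191 bp) and Protein Length (396 aa) fields of the record.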
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000218897 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGTTA AATGGAAGCT GCTTGCTGTT TTTACAGCGT TTGCGGTAAT GATGGGAGCT TGCTCAAACA GCGAGGAAAC GGCAACAACC AAAACAAAGG AAAAAGAACA AACAGCCGCT GAACAAAAAG AAGAAAAGCC AGCGGAAACA GAGAAAAAAG AAGAAATAAA CTATGCCGAT GTGTTTAAAC AGGCGATTGC CGAGCTGGAA AAAGCAAAGC AAGGGCAAAA AGTTGACTTT GATAAAGTAA CAAAACTATA TGAAGAAAAT TTACAGACGC TTGTACAAAA ACGTGACGCC GAATTTCAAG ATACGCTTGA CCAGCATATT ACTACTGCGC TGGCGGCAGG GAAAGATGGC TCGCTCGATC CAGTAGTCGC CAAACAAATC TTTGATAAGC TTATGCAAAA AGTATTTTAT ACGACAATCA AGCATGAGTT TACTGAAGTA GAGGAAAATT GGGCAAATAA AGAAGCGGTG AAAGAAGAAA TCGAAGAAGC GAAGCAGTTT TACGCGATTT TACAGCCAAC GGTAGAAAAG CGCGACGCCG CTTATGGAAC AAAGTTAGCG GACGCGATTA ATGGAGCGTT TGCACAAATT GAAGATGCCG CCGCGAAAGA CGATTTGCTC GCATTCTCCT TAGGAAAGCA AGTCGTGGAT AAAACGTTGA TGAAAACCTT TTATTTAGCG ACAGGCGCGC TTCCGCATGG CTATGCTTCA AAGGCGGCAA ATGCGGCGAA GCAGGATGAA AAAGAAGCGA AAGTCAAACA AGCGGAAGGA TGGGCATTTT ATCAATCTGT ATATCCATAT ATGAAAAAGC ATGCGCCAGA AGAAGCGGAC TACATTTTAA AACAATTTGA TTTGCAAACA GATGTGAAAA CCCTAGATCC GGCAGCGATC AATAAAGCGT TTGTCCGCGG CTGGGCAAAG GTGGCGCTTC ATGAATATGA AGAAAGCAAA GAAAGCTGGG GGCAAGATAA ATCGGTGATT ACTGCGTTAG AAGGCGCTTT ATTTATTAAT ATGATGGAAA GTGACTTAAA AACACTGTTA GGCGATCAAG CTTATGCGTC ATTGAATGAT CAGGCGCAGC GCTACCTTGA GGCGGCAAAA ACAAAAAATA AAGCAGAGGG AGAAAAACTT CTTTCTCAAT TAGAAGCAAC GTTAAACACT GTTATGGAAA AGGCAAAATA A
|
Protein sequence | MSVKWKLLAV FTAFAVMMGA CSNSEETATT KTKEKEQTAA EQKEEKPAET EKKEEINYAD VFKQAIAELE KAKQGQKVDF DKVTKLYEEN LQTLVQKRDA EFQDTLDQHI TTALAAGKDG SLDPVVAKQI FDKLMQKVFY TTIKHEFTEV EENWANKEAV KEEIEEAKQF YAILQPTVEK RDAAYGTKLA DAINGAFAQI EDAAAKDDLL AFSLGKQVVD KTLMKTFYLA TGALPHGYAS KAANAAKQDE KEAKVKQAEG WAFYQSVYPY MKKHAPEEAD YILKQFDLQT DVKTLDPAAI NKAFVRGWAK VALHEYEESK ESWGQDKSVI TALEGALFIN MMESDLKTLL GDQAYASLND QAQRYLEAAK TKNKAEGEKL LSQLEATLNT VMEKAK
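The relationship between the two sequences above can be spot-checked by translating the first and last few codons of the gene sequence. This is a minimal sketch, assuming the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand of the replicon). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so the standard table is built here; the `translate` helper is hypothetical and written only for this illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 shares these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record groups bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate("ATGTCAGTTA AATGGAAGCT GCTTGCTGTT"))  # MSVKWKLLAV
print(translate("GTTATGGAAA AGGCAAAATA A"))           # VMEKAK
```

The two fragments match the first ten and last six residues of the listed protein sequence.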