Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2968 |
Symbol | |
ID | 7977268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2990190 |
End bp | 2991152 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644799768 |
Product | protein of unknown function DUF199 |
Protein accession | YP_002950907 |
Protein GI | 239828283 |
COG category | [S] Function unknown |
COG ID | [COG1481] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00647] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000176977 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTTG CGTCAGAAAC GAAAAAAGAA CTAACGAATT TGGAAGTGAA ACCGTGCTGT TTAAAAGCAG AATTGTCAGC GCTTCTGCGC ATGAATGGTT TGCTGTCTTT TTCCAATCAG CAAATACTTG TGGATGTGCA AACGGAAAAT GCCGCGATCG CGCGAAGAAT TTATACTCTT TTAAAAAAAG GGTATACGGT GACAGTAGAA CTATTGGTTC GAAAAAAAAT GCGCCTGAAA AAAAATAATG TATACATCGT TCGAATTATT GACGGCGCTC ATGAACTATT AAAGGATTTA AAAATATTGA AGGAGGATTT TTCACTAATT CGCGTGATTT CACCAGAATT AGTGAAAAAA AAGTGCTGTA AGCGTTCTTA TTTGCGCGGT GCGTTTCTTG CCGGAGGATC TGTAAACAAT CCGGAAACGT CATCGTACCA TTTAGAAATT TTTTCTCTTT ATGAAGAACA TAACAATTCA TTATGTGAAT TAATGAACAG CCACTTTTTT CTTAATGCCA AAACATTGGA ACGAAAAAAA GGGTTTATTA CGTATTTAAA AGAAGCGGAA AAAATCTCTG AGTTTTTAAA TATTATCGGC GCACATCAAG CGTTATTGCG CTTTGAGGAT ATTCGCATTG TACGCGATAT GAGAAATTCG GTAAACCGTC TTGTCAATTG CGAAACAGCA AACTTAAATA AAACGATTGG CGCTGCGCTT CGCCAAGTAG AAAACATTCG TTATATTGAT GAAACCATCG GATTAAGCGC CCTTCCTGAT AAATTGCGGG AAATTGCGGA ATTGCGGATG ATGTATCAAG ACGTAACGTT AAAAGAATTA GGCGAGTTAG TATCCGGTGG AAAGATTAGT AAATCAGGAA TTAATCACCG TCTGCGAAAA ATTGATGAGA TTGCGGAACG ATTGCGGGCA GGTAAGCCTG TTGATTTTCA TAAGTCATTA TAA
|
Protein sequence | MSFASETKKE LTNLEVKPCC LKAELSALLR MNGLLSFSNQ QILVDVQTEN AAIARRIYTL LKKGYTVTVE LLVRKKMRLK KNNVYIVRII DGAHELLKDL KILKEDFSLI RVISPELVKK KCCKRSYLRG AFLAGGSVNN PETSSYHLEI FSLYEEHNNS LCELMNSHFF LNAKTLERKK GFITYLKEAE KISEFLNIIG AHQALLRFED IRIVRDMRNS VNRLVNCETA NLNKTIGAAL RQVENIRYID ETIGLSALPD KLREIAELRM MYQDVTLKEL GELVSGGKIS KSGINHRLRK IDEIAERLRA GKPVDFHKSL
|
| |