Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1931 |
Symbol | |
ID | 7979462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1990260 |
End bp | 1991627 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644798760 |
Product | protein of unknown function UPF0236 |
Protein accession | YP_002949930 |
Protein GI | 239827306 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTC AACAACATCT TACCACAAAT TCGTTGACAT GGAAAGAGAT CGAACTTGAT TTGTTTCGAG CCCTGCAAAA CGCCTTCGCT GAGCTGTTTA CGGCTCTGTT GGAGGACATC GACCGACAAT TGGCGGAAAC CCGGGACAAG CGCCGATACC ACTTGAAAGA CAAACGACGC ACCACGATTC AAACCCTGTT TGGCGAAGTC ACCTTTGAGC GGAACTACTA TTTAGACCGG GAACAAAACC GTTACACGTT TTTGCTTGAT TCCTTTTTAG CGTTTGATGG ATCGCAGTCA ATCAGCCCTT GTCTAGAAGA AACGGCGGTG GGATTGGCGG TGGAGTGCTC TTCCTATCGC AAAGCGGCTC GTACGCTTGC CCAGATGGTC GGGTATCCGG TGATGAGCCA TGAGGCGATC CGCCAGTTGG TGCTCGAAGC TGAAGCTCCG CTGCACTGCC CGGTGGACCA GCGATATGGA CGGGTGCTGT TTGTGGAGGC CGATGGACTG TTTGTTTCTC GCCAAGGGAA GGGAAAACGG GCCAAGGAAG ACAAAATCCT GACCGTTCAC GAAGGGTGGA AGCGCAACGG CTCACGGATC GAATTTGTCA ATCAGCGCCA TTACGTCCAT GAGGGCAAGA GGGAGGTGTG GGAAGGCTTC GAGGAATTTT TGATGAACGA ATATGCCTAT GATCCGTGTC GGGATCTTCT TGTCATCAAC GGGGACGGCG CTCCATGGAT TACCGCGTGC CGGGAGTATT TCAAGGGACG GGTCTGCTTC CAATTGGATC GATTCCACGT GGCGCGTGAG TTGCGCCAAT GCCTCTCGGG CCATCCACGG TGGCAGGCGA TTCGGCAAAA GCTGGCGAAG CAGGATGAAG AGGGGTTGCT TGTGGAACGG AACAGCGCCC TCGGCACGCT GGGGGACGAA GCGAAAGAAC AACAGCTGGC TGCCTTGATC CACCGGATCG AATCGATGCC GGGATGCATC CGTGATTACC GGGAATGGCT GAAGGAGCAA GGAGTGGACA CAACGGGCAT GTATCCGATG GGGAGGGCTG AAAGCGTGAT GAGCCAGTTG GCGTATCGGG TGAAATACCG CCGCAGTTGG ACAGACAAGG GACTCAGGGC GTTTTTCAAG GCAATGATTG CCCGGATGGA TGGGATTCGT CTTTTCGGAC GTCGTTTAGG AGAAGAATCG TCGCAGCCGG CGGAGGAAAC GGCATCCACC AAACAGACGA TCGTGAACAA GGCGAAACAA CGCGTCCGCC GTCTTCTTCC GGAGGTAACG CGGAATAACG TGCCATATTT ACAGCAATCG TCCGGGACAC CGATCTATCA TGCCCTGTCT GAACTCAAGG GATGGTAA
|
Protein sequence | MNIQQHLTTN SLTWKEIELD LFRALQNAFA ELFTALLEDI DRQLAETRDK RRYHLKDKRR TTIQTLFGEV TFERNYYLDR EQNRYTFLLD SFLAFDGSQS ISPCLEETAV GLAVECSSYR KAARTLAQMV GYPVMSHEAI RQLVLEAEAP LHCPVDQRYG RVLFVEADGL FVSRQGKGKR AKEDKILTVH EGWKRNGSRI EFVNQRHYVH EGKREVWEGF EEFLMNEYAY DPCRDLLVIN GDGAPWITAC REYFKGRVCF QLDRFHVARE LRQCLSGHPR WQAIRQKLAK QDEEGLLVER NSALGTLGDE AKEQQLAALI HRIESMPGCI RDYREWLKEQ GVDTTGMYPM GRAESVMSQL AYRVKYRRSW TDKGLRAFFK AMIARMDGIR LFGRRLGEES SQPAEETAST KQTIVNKAKQ RVRRLLPEVT RNNVPYLQQS SGTPIYHALS ELKGW
|
| |