Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1829 |
Symbol | |
ID | 7976454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1894739 |
End bp | 1896115 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644798665 |
Product | protein of unknown function UPF0236 |
Protein accession | YP_002949835 |
Protein GI | 239827211 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000497336 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTC AACAACATCT TACCACAAAT TCGTTGTCAT GGAAAGAGAT CGAACTTGAT TTGTTTCGAG CCTTGCAAAA CGCCTTCGCC GAGCTGTTTA CGGCTCTGTT GGAGGACATC GACCGACAAT TGGCGGAAAC CCGGGACAAG CGCCGGTACC ACTTGAAAGA CAAACGACGC ACCACGATTC AAACCCTGTT TGGCGAAGTT ACCTTTGAGC GAAACTACTA TTTAGACCGA GAACAAAACC GTTACACGTT TTTGCTTGAC TCCTTTTTAG CGTTTGATGG ATCACAGTCA ATCAGCCCTT GTCTAGAAGA AACAGCGGTG GGATTGGTTG TGGAGTGCTC TTCCTATCGC AAAGCGGCTC GTACGCTTGC TCAGATGGTC GGGTATCCGG TGATGAGCCA TGAGGCGATC CGCCAGTTGG TGCTCGAGGC CGAAGTTCCG CTGCACTGCC CGGTTGACCA GCGATATGGA CGGGTGCTGT TTGTGGAGGC CGATGGACTG TTTGTCTCTC GCCAAGGGAA GGGAAAACGG GCCAAGGAAG ACAAAATCCT GACCGTTCAC GAAGGGTGGA AGCGCCACGG CTCACAGATC GAATTCGTGA ACCAGCGCCA TTACGTCCAT GAGGGCAAGA GGGAGGTGTG GGAAGGCTTC GAGGAATTTT TGATGAACGA ATATGCCTAT GATCCGTGTC GGGATCTTCT TGTCATCAAC GGGGACGGCG CTCCATGGAT TACCGCGTGC CGGGAGTATT TCAAGGGACG GGTCTGCTTC CAATTGGATC GATTCCACGT GGCGCGTGAG TTGCGCCAAT GCCTCTCGGG CCATCCACGG TGGCAGGCGA TTCGGCAAAA GCTGGCGAAG CAGGATGAAG AGGGGTTGCT TGTGGAACGG AACAGCGCCC TCGGCACGCT GGGGGACGAA GCGAAAGAAC AACAGCTGGC TGCCTTGATC CACCGGATCG AATCGATGCC GGGATGCATC CGTGATTACC GGGAATGGCT GAAGGAGCAA GGAGTGGACA CAACGGGCAT GTATCCGATG GGGAGGGCTG AAAGCGTGAT GAGCCAGTTG GCGTATCGGG TGAAATACCG CCGCAGTTGG ACAGACAAGG GACTCAGGGC GTTTTTCAAG GCAATGATTG CCCGGATGGA TGGGATTCGT CTTTTCGGAC GTCGTTTAGG AGAAGAATCG TCGCAGCCGG CGGAGGAAAC GGCGTCGACG GCATCCACCA AACAGACGAT CGTGAACAAG GCGAAACAAC GCGTCCGCCG TCTTCTTCCG GAGGTAACGC GGAATAACGT GCCATATTTA CAGCAATCGT CCGGGACACC GATCTATCAT GCCCTGTCTG AACTCAAGGG ATGGTAA
|
Protein sequence | MNIQQHLTTN SLSWKEIELD LFRALQNAFA ELFTALLEDI DRQLAETRDK RRYHLKDKRR TTIQTLFGEV TFERNYYLDR EQNRYTFLLD SFLAFDGSQS ISPCLEETAV GLVVECSSYR KAARTLAQMV GYPVMSHEAI RQLVLEAEVP LHCPVDQRYG RVLFVEADGL FVSRQGKGKR AKEDKILTVH EGWKRHGSQI EFVNQRHYVH EGKREVWEGF EEFLMNEYAY DPCRDLLVIN GDGAPWITAC REYFKGRVCF QLDRFHVARE LRQCLSGHPR WQAIRQKLAK QDEEGLLVER NSALGTLGDE AKEQQLAALI HRIESMPGCI RDYREWLKEQ GVDTTGMYPM GRAESVMSQL AYRVKYRRSW TDKGLRAFFK AMIARMDGIR LFGRRLGEES SQPAEETAST ASTKQTIVNK AKQRVRRLLP EVTRNNVPYL QQSSGTPIYH ALSELKGW
|
| |