Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1001 |
Symbol | |
ID | 7976788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1050519 |
End bp | 1051739 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644797954 |
Product | sporulation integral membrane protein YlbJ |
Protein accession | YP_002949127 |
Protein GI | 239826503 |
COG category | [S] Function unknown |
COG ID | [COG3314] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02871] sporulation integral membrane protein YlbJ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0990759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAA AAGCAAAAAC CGTCTTTCTC GCTTCAACGG TTACTTTATT TGCACTGTCA TTAATTTGCT ACCCTCAGCA ATCATTAGAA GCATCCATCC GCGGTTTAAA TATGTGGTGG GAAGTCGTTT TTCCATCACT ATTACCTTTT TTTATCGTTT CTGAATTATT GATTAGCTTC GGTGTCGTTA ATTTGCTTGG AGTTTTGTTA GAACCGCTTA TGCGGCCGCT TTTTAGAGTT CCTGGCGTCG GCGGTTTCGT TTGGGCAATG GGAATGGCTT CCGGCTATCC ATCAGGAGCA AAACTAACTG CACGGCTTTA TCAAGAAAAA CAAATTTCTA CGATCGAAGC AGAACGGTTA GCCTCATTTA CTAATTCATC CAACCCATTG TTTATTTTTG GCGCGGTGTC GATCGGATTT TTTCACAATC CAAATCTTGG CATTATTTTA GCACTTTCCC ACTATATAGG AAACATTTGT GTAGGAATGA TTATGAGATT CCATGGAAAA TCACAAGAAA AAGGAAAGCA AAAACGACCG AGTCATTTGT TTCCTTTTCC TTACGCATTC CGAGTGCTTC ATGAAACCCG TCTAAAAAAC GAACAGCCGC TTGGAAAATT GTTAGGAGAC GCCGTTCGGT CTTCTGTACA AACATTGTTG ATGATCGGTG GGTTTATTAT TCTCTTTTCC GTCATTAATA AGCTGCTTTA CATGATGCAT ATTACGGAAT ATATATCCTT TATTTTCCAG TATATTCTTC ACTTATTTCA ACTTCCAAAA GAACTAAGCA TCCCGATGAT TTCCGGTCTA TTTGAAATTA CGCTCGGCAG TCAGATGATT AGCCAAACCG ATAAAGCCGA ACTGTTGGAA AAAGCCATTG CAACAAGCTT TATTCTTGCT TTTGGCGGAT TTTCCGTGCA AGCACAAGTA GCAAGCATCC TCGCTGAAGC AAACATCCGC TTTAAACCAT TTTTTATCGC CAGAATCATG CATGGATGTT TTGCCGCATG TTTTACATAT ATACTATGGA AACCGCTCTA CGTCCACCCA GCCGATGGAA ATATGCGCGT TATTCCAACA TTTTTAATAG AACGCTCACC AAGCTGGATC AACCATTATT GGGAGCTGTT GCATCAATTC GGACCAATCA TTACGATCGT CTTTTTATGT CTATATATAT GGCTTACTGC TGCTCAATGG CAAAAAAAGG TCATAGAATA A
|
Protein sequence | MKPKAKTVFL ASTVTLFALS LICYPQQSLE ASIRGLNMWW EVVFPSLLPF FIVSELLISF GVVNLLGVLL EPLMRPLFRV PGVGGFVWAM GMASGYPSGA KLTARLYQEK QISTIEAERL ASFTNSSNPL FIFGAVSIGF FHNPNLGIIL ALSHYIGNIC VGMIMRFHGK SQEKGKQKRP SHLFPFPYAF RVLHETRLKN EQPLGKLLGD AVRSSVQTLL MIGGFIILFS VINKLLYMMH ITEYISFIFQ YILHLFQLPK ELSIPMISGL FEITLGSQMI SQTDKAELLE KAIATSFILA FGGFSVQAQV ASILAEANIR FKPFFIARIM HGCFAACFTY ILWKPLYVHP ADGNMRVIPT FLIERSPSWI NHYWELLHQF GPIITIVFLC LYIWLTAAQW QKKVIE
|
| |