Gene GWCH70_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1001 
Symbol 
ID7976788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1050519 
End bp1051739 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content40% 
IMG OID644797954 
Productsporulation integral membrane protein YlbJ 
Protein accessionYP_002949127 
Protein GI239826503 
COG category[S] Function unknown 
COG ID[COG3314] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR02871] sporulation integral membrane protein YlbJ 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0990759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAA AAGCAAAAAC CGTCTTTCTC GCTTCAACGG TTACTTTATT TGCACTGTCA 
TTAATTTGCT ACCCTCAGCA ATCATTAGAA GCATCCATCC GCGGTTTAAA TATGTGGTGG
GAAGTCGTTT TTCCATCACT ATTACCTTTT TTTATCGTTT CTGAATTATT GATTAGCTTC
GGTGTCGTTA ATTTGCTTGG AGTTTTGTTA GAACCGCTTA TGCGGCCGCT TTTTAGAGTT
CCTGGCGTCG GCGGTTTCGT TTGGGCAATG GGAATGGCTT CCGGCTATCC ATCAGGAGCA
AAACTAACTG CACGGCTTTA TCAAGAAAAA CAAATTTCTA CGATCGAAGC AGAACGGTTA
GCCTCATTTA CTAATTCATC CAACCCATTG TTTATTTTTG GCGCGGTGTC GATCGGATTT
TTTCACAATC CAAATCTTGG CATTATTTTA GCACTTTCCC ACTATATAGG AAACATTTGT
GTAGGAATGA TTATGAGATT CCATGGAAAA TCACAAGAAA AAGGAAAGCA AAAACGACCG
AGTCATTTGT TTCCTTTTCC TTACGCATTC CGAGTGCTTC ATGAAACCCG TCTAAAAAAC
GAACAGCCGC TTGGAAAATT GTTAGGAGAC GCCGTTCGGT CTTCTGTACA AACATTGTTG
ATGATCGGTG GGTTTATTAT TCTCTTTTCC GTCATTAATA AGCTGCTTTA CATGATGCAT
ATTACGGAAT ATATATCCTT TATTTTCCAG TATATTCTTC ACTTATTTCA ACTTCCAAAA
GAACTAAGCA TCCCGATGAT TTCCGGTCTA TTTGAAATTA CGCTCGGCAG TCAGATGATT
AGCCAAACCG ATAAAGCCGA ACTGTTGGAA AAAGCCATTG CAACAAGCTT TATTCTTGCT
TTTGGCGGAT TTTCCGTGCA AGCACAAGTA GCAAGCATCC TCGCTGAAGC AAACATCCGC
TTTAAACCAT TTTTTATCGC CAGAATCATG CATGGATGTT TTGCCGCATG TTTTACATAT
ATACTATGGA AACCGCTCTA CGTCCACCCA GCCGATGGAA ATATGCGCGT TATTCCAACA
TTTTTAATAG AACGCTCACC AAGCTGGATC AACCATTATT GGGAGCTGTT GCATCAATTC
GGACCAATCA TTACGATCGT CTTTTTATGT CTATATATAT GGCTTACTGC TGCTCAATGG
CAAAAAAAGG TCATAGAATA A
 
Protein sequence
MKPKAKTVFL ASTVTLFALS LICYPQQSLE ASIRGLNMWW EVVFPSLLPF FIVSELLISF 
GVVNLLGVLL EPLMRPLFRV PGVGGFVWAM GMASGYPSGA KLTARLYQEK QISTIEAERL
ASFTNSSNPL FIFGAVSIGF FHNPNLGIIL ALSHYIGNIC VGMIMRFHGK SQEKGKQKRP
SHLFPFPYAF RVLHETRLKN EQPLGKLLGD AVRSSVQTLL MIGGFIILFS VINKLLYMMH
ITEYISFIFQ YILHLFQLPK ELSIPMISGL FEITLGSQMI SQTDKAELLE KAIATSFILA
FGGFSVQAQV ASILAEANIR FKPFFIARIM HGCFAACFTY ILWKPLYVHP ADGNMRVIPT
FLIERSPSWI NHYWELLHQF GPIITIVFLC LYIWLTAAQW QKKVIE