Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1287 |
Symbol | |
ID | 7976067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1338780 |
End bp | 1340006 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798231 |
Product | protein of unknown function DUF395 YeeE/YedE |
Protein accession | YP_002949404 |
Protein GI | 239826780 |
COG category | [R] General function prediction only |
COG ID | [COG2391] Predicted transporter component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0018382 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATATTG TTGTGAAAAA ACAAACGAAT ATACAAGTAA AACAAGAAAC GAACGCCCCG AAAAATCCGA CTTCGTTTAT TATCGCCGTA GCGTTAGTCG CATTAATCGG TGGTGCTTTG TTTTTATATA GTCGCGTTTC TTGGCAACAA GCGCTTTTAT ATGTGTTAGG AGCTTTTGGT GGTTTCGTGT TATATCAAGC TCGTTTTGGC TTTACGACCG CTTGGCGCAA ATTCATTTTG TATCGTGAAG GTGAAGGTAT TCGCGCACAA ATGATTATGA TGTTAGTAGC AAGCATTTTC TTCATGCCGC TATTGTTGAA AGGGTCGATA TTTGGTCATC CGGTTGCGGG AAACGTTCAT GATGTCGGCA TTTCTGTTAT TGTCGGCGCG TTTATTTTCG GAATCGGCAT GCAGCTTGGC GACGGATGTG CTTCAGGGAC GCTTTACCAT ATCGGCGGCG GAGATACGAA CGGAATTGTG ACACTAATCG GTTTTATCGC CGGTTCTGTT ATTGCGACGA CTCATTTTGA TTTTTGGATG AATACACCGC ATTTTGCGCC AATTTCACTC ATTCATCAAC TTGGCGCTGT CGGAGGCTTT CTCCTTCAAC TTGTGCTATT GGCGTTCGTT TATTACATCG TGACCGTTAT CGAAAAACGC CGTCACGGAA AGCTGATTTC AGCGAAAACC GAAAACAAAA ACGGCTGGAA AGCAATTTAT AAAGGACCAT GGTCGCTTCT TGTCGGCGCG CTTTTACTCG CCGTTATGAA TGCTCTTGTA TTAATGATCA ACGGCAAACC GTGGGGCATT ACCTCCGCGT TCGCTCTTTG GGGAGCGAAA TTCGTGCAAT TGTTTGGCGT CGATCCAACC GAGTGGGCAT ATTGGCAGGA TCCAGCGAAA TTAAAGGCGC TAAAAAGTCC GTTATATCAA GACACAACAA CAGTAATGGA TATTAGCTTG ATGTTCGGCG CGCTATTAGC CGCGGCCTTT GCGGGACGAT ACGCAAAACC GATTCAATGG AAGCGCCCAT CGCGCATGAC GATCGGCGCC CTTATCGGCG GATTGATGAT GGGTTACGGC ACTCGTCTTG CGTTCGGCTG CAACATCGGC GCTTACTTCA GCGGCATCGC CTCTTTCAGC GTCCACGGCT GGATTTGGTT TGTCTTCGCC TTTCTCGGCA GCATCATCGG CGTCAAGCTG CGTCCGTATT GTGCGTATAA AAACTGA
|
Protein sequence | MDIVVKKQTN IQVKQETNAP KNPTSFIIAV ALVALIGGAL FLYSRVSWQQ ALLYVLGAFG GFVLYQARFG FTTAWRKFIL YREGEGIRAQ MIMMLVASIF FMPLLLKGSI FGHPVAGNVH DVGISVIVGA FIFGIGMQLG DGCASGTLYH IGGGDTNGIV TLIGFIAGSV IATTHFDFWM NTPHFAPISL IHQLGAVGGF LLQLVLLAFV YYIVTVIEKR RHGKLISAKT ENKNGWKAIY KGPWSLLVGA LLLAVMNALV LMINGKPWGI TSAFALWGAK FVQLFGVDPT EWAYWQDPAK LKALKSPLYQ DTTTVMDISL MFGALLAAAF AGRYAKPIQW KRPSRMTIGA LIGGLMMGYG TRLAFGCNIG AYFSGIASFS VHGWIWFVFA FLGSIIGVKL RPYCAYKN
|
| |