Gene Information |
Locus tag | GWCH70_2410 |
Symbol | |
ID | 7975998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2445903 |
End bp | 2447024 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644799212 |
Product | protein of unknown function DUF34 |
Protein accession | YP_002950372 |
Protein GI | 239827748 |
COG category | [S] Function unknown |
COG ID | [COG3323] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
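
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent. A minimal sketch of the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information table.
start_bp = 2445903
end_bp = 2447024

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1122 bp
print(protein_length, "aa")  # 373 aa
```

Both results match the Gene Length and Protein Length rows, which supports the inclusive-coordinate reading.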
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0245658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACAGAA TTCCAAGCGG TCATGAAATC ATTCAATTAT TTGAGCAATT TGCACCAAAA CATTTAGCGA TGGAAGGCGA CAGAATCGGT TTGCAAATTG GAACGTTAAA TAAACCAGTC AAAAAGGTAA TGATTGCCTT GGATGTTTTA GAAGAAGTAA TCGATGAGGC TGTAGCGGAA GAGGTAGATC TTATTATCGC GCACCATCCA CCGCTTTACC GTCCGCTAAA GCAGATCATT ACTGATCAAG CGCAAGGACG TATCATTGAA AAATGTATGA AACATCATAT TGCCATTTAT GCTGCTCATA CGAATTTAGA TATCGCCAAT GGAGGGGTAA ATGACTGGTT AGCGGAGGCG TTGGGGCTGG AGCATGTAGA TGTATTGATT CCGACGTATG AAGAGCCGCT GAAAAAACTA GTCGTTTATG TTCCGGAGAC ACATGCCGAC CTTGTTCGTG AGGCGATCGG TAACGCGGGG GCAGGCCATA TCGGCAACTA CAGCCATTGT ACGTTCAACG GCCGCGGTAT CGGTACTTTT TTACCGCTAG AAGGGGCGAA TCCGTTTATC GGGAAATCCG GGACACTGGA GCAAGTGGAA GAGGTGCGTA TTGAAACGAT TGTCCCGGCT TCCTTGCAAA ACAAGGTTAT TTCTGCGATG TTAAAAGCAC ATCCATACGA AGAAGTGGCG TATGACATAT ATCCGCTTGA AAATAAAGGA AAAGTATTCG GATTAGGAAG AATTGGCCGC TTGCCGGAAG CGATGACACT TGGGGAGTTT GCAGAGCATG TGAAAAAAGC GTTGGACGTT CCGGCGGTGC GTGTTGTTGG CCATTTGCAA GATATGGTTC AAAAAGTCGC AGTGGTTGGC GGTGATGGAA ATAAATATAT CTCACAAGCA AAATTAGCCG GCGCTGATGT TTATGTGACG GGGGATGTAT ATTATCATGT TGCGCACGAT GCAATGATGC TTGGGCTAAA TATCGTGGAT CCGGGACATA ATGTGGAGAA AGTAATGAAA CAAGGGGTCG CTCGCTTTTT AGAAAATGCG TTTGCTAAAC ATCAGTTTGC AACGACCGTC TGCATTTCGA AAGTGCATAC CGATCCATTT ACGTTCGTAT AA
|
Protein sequence | MNRIPSGHEI IQLFEQFAPK HLAMEGDRIG LQIGTLNKPV KKVMIALDVL EEVIDEAVAE EVDLIIAHHP PLYRPLKQII TDQAQGRIIE KCMKHHIAIY AAHTNLDIAN GGVNDWLAEA LGLEHVDVLI PTYEEPLKKL VVYVPETHAD LVREAIGNAG AGHIGNYSHC TFNGRGIGTF LPLEGANPFI GKSGTLEQVE EVRIETIVPA SLQNKVISAM LKAHPYEEVA YDIYPLENKG KVFGLGRIGR LPEAMTLGEF AEHVKKALDV PAVRVVGHLQ DMVQKVAVVG GDGNKYISQA KLAGADVYVT GDVYYHVAHD AMMLGLNIVD PGHNVEKVMK QGVARFLENA FAKHQFATTV CISKVHTDPF TFV
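
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. The `translate` helper and the `START_CODONS` set are hypothetical names introduced here; the set lists only the most common bacterial start codons, not every start codon that table 11 allows.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of the table 11 start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a start codon in first position as M."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGAACAGAA TTCCAAGCGG TCATGAAATC"))  # MNRIPSGHEI
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same helper should give a string that can be compared against the complete Protein sequence row.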