Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1915 |
Symbol | |
ID | 7978741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1971555 |
End bp | 1972913 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798746 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_002949916 |
Protein GI | 239827292 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000454942 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTGC GAGAAAGTGA GATAAAAAGC GGACCGGCAT GTGCGACTGT TAGTCTTGGA AACTATCGTT TTAGTGATCC TCCAGATCCT AGCAAATGGG CTGACTGCGT TCATTGCGGC ATGTGCTTAG AATCGTGTCC GACATACGAA CAAACTGGTC AAGAACAACA TTCGCCGCGC GGACGCGTGC ATTTGATAAA ATCGGTAGCA GAAGGGAAGT TAGAGATTAC AGAAGATTTG ATTGATCCCA TTTTTATGTG CCTAGATTGT CGGGCTTGCA CAACTGCGTG CCCTGCCGAT GTCGATGTCG GCGGATTGAT TGAAGAAGTA CGCGGACAAA TCCGGCAAGC GATTCCGCTG ACAGGATGGA AAGCGTTTGT CAACAATTTC TTCTTAAAAG GTGTATTTCC TCATCCACCC CGTCTTCATC TGCTCGGGAG TTTGTTAAAG CTATATCAAA AAAGCGGACT GCAGATGATC GCCCGCAAGA CGAAACTGCT TCACATCATG CCGAAGCATC TAGTGGAGAT GGAAGCGATT TTGCCGGAAG CAGGCGTGCC GGTCCGAAAA AAATATAAAC ATATGAATGT CATTAAAGCA AAAGGAGAAA CGAAACATAC GGTCGCGCTT TTAACAGGAT GCGTCATGGA TGTAATGTTC AGCGATATTA ACGAGGCGAC GATTCGCGTG TTGACACGAA ACGGAAACGA TGTCGTCATT CCGCAAAATC AAACGTGCTG CGGGGCGCTT CATGTGCATG CAGGGGACCG CGAGATGGGC CGGAAGCTTG CCAAACAAAA TATTGAAGCA TTTCAACATG CAGACCGCGT GATCGTGAAC GCGGCAGGGT GTGGCTCTAT GTTAAAGGAA TATCCAGAGT TGTTCCGCAA CGATCCAGAG TGGCGGGAAA AAGCGGAAGA TTTTTCCCGT AAAGTCGAAG ATATTTCAAA ATATTTGCAT GATACGGGGT ATGAAAAGCC GAAAGCGGAA TTAAATGTGC GCATTACGTA CCATGACGCT TGCCATTTGG CGCACGGCCA AGGAATTCGT CAAGAACCGC GCACGATTTT ATCGAGCATT CCGGGAGTGG AAATGGTTTC GATGCCAAAC GCCGACCGCT GCTGCGGAAG CGCGGGAATT TATAATCTTA CCCATCCGGA TATGGCAAGC GCCGTATTGG AAAGCAAAAT GGAAAACGTT CCAGAAGATG TGGAACTTAT TACAATGGGG AATCCTGGAT GCATGTTGCA AATGGCGATG GGAGTTTTGA AATACGGCCG AAATCAAAAA GTCGTTCATA CCGTACAGCT CCTAGACTGG GCATATCAAA AAGAAGACGG AAAGGAGGTA CGTGTATGA
|
Protein sequence | MSLRESEIKS GPACATVSLG NYRFSDPPDP SKWADCVHCG MCLESCPTYE QTGQEQHSPR GRVHLIKSVA EGKLEITEDL IDPIFMCLDC RACTTACPAD VDVGGLIEEV RGQIRQAIPL TGWKAFVNNF FLKGVFPHPP RLHLLGSLLK LYQKSGLQMI ARKTKLLHIM PKHLVEMEAI LPEAGVPVRK KYKHMNVIKA KGETKHTVAL LTGCVMDVMF SDINEATIRV LTRNGNDVVI PQNQTCCGAL HVHAGDREMG RKLAKQNIEA FQHADRVIVN AAGCGSMLKE YPELFRNDPE WREKAEDFSR KVEDISKYLH DTGYEKPKAE LNVRITYHDA CHLAHGQGIR QEPRTILSSI PGVEMVSMPN ADRCCGSAGI YNLTHPDMAS AVLESKMENV PEDVELITMG NPGCMLQMAM GVLKYGRNQK VVHTVQLLDW AYQKEDGKEV RV
|
| |