Gene GWCH70_1915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1915 
Symbol 
ID7978741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1971555 
End bp1972913 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content47% 
IMG OID644798746 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_002949916 
Protein GI239827292 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000454942 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTGC GAGAAAGTGA GATAAAAAGC GGACCGGCAT GTGCGACTGT TAGTCTTGGA 
AACTATCGTT TTAGTGATCC TCCAGATCCT AGCAAATGGG CTGACTGCGT TCATTGCGGC
ATGTGCTTAG AATCGTGTCC GACATACGAA CAAACTGGTC AAGAACAACA TTCGCCGCGC
GGACGCGTGC ATTTGATAAA ATCGGTAGCA GAAGGGAAGT TAGAGATTAC AGAAGATTTG
ATTGATCCCA TTTTTATGTG CCTAGATTGT CGGGCTTGCA CAACTGCGTG CCCTGCCGAT
GTCGATGTCG GCGGATTGAT TGAAGAAGTA CGCGGACAAA TCCGGCAAGC GATTCCGCTG
ACAGGATGGA AAGCGTTTGT CAACAATTTC TTCTTAAAAG GTGTATTTCC TCATCCACCC
CGTCTTCATC TGCTCGGGAG TTTGTTAAAG CTATATCAAA AAAGCGGACT GCAGATGATC
GCCCGCAAGA CGAAACTGCT TCACATCATG CCGAAGCATC TAGTGGAGAT GGAAGCGATT
TTGCCGGAAG CAGGCGTGCC GGTCCGAAAA AAATATAAAC ATATGAATGT CATTAAAGCA
AAAGGAGAAA CGAAACATAC GGTCGCGCTT TTAACAGGAT GCGTCATGGA TGTAATGTTC
AGCGATATTA ACGAGGCGAC GATTCGCGTG TTGACACGAA ACGGAAACGA TGTCGTCATT
CCGCAAAATC AAACGTGCTG CGGGGCGCTT CATGTGCATG CAGGGGACCG CGAGATGGGC
CGGAAGCTTG CCAAACAAAA TATTGAAGCA TTTCAACATG CAGACCGCGT GATCGTGAAC
GCGGCAGGGT GTGGCTCTAT GTTAAAGGAA TATCCAGAGT TGTTCCGCAA CGATCCAGAG
TGGCGGGAAA AAGCGGAAGA TTTTTCCCGT AAAGTCGAAG ATATTTCAAA ATATTTGCAT
GATACGGGGT ATGAAAAGCC GAAAGCGGAA TTAAATGTGC GCATTACGTA CCATGACGCT
TGCCATTTGG CGCACGGCCA AGGAATTCGT CAAGAACCGC GCACGATTTT ATCGAGCATT
CCGGGAGTGG AAATGGTTTC GATGCCAAAC GCCGACCGCT GCTGCGGAAG CGCGGGAATT
TATAATCTTA CCCATCCGGA TATGGCAAGC GCCGTATTGG AAAGCAAAAT GGAAAACGTT
CCAGAAGATG TGGAACTTAT TACAATGGGG AATCCTGGAT GCATGTTGCA AATGGCGATG
GGAGTTTTGA AATACGGCCG AAATCAAAAA GTCGTTCATA CCGTACAGCT CCTAGACTGG
GCATATCAAA AAGAAGACGG AAAGGAGGTA CGTGTATGA
 
Protein sequence
MSLRESEIKS GPACATVSLG NYRFSDPPDP SKWADCVHCG MCLESCPTYE QTGQEQHSPR 
GRVHLIKSVA EGKLEITEDL IDPIFMCLDC RACTTACPAD VDVGGLIEEV RGQIRQAIPL
TGWKAFVNNF FLKGVFPHPP RLHLLGSLLK LYQKSGLQMI ARKTKLLHIM PKHLVEMEAI
LPEAGVPVRK KYKHMNVIKA KGETKHTVAL LTGCVMDVMF SDINEATIRV LTRNGNDVVI
PQNQTCCGAL HVHAGDREMG RKLAKQNIEA FQHADRVIVN AAGCGSMLKE YPELFRNDPE
WREKAEDFSR KVEDISKYLH DTGYEKPKAE LNVRITYHDA CHLAHGQGIR QEPRTILSSI
PGVEMVSMPN ADRCCGSAGI YNLTHPDMAS AVLESKMENV PEDVELITMG NPGCMLQMAM
GVLKYGRNQK VVHTVQLLDW AYQKEDGKEV RV