Gene GWCH70_1282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1282 
Symbol 
ID7976063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1332066 
End bp1333724 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content55% 
IMG OID644798226 
Producturocanate hydratase 
Protein accessionYP_002949399 
Protein GI239826775 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0391092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAACGA AACACCGGCC AGTGCAAGCA TACACCGGTT CTACTTTGCA TGCGAAAGGC 
TGGATTCAAG AGGCCGCGTT ACGAATGCTG AACAACAACT TACATCCAGA GGTGGCGGAG
CGCCCGGAAG ATTTGGTTGT CTACGGCGGC ATCGGCAAAG CGGCGCGCAA TTGGGAGTGC
TACGGGGCGA TTGTCGAAAC GCTATTAAAC TTAGAAAACG ATGAAACGCT GCTCATTCAG
TCGGGAAAGC CAGTCGCGGT ATTTAAAACG CACACGGATG CGCCAAGGGT GTTAATCGCC
AACTCGAATC TTGTGCCTGC TTGGGCGACG TGGGATCATT TCCACGAACT TGATAAAAAA
GGATTAATCA TGTATGGCCA AATGACGGCA GGAAGCTGGA TTTACATTGG CAGCCAAGGC
ATCGTGCAAG GCACGTACGA AACGTTTGCC GAGGTGGCGC GCCAGCACTA TGGCGGCACG
CTAAAAGGAA CAATTACGGT GACGGCAGGT CTTGGCGGCA TGGGGGGAGC ACAGCCGCTC
GCCGTCACGT TAAACGGCGG CGTCTGCATT GCCGTCGAAG TGGACCCAGC CCGCATCCAG
CGCCGCATTG ACACGAAATA TTTAGACACG ATGACCGATC GTCTCGATGT GGCGATCCAG
ATGGCGAAAA GGGCGAAGGA AGAAGGAAAA GCGCTATCGA TCGGCCTGCT TGGCAACGCG
GCAGAAGTGC TGCCGAAAAT GATCGAAATC GGCTTTATTC CGGACGTGTT GACAGACCAG
ACGTCCGCCC ACGATCCGCT TAACGGCTAC ATTCCGGCGG GCATGACGCT CGAGGAAGCG
GCTGAGCTGC GCCAGCGCGA TCCGAAGCAG TATATCCGCC GCGCCAAACA ATCGATCGCC
GAACATGTCA AAGCGATGCT CGCCATGCAG AAACAAGGCT CAGTGACATT CGATTACGGC
AACAATATCC GCCAAGTCGC GAAAGATGAA GGAGTGGAAG AGGCGTTTAA TTTTCCAGGT
TTTGTTCCCG CCTACATCCG TCCGCTCTTT TGCGAAGGGA AAGGGCCGTT CCGCTGGGTG
GCCCTGTCAG GAGATCCGGA AGACATCTAC AAAACCGATG AAGTCATTTT GCGCGAATTC
AGCGACAACC AACATTTGTG CAACTGGATC CGCATGGCGC GGGAAAAAAT CCAGTTTCAA
GGGCTGCCGG CGCGCATCTG CTGGCTCGGC TACGGCGAAC GGGCGAAATT TGGCAAAATC
ATTAACGACA TGGTGGCGAA AGGCGAGCTG AAAGCGCCGA TCGTCATCGG CCGCGATCAT
TTGGATTCCG GTTCTGTCGC CTCGCCAAAC CGCGAAACGG AAGGAATGAA AGACGGCAGT
GACGCGATCG CCGACTGGCC GATTTTAAAC GCGCTTCTTA ACGCGGTTGG CGGTGCAAGC
TGGGTATCGG TGCACCATGG CGGCGGCGTC GGCATGGGCT ATTCGATTCA TGCGGGAATG
GTCATTGTCG CCGATGGCAC GAAAGAAGCG GAAAAACGGC TCGAGCGCGT CTTGACGACC
GACCCGGGCC TTGGCGTTGT CCGCCACGCT GATGCTGGCT ATGAACTCGC TATCAAAACG
GCGAAAGAAA AAGGCATCCA TATGCCGATG CTGAAATAA
 
Protein sequence
MVTKHRPVQA YTGSTLHAKG WIQEAALRML NNNLHPEVAE RPEDLVVYGG IGKAARNWEC 
YGAIVETLLN LENDETLLIQ SGKPVAVFKT HTDAPRVLIA NSNLVPAWAT WDHFHELDKK
GLIMYGQMTA GSWIYIGSQG IVQGTYETFA EVARQHYGGT LKGTITVTAG LGGMGGAQPL
AVTLNGGVCI AVEVDPARIQ RRIDTKYLDT MTDRLDVAIQ MAKRAKEEGK ALSIGLLGNA
AEVLPKMIEI GFIPDVLTDQ TSAHDPLNGY IPAGMTLEEA AELRQRDPKQ YIRRAKQSIA
EHVKAMLAMQ KQGSVTFDYG NNIRQVAKDE GVEEAFNFPG FVPAYIRPLF CEGKGPFRWV
ALSGDPEDIY KTDEVILREF SDNQHLCNWI RMAREKIQFQ GLPARICWLG YGERAKFGKI
INDMVAKGEL KAPIVIGRDH LDSGSVASPN RETEGMKDGS DAIADWPILN ALLNAVGGAS
WVSVHHGGGV GMGYSIHAGM VIVADGTKEA EKRLERVLTT DPGLGVVRHA DAGYELAIKT
AKEKGIHMPM LK