Gene GWCH70_1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1949 
Symbol 
ID7978773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2008180 
End bp2009166 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content49% 
IMG OID644798777 
Producturea amidolyase related protein 
Protein accessionYP_002949947 
Protein GI239827323 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000276448 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGA TTCAAGTAAT ACATAGCGGA TTTTTCACAA CCGTGCAGGA CGGAGGACGC 
TTCGGATATC AAAAAGCCGG TGTATCCGTT GGAGGAGTGA TGGATTCATT TGCGAGCCGG
ATTGCGAATT TGCTTGTCGA AAATGACGCG AATGAGGCGA CATTAGAAAT TACGATGAAC
GGCCCGACGC TTCGCTTTGA AACGGATGCG CTCGTTGCAA TTTGCGGCGG CGTGTTTCGC
TGTGCATTAA ACGGCGAACC AATATCGATG TGGAAGCCGC TTGTCATCCG ACGCGATGAT
GTGTTATCGA TCGGGGCGTG TCAAGGCGGC TATCGCGCAT ATATCGCGTT TGCGGGCGGG
CTTAACATTC CGATTGTGAT GAACAGCCGT TCTACCCATG TACAAGCGCA CATCGGCGGT
TTTCATGGAA GAGCGCTGCA GCCGGGGGAT GTATTGTCCC TGAGATCCAA GACGATAACG
ATTCCAAAAA ATCCTATTCG TTGGGGGATT GCGTTTTCTG CGAGAAATTA CATAAAAGGG
AAAAGAAAAA TGATACGTGT CGTGAAAGGC CCGGAATATA ACATGTTTAC CGAGCAAAGC
CTAGAGCGAT TTTTTTCTTC CGCGTATGAA GTGACGACGC AATCGGACCG CATGGGCTAT
CGCTTGCAAG GGCAAGCTCT TGAGCGCAAG ACGAATCAAG AAATGATTTC GGAAGCGGTC
ACGTTTGGAA CGGTGCAAGT GCCTGCTTCC GGCCAGCCGA TTGTATTGAT GGCCGATTGC
CAGCCGACAG GAGGGTATCC GCGCATCGCT CAAGTAATTA GCGTAGACCT TCCGATTTTA
GCGCAGGCGC GCCCAGGCGA TCATATTCAA TTTCAAAAAG TATCGTGGCA AGAAGCACAA
CGGTTATATA TCGAACGAGA GCAAGAAATA AAAAAATGGA AAATGATCAT TCATCAAAAA
TGGAGGGAAA TCGGCTATGC GCGTTGA
 
Protein sequence
MSTIQVIHSG FFTTVQDGGR FGYQKAGVSV GGVMDSFASR IANLLVENDA NEATLEITMN 
GPTLRFETDA LVAICGGVFR CALNGEPISM WKPLVIRRDD VLSIGACQGG YRAYIAFAGG
LNIPIVMNSR STHVQAHIGG FHGRALQPGD VLSLRSKTIT IPKNPIRWGI AFSARNYIKG
KRKMIRVVKG PEYNMFTEQS LERFFSSAYE VTTQSDRMGY RLQGQALERK TNQEMISEAV
TFGTVQVPAS GQPIVLMADC QPTGGYPRIA QVISVDLPIL AQARPGDHIQ FQKVSWQEAQ
RLYIEREQEI KKWKMIIHQK WREIGYAR