Gene GWCH70_3176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3176 
Symbol 
ID7977029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3204852 
End bp3206081 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content50% 
IMG OID644799961 
Productallantoate amidohydrolase 
Protein accessionYP_002951100 
Protein GI239828476 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.183247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTAATG CAGATCGGCT TTGGAATCGG CTCATGGAAT TAGGAGAGAT TGGAAAACAG 
CCGTCAGGCG GAATTACTCG TTTATCGTTT ACAAAAGAAG AACGTGCCGC AAAGGAAAAA
GTAGCTTCGT ATATGAAGGA AGCAGGGCTT GCCGTGTATG AAGATGCGGT TGGAAATTTG
CTTGGACGTA AAGAAGGAAA AGATCCGGAG GCCGCTGTCG TGCTGGTCGG TTCGCACTTA
GACTCCGTCT ATAACGGAGG AATGTTTGAC GGTCCGCTCG GAGTGCTTTC CGCGGTGGAA
GTGTTACAAA CGATGAACGA ACGAGGGGTG GAAACGAAGC ATCCGATTGA AGTCGTTGCT
TTTACCGATG AAGAAGGAGC ACGCTTTAGT TACGGTATGA TCGGCAGCCG TGGAATGGCG
GGAACATTGT CGGAGGAAGA ACTCGTTCAT CAAGATAAAC ATGGAATTTC GATTGCCGAA
GCGATGAAAG CAGCGGGGCT TGACCCCAGT GAAATAGGCA AGGCTGCGCG GCGAAAAGGA
TCAGTAAAAG CTTATGTCGA GTTACATATT GAACAAGGGC GTGTTTTGGA ACAAGCGAAT
CTTCCTGTCG GAATTGTCAC AGGGATCGCC GGGCTTGTAT GGGCGAAATT TACGGTGGAA
GGAAAAGCGG AACATGCCGG GGCAACGCCA ATGCCAATCC GGCGCGATCC GCTTGTTGCC
GCAGCACAGA TCATCCAAAT GATCGAACAA GAGGCGAAAA AGACAGGAAC CACCGTGGGA
ACCGTTGGGC AAATGCAGGT GTTCCCGGGA GGAATTAACG TCATTCCGGC ACGAGTCGAA
TTTTCCTTAG ATTTGCGGGA TATTGACGCG GCAGTGCGCG ATAACGTATT CCAGTCGATT
ATTGAACGAG CGCAACAAAT TGGCCAAGAG AGAAATGTAA AGGTCACTGT CGAGCGGCTG
CAAGAGATGC CTCCGGTATT ATGTTCCGAA CTTGTGCAAA ATGCAGCGAA GGAAGCGTGT
AAACAACTAG GTTTTGATGT GTTCTCCCTT CCTAGCGGCG CTGCCCATGA CGGGGTGCAG
CTCGTGGATC TTTGCCCGAT CGGGATGATT TTTGTCCGCT CGAAAGATGG GATCAGCCAT
AGCCCGGAGG AATGGAGTTC AAAGGAAGAT TGTGCGGCCG GTGCGAACGT ATTGTACCAT
ACCGTATTGC GTTTAGCGAT GGGAGAATAA
 
Protein sequence
MINADRLWNR LMELGEIGKQ PSGGITRLSF TKEERAAKEK VASYMKEAGL AVYEDAVGNL 
LGRKEGKDPE AAVVLVGSHL DSVYNGGMFD GPLGVLSAVE VLQTMNERGV ETKHPIEVVA
FTDEEGARFS YGMIGSRGMA GTLSEEELVH QDKHGISIAE AMKAAGLDPS EIGKAARRKG
SVKAYVELHI EQGRVLEQAN LPVGIVTGIA GLVWAKFTVE GKAEHAGATP MPIRRDPLVA
AAQIIQMIEQ EAKKTGTTVG TVGQMQVFPG GINVIPARVE FSLDLRDIDA AVRDNVFQSI
IERAQQIGQE RNVKVTVERL QEMPPVLCSE LVQNAAKEAC KQLGFDVFSL PSGAAHDGVQ
LVDLCPIGMI FVRSKDGISH SPEEWSSKED CAAGANVLYH TVLRLAMGE