Gene GWCH70_0412 details

Gene Information

Locus tag: GWCH70_0412
Symbol: sat
ID: 7978570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 468649
End bp: 469809
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 47%
IMG OID: 644797398
Product: sulfate adenylyltransferase
Protein accession: YP_002948598
Protein GI: 239825974
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2046] ATP sulfurylase (sulfate adenylyltransferase)
TIGRFAM ID: [TIGR00339] ATP sulphurylase
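
The reported gene and protein lengths follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates taken from the record above (assumed 1-based, inclusive).
start_bp = 468649
end_bp = 469809

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be the stop codon, which does not add a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1161 bp
print(protein_length, "aa")   # 386 aa
```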


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTAA GCATTCCACA CGGAGGCACA TTAATCGATC GATGGAATCC TAGCTATCCA 
TTGGATACAC TGACAAAAGA AATTGAGCTG ACAAATGCCG AGTTAAGTGA TTTAGAATTG
ATCGGAACAG GCGCTTACAG CCCGCTTACC GGATTTTTAA CAAAAGAGGA TTATGACTCT
GTCGTTGAAA CGATGCGCTT AACCAACGGT ACAGTATGGA GCATTCCAAT TACCCTTGCT
GTAACCGAAG AAAAGGCAAA AGAGATTTCT GCCGGCGAAA CGGCCAAACT CGTTTACAAC
GGAGAAGTCT ATGGAGTCAT TGATATTCAA GAAATCTATC AACCTGATAA AACAAAAGAA
GCACTTCTCG TTTATAAAAC GGATGAACTC AAACATCCAG GCGTACGGAA ACTATTTGAA
AAGCCAAATG TATATGTAGG CGGCCCGATT ACATTAGTAA AACGCACTGA TAAAGGGCGT
TTCGCTCCAT TCTACTTCGA TCCGGCCGAA ACACGTAAGC GTTTTGCCGA GCTTGGCTGG
AACACGGTGG TCGGCTTTCA AACGCGAAAT CCTGTCCATC GCGCCCATGA ATACATCCAA
AAATGCGCGC TCGAAATTGT CGACGGCCTC TTTTTAAACC CGCTCGTCGG CGAAACAAAA
GCGGATGACA TCCCAGCGGA CATCCGCATG GAAAGCTATC AAGTATTGTT GGAAAACTAT
TATCCGAAAG ACCGCGTGTT TTTAGGAGTA TTCCAGGCAG CGATGCGCTA CGCCGGGCCA
CGGGAAGCGA TTTTCCATGC GATGGTGCGC AAAAACTTCG GCTGCACCCA CTTCATTGTC
GGCCGCGACC ATGCCGGCGT CGGTGATTAC TATGGCACAT ATGATGCGCA AAAAATCTTC
TTGAATTTTA CGCCGGAAGA ACTTGGCATT ACGCCGCTGT TTTTCGAACA TAGCTTTTAT
TGCACGAAAT GCGAAGGAAT GGCGTCGACG AAAACATGCC CGCATGATCC GAAATACCAT
GTTGTATTAT CCGGCACAAA AGTGCGGGAA ATGCTGCGCA ATGGCCAAGT GCCGCCAAGC
ACGTTCAGCC GTCCGGAAGT CGCGGCAGTA TTGATTAAAG GATTGCAGCA GCGCGAGGCT
GTCACCTCAT CTACACGTTA A
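
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the space-separated 10-base groups used above; the function name `gc_content` is hypothetical. Running it on the full 1161 bp sequence gives a value to compare against the reported 47%.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base groups, line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Only the first line of the gene sequence is shown here; paste the
# whole sequence to obtain the figure for the full gene.
first_line = "ATGAGTTTAA GCATTCCACA CGGAGGCACA TTAATCGATC GATGGAATCC TAGCTATCCA"
print(f"{gc_content(first_line):.1f}% GC over the first 60 bases")
```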
 
Protein sequence
MSLSIPHGGT LIDRWNPSYP LDTLTKEIEL TNAELSDLEL IGTGAYSPLT GFLTKEDYDS 
VVETMRLTNG TVWSIPITLA VTEEKAKEIS AGETAKLVYN GEVYGVIDIQ EIYQPDKTKE
ALLVYKTDEL KHPGVRKLFE KPNVYVGGPI TLVKRTDKGR FAPFYFDPAE TRKRFAELGW
NTVVGFQTRN PVHRAHEYIQ KCALEIVDGL FLNPLVGETK ADDIPADIRM ESYQVLLENY
YPKDRVFLGV FQAAMRYAGP REAIFHAMVR KNFGCTHFIV GRDHAGVGDY YGTYDAQKIF
LNFTPEELGI TPLFFEHSFY CTKCEGMAST KTCPHDPKYH VVLSGTKVRE MLRNGQVPPS
TFSRPEVAAV LIKGLQQREA VTSSTR