Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0412 |
Symbol | sat |
ID | 7978570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 468649 |
End bp | 469809 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644797398 |
Product | sulfate adenylyltransferase |
Protein accession | YP_002948598 |
Protein GI | 239825974 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2046] ATP sulfurylase (sulfate adenylyltransferase) |
TIGRFAM ID | [TIGR00339] ATP sulphurylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAA GCATTCCACA CGGAGGCACA TTAATCGATC GATGGAATCC TAGCTATCCA TTGGATACAC TGACAAAAGA AATTGAGCTG ACAAATGCCG AGTTAAGTGA TTTAGAATTG ATCGGAACAG GCGCTTACAG CCCGCTTACC GGATTTTTAA CAAAAGAGGA TTATGACTCT GTCGTTGAAA CGATGCGCTT AACCAACGGT ACAGTATGGA GCATTCCAAT TACCCTTGCT GTAACCGAAG AAAAGGCAAA AGAGATTTCT GCCGGCGAAA CGGCCAAACT CGTTTACAAC GGAGAAGTCT ATGGAGTCAT TGATATTCAA GAAATCTATC AACCTGATAA AACAAAAGAA GCACTTCTCG TTTATAAAAC GGATGAACTC AAACATCCAG GCGTACGGAA ACTATTTGAA AAGCCAAATG TATATGTAGG CGGCCCGATT ACATTAGTAA AACGCACTGA TAAAGGGCGT TTCGCTCCAT TCTACTTCGA TCCGGCCGAA ACACGTAAGC GTTTTGCCGA GCTTGGCTGG AACACGGTGG TCGGCTTTCA AACGCGAAAT CCTGTCCATC GCGCCCATGA ATACATCCAA AAATGCGCGC TCGAAATTGT CGACGGCCTC TTTTTAAACC CGCTCGTCGG CGAAACAAAA GCGGATGACA TCCCAGCGGA CATCCGCATG GAAAGCTATC AAGTATTGTT GGAAAACTAT TATCCGAAAG ACCGCGTGTT TTTAGGAGTA TTCCAGGCAG CGATGCGCTA CGCCGGGCCA CGGGAAGCGA TTTTCCATGC GATGGTGCGC AAAAACTTCG GCTGCACCCA CTTCATTGTC GGCCGCGACC ATGCCGGCGT CGGTGATTAC TATGGCACAT ATGATGCGCA AAAAATCTTC TTGAATTTTA CGCCGGAAGA ACTTGGCATT ACGCCGCTGT TTTTCGAACA TAGCTTTTAT TGCACGAAAT GCGAAGGAAT GGCGTCGACG AAAACATGCC CGCATGATCC GAAATACCAT GTTGTATTAT CCGGCACAAA AGTGCGGGAA ATGCTGCGCA ATGGCCAAGT GCCGCCAAGC ACGTTCAGCC GTCCGGAAGT CGCGGCAGTA TTGATTAAAG GATTGCAGCA GCGCGAGGCT GTCACCTCAT CTACACGTTA A
|
Protein sequence | MSLSIPHGGT LIDRWNPSYP LDTLTKEIEL TNAELSDLEL IGTGAYSPLT GFLTKEDYDS VVETMRLTNG TVWSIPITLA VTEEKAKEIS AGETAKLVYN GEVYGVIDIQ EIYQPDKTKE ALLVYKTDEL KHPGVRKLFE KPNVYVGGPI TLVKRTDKGR FAPFYFDPAE TRKRFAELGW NTVVGFQTRN PVHRAHEYIQ KCALEIVDGL FLNPLVGETK ADDIPADIRM ESYQVLLENY YPKDRVFLGV FQAAMRYAGP REAIFHAMVR KNFGCTHFIV GRDHAGVGDY YGTYDAQKIF LNFTPEELGI TPLFFEHSFY CTKCEGMAST KTCPHDPKYH VVLSGTKVRE MLRNGQVPPS TFSRPEVAAV LIKGLQQREA VTSSTR
|
| |