Gene Information |
Locus tag | Tmz1t_1398 |
Symbol | |
ID | 7084519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1556510 |
End bp | 1558429 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643698415 |
Product | sulfate adenylyltransferase, large subunit |
Protein accession | YP_002355053 |
Protein GI | 217969819 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2895] GTPases - Sulfate adenylate transferase subunit 1 |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR00455] adenylylsulfate kinase (apsK); [TIGR02034] sulfate adenylyltransferase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0369421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACACG TTTCCGACCT GATCGCGACC GACATCGAGC AGTACCTCAG GGCGCACGAG AACAAGAGCC TGCTGCGCTT CATCACCTGC GGCAGCGTGG ACGACGGCAA GAGCACGCTG ATCGGGCGCC TGCTCTACGA GTCGAAGATG CTCTTCGAGG ACCAGCTCGC CGCGGTCGAG GCCGACTCGA AGAAGTACGG CACCCAGGGC GAGGCGATCG ACTTCGCGCT GCTGGTCGAC GGCCTGGCCG CCGAGCGCGA GCAGGGCATC ACGATCGACG TGGCGTATCG CTTCTTCTCC ACCGACAAGC GCAAGTTCAT CGTCGCCGAC ACCCCCGGCC ACGAGCAATA CACCCGCAAC ATGGTCACCG GCGCCTCGAC GGCCGACGCG GCCATCCTGA TGGTCGATGC GCGCAAGGGC ATCCTCACCC AGACGCGGCG TCACAGCTAC CTGGTGAACC TCATCGGCAT CCGCCACATC GTGGTGGCGA TCAACAAGAT GGACCTGGTC GATTACGCCG AGGGCGTGTT CCGCCGCATC GTCGAGGACT ACACCGCCTT CTCCCGCCAG CTCGGCATCG AGCAGGTGAC CTTCATCCCG ATGTCGGCCT TCAGGGGCGA CAACATCACC TCTCCGAGCG CTGCGATGCC CTGGTACCAC GGCACCACGC TGATGGGCTA CCTCGAGACG GTGGAGGTCG ATGACGGCCT GATGCAGCGC GCGCCCTTCC GCCTGCCGGT GCAGTGGGTC AATCGCCCCA ACCTGGATTT CCGCGGCTTT GCCGGCTCCA TCGCCGGCGG CACCATCCGC CCGGGCGATC GTGTGCGCGT ACAGCCCTCG GGGCGCGAGA GCACGGTGGC GCGCATCGTG ACCCGCGACG GCGATCTCGA CCGGGCCGTG GCCGGGCAGT CGGTCACGCT GACCCTCGCC GACGAGATCG ACATCTCGCG CGGCGACGTC ATCTCGACCG TCGAGGCGCC GGCCGAGGTC GCCGACCAGT TCGAGTGCAC GGTGGTGTGG ATGCACGACG AGCCCATGCT CGCCGGCCGC CCTTACCTGC TCAAGATCGG CGCGCGCACG GTCAGCGCGA CGATCACCGA GATCAAGTAC CAGGTGAATG TGAACACCCT CGAGCACGTC GCCGCCAAGC GGCTCGAGCT CAACGCGATC GGCGTGTGCA ACCTGAGCCT GGATCGGCCG ATCGCCTTCG ATCCCTACCG GATCAACCGC GACACCGGCG GCTTCATCCT GATCGACCGC CTCTCCAACA ACACCGTCGG CGCCGGCCTG TTGCACTTCG CGCTGCGTCG CGCGCACAAC ATCCACCTGC AGCACGTCGA TGTGGACAAG CGCGCGCGTG CCGCGCTGAA GAACCAGCGC GGCTGCGTGC TGTGGTTCAC CGGCCTGTCC GGCGCAGGGA AGTCCTCGAT CGCCAACCTG GTCGAGAAGA AACTGCACGC GCTCGGCCAC CACACCTACC TGCTCGACGG CGACAATGTG CGCCATGGCC TCAACAAGGA CCTCGGCTTC ACCGACGCCG ATCGCGTGGA GAACATCCGT CGCGTCGCCG AGGTCGCGAA GCTGATGGTC GATGCAGGCC TGATCGTGCT CACCGCCTTC ATCTCGCCTT TCCGCTCCGA GCGCCGCATG GCGCGCGGCC TGGTGGAGGA GGGCGAGTTC GTCGAGGTCT TCGTCGACAC TCCGCTGGAG GTGGCCGAGG CGCGCGATCC CAAGGGCCTG TACAAGAAGG CCCGGCGCGG CGAGCTGAAG AACTTCACCG GCATCGATTC GCCTTACGAG 
GCGTCGGAGG ACCCCGAACT GCGCCTCGAT ACCACCCGCC TCGACCTGGA GGCCGCGGCC GATGCGGTGC TTGCCTGGCT GGGCGAGCGC GGCATGCTGA GCCGGACGCC GGGGGGCTGA
|
Protein sequence | MAHVSDLIAT DIEQYLRAHE NKSLLRFITC GSVDDGKSTL IGRLLYESKM LFEDQLAAVE ADSKKYGTQG EAIDFALLVD GLAAEREQGI TIDVAYRFFS TDKRKFIVAD TPGHEQYTRN MVTGASTADA AILMVDARKG ILTQTRRHSY LVNLIGIRHI VVAINKMDLV DYAEGVFRRI VEDYTAFSRQ LGIEQVTFIP MSAFRGDNIT SPSAAMPWYH GTTLMGYLET VEVDDGLMQR APFRLPVQWV NRPNLDFRGF AGSIAGGTIR PGDRVRVQPS GRESTVARIV TRDGDLDRAV AGQSVTLTLA DEIDISRGDV ISTVEAPAEV ADQFECTVVW MHDEPMLAGR PYLLKIGART VSATITEIKY QVNVNTLEHV AAKRLELNAI GVCNLSLDRP IAFDPYRINR DTGGFILIDR LSNNTVGAGL LHFALRRAHN IHLQHVDVDK RARAALKNQR GCVLWFTGLS GAGKSSIANL VEKKLHALGH HTYLLDGDNV RHGLNKDLGF TDADRVENIR RVAEVAKLMV DAGLIVLTAF ISPFRSERRM ARGLVEEGEF VEVFVDTPLE VAEARDPKGL YKKARRGELK NFTGIDSPYE ASEDPELRLD TTRLDLEAAA DAVLAWLGER GMLSRTPGG
|
| |
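The length fields in this record are mutually consistent, and the check can be sketched in a few lines of Python. This is only an illustrative sketch: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive (which matches End bp - Start bp + 1 = 1920), and the final codon (TGA here) is assumed to be a stop codon that is not counted in the protein length. The GC helper simply ignores the spaces that group the sequence into blocks of ten.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(1556510, 1558429)
    print(length)                     # 1920, matching "Gene Length"
    print(protein_length(length))     # 639, matching "Protein Length"
    print(gc_percent("ATGGCACACG"))   # 60.0 for the first block only
```

Passing the full "Gene sequence" field to `gc_percent` should give a value close to the 67% reported above, although the record does not say how that figure was rounded.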