Gene Tmz1t_1398 details


Gene Information

Locus tag: Tmz1t_1398
Symbol:
ID: 7084519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1556510
End bp: 1558429
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 67%
IMG OID: 643698415
Product: sulfate adenylyltransferase, large subunit
Protein accession: YP_002355053
Protein GI: 217969819
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2895] GTPases - Sulfate adenylate transferase subunit 1
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
    [TIGR00455] adenylylsulfate kinase (apsK)
    [TIGR02034] sulfate adenylyltransferase, large subunit
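
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1556510, 1558429)
print(length)                  # 1920
print(protein_length(length))  # 639
```

Both values agree with the Gene Length (1920 bp) and Protein Length (639 aa) fields of the record.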


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0369421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACACG TTTCCGACCT GATCGCGACC GACATCGAGC AGTACCTCAG GGCGCACGAG 
AACAAGAGCC TGCTGCGCTT CATCACCTGC GGCAGCGTGG ACGACGGCAA GAGCACGCTG
ATCGGGCGCC TGCTCTACGA GTCGAAGATG CTCTTCGAGG ACCAGCTCGC CGCGGTCGAG
GCCGACTCGA AGAAGTACGG CACCCAGGGC GAGGCGATCG ACTTCGCGCT GCTGGTCGAC
GGCCTGGCCG CCGAGCGCGA GCAGGGCATC ACGATCGACG TGGCGTATCG CTTCTTCTCC
ACCGACAAGC GCAAGTTCAT CGTCGCCGAC ACCCCCGGCC ACGAGCAATA CACCCGCAAC
ATGGTCACCG GCGCCTCGAC GGCCGACGCG GCCATCCTGA TGGTCGATGC GCGCAAGGGC
ATCCTCACCC AGACGCGGCG TCACAGCTAC CTGGTGAACC TCATCGGCAT CCGCCACATC
GTGGTGGCGA TCAACAAGAT GGACCTGGTC GATTACGCCG AGGGCGTGTT CCGCCGCATC
GTCGAGGACT ACACCGCCTT CTCCCGCCAG CTCGGCATCG AGCAGGTGAC CTTCATCCCG
ATGTCGGCCT TCAGGGGCGA CAACATCACC TCTCCGAGCG CTGCGATGCC CTGGTACCAC
GGCACCACGC TGATGGGCTA CCTCGAGACG GTGGAGGTCG ATGACGGCCT GATGCAGCGC
GCGCCCTTCC GCCTGCCGGT GCAGTGGGTC AATCGCCCCA ACCTGGATTT CCGCGGCTTT
GCCGGCTCCA TCGCCGGCGG CACCATCCGC CCGGGCGATC GTGTGCGCGT ACAGCCCTCG
GGGCGCGAGA GCACGGTGGC GCGCATCGTG ACCCGCGACG GCGATCTCGA CCGGGCCGTG
GCCGGGCAGT CGGTCACGCT GACCCTCGCC GACGAGATCG ACATCTCGCG CGGCGACGTC
ATCTCGACCG TCGAGGCGCC GGCCGAGGTC GCCGACCAGT TCGAGTGCAC GGTGGTGTGG
ATGCACGACG AGCCCATGCT CGCCGGCCGC CCTTACCTGC TCAAGATCGG CGCGCGCACG
GTCAGCGCGA CGATCACCGA GATCAAGTAC CAGGTGAATG TGAACACCCT CGAGCACGTC
GCCGCCAAGC GGCTCGAGCT CAACGCGATC GGCGTGTGCA ACCTGAGCCT GGATCGGCCG
ATCGCCTTCG ATCCCTACCG GATCAACCGC GACACCGGCG GCTTCATCCT GATCGACCGC
CTCTCCAACA ACACCGTCGG CGCCGGCCTG TTGCACTTCG CGCTGCGTCG CGCGCACAAC
ATCCACCTGC AGCACGTCGA TGTGGACAAG CGCGCGCGTG CCGCGCTGAA GAACCAGCGC
GGCTGCGTGC TGTGGTTCAC CGGCCTGTCC GGCGCAGGGA AGTCCTCGAT CGCCAACCTG
GTCGAGAAGA AACTGCACGC GCTCGGCCAC CACACCTACC TGCTCGACGG CGACAATGTG
CGCCATGGCC TCAACAAGGA CCTCGGCTTC ACCGACGCCG ATCGCGTGGA GAACATCCGT
CGCGTCGCCG AGGTCGCGAA GCTGATGGTC GATGCAGGCC TGATCGTGCT CACCGCCTTC
ATCTCGCCTT TCCGCTCCGA GCGCCGCATG GCGCGCGGCC TGGTGGAGGA GGGCGAGTTC
GTCGAGGTCT TCGTCGACAC TCCGCTGGAG GTGGCCGAGG CGCGCGATCC CAAGGGCCTG
TACAAGAAGG CCCGGCGCGG CGAGCTGAAG AACTTCACCG GCATCGATTC GCCTTACGAG
GCGTCGGAGG ACCCCGAACT GCGCCTCGAT ACCACCCGCC TCGACCTGGA GGCCGCGGCC
GATGCGGTGC TTGCCTGGCT GGGCGAGCGC GGCATGCTGA GCCGGACGCC GGGGGGCTGA
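
The record lists a GC content of 67% for this gene. As a rough sketch of how such a figure could be recomputed from the block-formatted sequence above, the helper below strips the spaces and line breaks and then counts G and C bases. The function names are hypothetical, and the database may round or compute the value differently.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input, not the full gene:
print(gc_content("ATGG CACG\n"))  # 62.5
```

Pasting the full gene sequence into `gc_content` as a single string would give the value to compare against the GC content field.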
 
Protein sequence
MAHVSDLIAT DIEQYLRAHE NKSLLRFITC GSVDDGKSTL IGRLLYESKM LFEDQLAAVE 
ADSKKYGTQG EAIDFALLVD GLAAEREQGI TIDVAYRFFS TDKRKFIVAD TPGHEQYTRN
MVTGASTADA AILMVDARKG ILTQTRRHSY LVNLIGIRHI VVAINKMDLV DYAEGVFRRI
VEDYTAFSRQ LGIEQVTFIP MSAFRGDNIT SPSAAMPWYH GTTLMGYLET VEVDDGLMQR
APFRLPVQWV NRPNLDFRGF AGSIAGGTIR PGDRVRVQPS GRESTVARIV TRDGDLDRAV
AGQSVTLTLA DEIDISRGDV ISTVEAPAEV ADQFECTVVW MHDEPMLAGR PYLLKIGART
VSATITEIKY QVNVNTLEHV AAKRLELNAI GVCNLSLDRP IAFDPYRINR DTGGFILIDR
LSNNTVGAGL LHFALRRAHN IHLQHVDVDK RARAALKNQR GCVLWFTGLS GAGKSSIANL
VEKKLHALGH HTYLLDGDNV RHGLNKDLGF TDADRVENIR RVAEVAKLMV DAGLIVLTAF
ISPFRSERRM ARGLVEEGEF VEVFVDTPLE VAEARDPKGL YKKARRGELK NFTGIDSPYE
ASEDPELRLD TTRLDLEAAA DAVLAWLGER GMLSRTPGG
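
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below is a minimal, dependency-free illustration: it builds the codon table from the usual compact TCAG ordering and stops at the first stop codon. Table 11 differs from the standard table only in its permitted start codons, which this sketch does not handle specially; that is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# paired with the amino acids they encode; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    dna = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence above:
print(translate("ATGGCACACG TTTCCGACCT GATCGCGACC"))  # MAHVSDLIAT
```

The output matches the first block of the protein sequence. Translating the full gene sequence the same way would be expected to reproduce the 639-residue protein, with the final TGA codon acting as the stop.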