Gene Athe_1194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1194 
Symbol 
ID7409668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1285367 
End bp1287061 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content33% 
IMG OID643715559 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_002573067 
Protein GI222529185 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00485] translation elongation factor TU
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCAA GAGAACTGCT CAAGATAGTA GTAGTAGGTC ATGTTGACCA TGGTAAATCA 
ACAATAATTG GAAGACTTTT GTATGATACT AAATCTGTCC CAGAGACTGC AATAGAAAGA
GTAAAGAGAA TAAGTAAAGA AAAGGGCAGA CCCTTTGAAT ATGCTTATCT TTTAGATGCA
TTAGAAGAAG AACAAAAACA AGGAATTACA ATTGATACAA CCCAGATAAA ATTTTCAACC
CCAAAGAGAG ACTATTTAAT TATAGATGCT CCCGGACACA AGGAATTTCT CAAAAACATG
GTTTCAGGTG CTGCAAATGC AGAAGCAGGA CTTCTTGTTA TTGACGCTAC TGAAGGTGTG
CAGGAACAAT CTAAAAGACA TGCATATATT CTATCGCTTT TGGGTATCCA AAAGGTATAT
GTTATAGTTA ATAAAATGGA TATGATAGGT TTTTCAGAGG AAAAGTTCAA AGAAATCAAA
TATGAAATCT CAACCTTTTT GGACAAGCTC AATGTATATC CCCAAAAGTA CATTCCAGTG
TCAGGCTTTT TAGGAGAGAA CATAACAAGA AAGTCTGATA AAATGCCATG GTATAAGGGT
GAGACTCTAC TTCAGGCTTT GGACCTTTTT GAGAAAGACA AAGAATTGGA AGATAGACTC
TTAAGATTTC CTATACAGGA TGTGTATAAA TTTGACCACA GGAGGATAAT TGCTGGCAGA
CTTGAATCTG GAAGGCTAAA AGTGGGAGAT GAAATAAAAA TCCTGCCAGA AGGAAAGGTT
AGCAAAGTCA AATCGATTGA ATTTTGGCCA GAAAATAATA AAAAGGATGA AGTAGTGGCA
GGAATGTCTA TTGGAATTAC TATTGAAGAT GAGTTCTTTA ACAAAAGAGG AGAAGTTATT
GTTCACAAAA ACGATGATAC TTTATATGTT TCAGATACAT TCAGGGCAAA TTTGTTTTGG
CTTGGAAAAA GAAATCTTGA GAAGAATAAA ACATATAAAC TAAAGCTTGT TACACAAGAG
ACAGAATGTG AAATTGTTTC TATAGACAAA GTTATAGATG CGACAACTCT TGAGACAGTA
GAAAATGCTC TTGATGTTCA AACAAACGAT GTTGCAGAAG TGACTATAAA AACTAAGGAA
AAGATTTGCT TTGATGAATT TAAAGTAAAT CCTACAACAG GTAGGTTTGT TTTAGTAGAC
GAGTATGATG TCTCTGGCGG TGGCATCATA TCAGGCTTAG CTAACCTAAA AGAAGAGGCA
ACTAAATTTG TAAAAGATGA CAAAGAGATG ATAGTTCATT GCTTTGATGA ATATTATTAT
TCATTGTCTG AAGGTCTTAT TAGAAAACAT CCTAAGTACA AAAGAACATT TAAAGTAGGA
GATGCTGTTC CTATTGATGG TGAAACTTAT TCTTACCCTG AAAGTTTTGA TGTCATTGAT
ATAAATGGTA AACTTGTTGC AAAAATTAGA AAAGGCCAAT TAAGCGACTT AGTAAATATT
GATCAGTATA TTTACAGTAA ACTTCCAATT ATAACAGCAG ATGGATTTTA CTTGAAAGTG
AACTCAATTG ATGAATTTGA GGATTTTAAG AATGAATTAG TTCAATTACA AAAGAACGAT
AACTTTCTTT TAGCCATGTT TGCTAACAAG TGGTATGATA TATCTGCAAA TAGAAATTTC
AAATTTACAG TCTGA
 
Protein sequence
MTARELLKIV VVGHVDHGKS TIIGRLLYDT KSVPETAIER VKRISKEKGR PFEYAYLLDA 
LEEEQKQGIT IDTTQIKFST PKRDYLIIDA PGHKEFLKNM VSGAANAEAG LLVIDATEGV
QEQSKRHAYI LSLLGIQKVY VIVNKMDMIG FSEEKFKEIK YEISTFLDKL NVYPQKYIPV
SGFLGENITR KSDKMPWYKG ETLLQALDLF EKDKELEDRL LRFPIQDVYK FDHRRIIAGR
LESGRLKVGD EIKILPEGKV SKVKSIEFWP ENNKKDEVVA GMSIGITIED EFFNKRGEVI
VHKNDDTLYV SDTFRANLFW LGKRNLEKNK TYKLKLVTQE TECEIVSIDK VIDATTLETV
ENALDVQTND VAEVTIKTKE KICFDEFKVN PTTGRFVLVD EYDVSGGGII SGLANLKEEA
TKFVKDDKEM IVHCFDEYYY SLSEGLIRKH PKYKRTFKVG DAVPIDGETY SYPESFDVID
INGKLVAKIR KGQLSDLVNI DQYIYSKLPI ITADGFYLKV NSIDEFEDFK NELVQLQKND
NFLLAMFANK WYDISANRNF KFTV