Gene Msed_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1040 
Symbol 
ID5104339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp965266 
End bp966531 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content49% 
IMG OID640506936 
ProductAAA ATPase 
Protein accessionYP_001191129 
Protein GI146303813 
COG category[R] General function prediction only 
COG ID[COG1373] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATGA TATCGCAGAT GAGCTTTCAA AACCCCTGGT GGACTCAACC CTCATCCATT 
GATGATGACG ATCACGTGAG GAGGGCAAAG TATTACCTCC CACCCGTGAG GGAGAACCTG
CTTATCCTAG GTCCGAGACA GGTGGGGAAA ACTACTTACA TGAAGACCGT GATCAGGGAT
CTACTGAGGG AGGTGGAGCC CAGGAAGGTG TTCTATTTCT CCTGCGACTC ACTCTCCAGG
AAGGACGAGT TAATCCAGCT ACTTAACGAG TATCGAACCC TTGTGAACGG AGATGAGGCC
TTCATATTTC TCGACGAGAT CACGTCAGTA GATGCGTGGA ACATGGGCCT TCTTCACCTC
TTTAACGCAG GTTATTTCAG AAACTCCTTG GTTTACGTGT CTGGATCCTC CTCTCTTAAC
CTGAGTAGGG AAACTCTCCC GGGTAGACCG CTCAAGAAGG TCGTGTATTA TCCGCTCAAC
TTTAGGGTTT ACTTTGACCT TTTTACACGG AAATTGGACG TCCCCACACT CCCCGTGACC
AGTCCCCATG AGATCATGAA GGAGGCGAAA AAGCTACTAC CACACCTCTC GGCCCTCAAC
AAGGCCCTAT TAAGTTACGT TGAAAGGGGA GGATTCTTCG CCACAAATCT AAGCTCTGCC
TCGCTGTATG AAACGTATAG GGACACCGTT CTAAGCGAGA TCGCGAAGAC TGGGAGGAGT
GAGGCCCTCT TCAAGCAGGT GATTTCCAGG ATAATCGAGA GTTATGGTAG CAGAATTTCA
GACAACGGGA TATCCAAGGA GATTTCGGCA TCCCACACGA CGGTATCTGA ATACCTGGAG
CTATTGGAGA GGTTGTTCAT TACGAGAACC TATAGGAAAT GGGAAAATGG GAGGGTGAAC
TATAGGTCCT TAAAGAAGGT CTACATGATA GATCCCTTCC TTTTTAGGGT AATGAAGAGG
TATTCCCTGG GGAAGGACCT GGAGACGGAG GACATACCCC ACGTGATCGA GGGAATAGTT
GGGGAGCACC TATCTAGGGA GTACGCAGAG AGCCTCTTCA CCTTCTTCAA GGACGGTAGA
GAGATCGACT TTCTAGTTAG GGGGATTGGG ATTGAGGTTA AATGGAGTGA ACGGGTGAGG
TCTAGGCCTA AAGCACCAGA GTACGTTCTT ACCATGGACG AGTTTGATGA GGAAAGGAGG
TTAATTCCCG TGTCCCTATT CCTTTACCTC ATTTCCTCGG ACAAGGTGTT TTACGACCTG
GGTTAG
 
Protein sequence
MSMISQMSFQ NPWWTQPSSI DDDDHVRRAK YYLPPVRENL LILGPRQVGK TTYMKTVIRD 
LLREVEPRKV FYFSCDSLSR KDELIQLLNE YRTLVNGDEA FIFLDEITSV DAWNMGLLHL
FNAGYFRNSL VYVSGSSSLN LSRETLPGRP LKKVVYYPLN FRVYFDLFTR KLDVPTLPVT
SPHEIMKEAK KLLPHLSALN KALLSYVERG GFFATNLSSA SLYETYRDTV LSEIAKTGRS
EALFKQVISR IIESYGSRIS DNGISKEISA SHTTVSEYLE LLERLFITRT YRKWENGRVN
YRSLKKVYMI DPFLFRVMKR YSLGKDLETE DIPHVIEGIV GEHLSREYAE SLFTFFKDGR
EIDFLVRGIG IEVKWSERVR SRPKAPEYVL TMDEFDEERR LIPVSLFLYL ISSDKVFYDL
G