Gene Msed_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1023 
Symbol 
ID5104326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp946368 
End bp948167 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content36% 
IMG OID640506922 
ProductATPase 
Protein accessionYP_001191115 
Protein GI146303799 
COG category[R] General function prediction only 
COG ID[COG0714] MoxR-like ATPases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0862859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATG ATTGGTTTAA GTTGAGACAA GGCTTGGAAT ATCTGGATAC ATATCTCACT 
ACGAACAATG AGAAAGAACT AAATGAGTCG ATTGATAGAA TAATAAGTCT AATAGATAAA
AAGATAATTA GTCAAACCGA TATTAAGGAA AATTCTGCTA GAGAAGAACC AAATCCAATA
TGTGTTTTTG TTGGAAACAA GGATAATTAT GAACATTGGA TGTACTCTTT TAGATATTCT
TTAGAAGCAG AAGTAAGTTA CATGCTGTGG GGGGATACAG TTAGCTCCAA GTCAAAGAGT
ACTCCGGACG AGGAGACACA TTTCGAATAT GGAACTTTAG TCGATGCCTA TCGAAAACAA
ATTAAAGACG GGAATATAGT AGACCCGCTT TTTGCAATAT TTTATCTAAA TAAATCATTC
TTTGGTTTTG GAATAATTAC GGACATTAAC TATGATATTT TTAGAAATTT TACGTATTGG
AAAGAGGTTA GTTTCGATAA AATATGGAAA ATGCGCGTTA GAATGAAAGT CCTATATATT
CACAAAAAGC TTAGGGACAA GCCTTTTGAA AATTGGGCGT CTTTTGATGA AATTTCGTTT
GATCCGTTAA CAGATGGGAA AATATCATTA AATGCCAATA ATTGCTATCG TAAAAAAGAA
GTAATGGAAT ACTTACTAAA TGAATATATT AAACCAAAAA AGGACGAGAT AAGGAATACC
CTACTCTTCT ATAGGGATAT CTATCAAAAA CTCCGCGCAT CTCAGGAGAG AAAGTTATCC
TCTCTCCAAA ACCTAACAAC TGGAAACCTT CAATTTAAAC CCCAAATCAG CTGCGTTAAG
ACTGGAGACA TAGTCCTAAA TGACCTTTAT CTAGGCACTG GACTAGAGAC AACTCAGTTT
AGCTCAATCT TGAAGGAGAG TATGCGTGGT GGAAATGTAC TGTTTGTAGG CCCCCCAGGT
GTCGGAAAAA CTGAACTGGC TACACGTCTC GCCCGTTATT ACGCTGGAGA CAATTGCTAT
ACAATAACGA CTGCAAATTC ACTATGGTTT AGGAGAGATG TTATAGGTGG TGAAACCATT
CAGGCAGGTT CTGTTATATG GAAGAGCGGA TTGCTAGTGA AAGCATATAA TAGGGCGGCC
GAAATTCCTT CCGCAAATAG CTTCGCGATA ATAATAGACG AGATAAATAG AGCTGACATA
GATAAGGCGT TTGGTGAATT TTTCACAATA TTTTCCAGTA CCGAACTTAG CAATTGGAAG
TTACCATCTT CCCTAGTTGA TGAGATTAAG AGTTACGGGA ATAACGTGGA TGAGGAAGCT
AGAAGATTCC TAGAGAATTA CGAGAGATTG GGAGATAAAC CACTGACTGG GCTAAGAATA
ATCGCCACCA TGAACCTAAT AGACTTTAGG AATCTCTTCG ACATTGGTAG TGCACTGACT
AGAAGGTTCT TTGTTTTTCA ATTTGAGTAC CCAAAGGGAA TTGAGGATAT ATCGAAACTA
AATCTTCAAG TAGATAAGGA GATAAAGGAC ATTATAAAAT GTCTGAGAGA GAAATTCTCG
TCAAGACCTA GAGGTGACTT ACTTGAGGGA TTTGATACCA GATCCGGATT TAATATATCG
CCTGCGTCCC TCAAGAAGGC AATAAATATT TATAATTCTA CTCAAAATAA AGATATACAT
ATATTTCGTG AGATATTAAG AAGCACGCTT GGAACCGTGA ACTTGAAGGA CTTGGAGAAC
TACAATAAAT ACTTTGAAGA ATGTGAGAAG AATGTTAATC AGGGACAAAC AACTAATTGA
 
Protein sequence
MYDDWFKLRQ GLEYLDTYLT TNNEKELNES IDRIISLIDK KIISQTDIKE NSAREEPNPI 
CVFVGNKDNY EHWMYSFRYS LEAEVSYMLW GDTVSSKSKS TPDEETHFEY GTLVDAYRKQ
IKDGNIVDPL FAIFYLNKSF FGFGIITDIN YDIFRNFTYW KEVSFDKIWK MRVRMKVLYI
HKKLRDKPFE NWASFDEISF DPLTDGKISL NANNCYRKKE VMEYLLNEYI KPKKDEIRNT
LLFYRDIYQK LRASQERKLS SLQNLTTGNL QFKPQISCVK TGDIVLNDLY LGTGLETTQF
SSILKESMRG GNVLFVGPPG VGKTELATRL ARYYAGDNCY TITTANSLWF RRDVIGGETI
QAGSVIWKSG LLVKAYNRAA EIPSANSFAI IIDEINRADI DKAFGEFFTI FSSTELSNWK
LPSSLVDEIK SYGNNVDEEA RRFLENYERL GDKPLTGLRI IATMNLIDFR NLFDIGSALT
RRFFVFQFEY PKGIEDISKL NLQVDKEIKD IIKCLREKFS SRPRGDLLEG FDTRSGFNIS
PASLKKAINI YNSTQNKDIH IFREILRSTL GTVNLKDLEN YNKYFEECEK NVNQGQTTN