Gene Msed_1917 details

Gene Information

Locus tag: Msed_1917
Symbol:
ID: 5103304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1863896
End bp: 1865671
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 49%
IMG OID: 640507805
Product: V-type ATP synthase subunit A
Protein accession: YP_001191981
Protein GI: 146304665
COG category: [C] Energy production and conversion
COG ID: [COG1155] Archaeal/vacuolar-type H+-ATPase subunit A
TIGRFAM ID: [TIGR01043] ATP synthase archaeal, A subunit
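
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1863896, 1865671)
print(length_bp, protein_length(length_bp))  # 1776 591
```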


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000535851
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.804968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAGG GTAAAGTTGC AAGGGTGAAC GGGCCACTGG TTGTGGCCGA GGGTATGAAG 
GATGCGCAGA TGTTTGAAGT AGTCGAAGTG GGCGAGCCTA GGCTAGTGGG CGAAATAACA
AGGATTGAGG GTGACAGGGC TTACATACAG GTATATGAGG ACACGAGCGG AATCAGACCT
GGTGAGCCAG TTTTTGGGAG TGGGGCTCCC CTATCAGTGG AGTTAGGTCC AGGCCTTCTC
GGTCAACTGT TTGATGGTCT ACTTAGACCT CTAGGAGAGA TCAAGGAAAT TACTAAGTCA
CCTTTCATCA AGAGGGGGAT AAAGTTACCA ACACTAAATA GGGAGAAAGA ATGGCACTTC
GTCCCAAAGA TGAAGAAGGG AGATAAGGTA GAGCCAGGTG ATATTCTAGG AGTGGTACAG
GAGACAGGAC TTGTAGAGCA CAGAATACTC GTTCCTCCTT ACGTTCACGG AAAATTAAAG
GAGGTTGTTG CAGAGGGCGA TTACAAGGTT GAAGACAACG TCGCAATTGT GGACATGAAC
GGTGACGAAG TTCCAGTGAA AATGATGCAG AAGTGGCCAG TTAGGGTTCC AAGACCCTTC
AAGGAAAAGC TAGATCCAAG CCAGCCGTTA CTGACTGGGG TTAGAATACT TGATACTATA
TTCCCCATAG CAAAGGGAGG TACTGCGGCG ATTCCAGGGC CCTTCGGGAG TGGAAAGACA
GTAACTCTAC AAAGCCTATC CAAGTGGAGT GAGGCTAAGA TTGTCATTTT CGTTGGATGT
GGAGAGAGAG GAAACGAAAT GACTGACGAG CTAAGAAGCT TCCCCAAACT GAAGGATCCG
TGGACTGGAA GACCCCTTCT AGAGAGGACA GTGATGGTGG CAAACACCAG TAACATGCCC
GTAGCAGCTA GGGAAGCAAG CATTTACGTA GGTGTTACCC TAGCTGAGTA CTTCAGGGAC
CAGGGATACG ACGTGCTTAC CGTGGCTGAC TCGACCTCCA GATGGGCTGA GGCGCTAAGG
GACCTAGGAG GGAGAATGGA GGAGATGCCA GCCGAGGAAG GGTTCCCAAG CTACCTCTCC
TCCAGAATCG CTGAGTACTA TGAGAGGGCT GGAAGAGTTA GAACTCTAGG TAATCCAGAG
AGGTACGGCT CAGTTACCTT AGCGTCTGCA GTTTCACCAC CAGGCGGTGA CTTTACTGAA
CCAGTTACTA GCACAACACT TAGATTCGTC AGGGTCTTCT GGCCATTGGA CGTCTCCTTA
GCTCAGGCCA GACATTACCC TGCAATAAAC TGGCTCCAGG GGTTCTCATC CTACGTTGAC
CTGGTATCTG ACTGGTGGAT CAAGAACGTC GACCCAGATT GGAGAATTAT GAGGGACTTC
ATGGTTAGGA CACTTCTAAG AGAGGATGAG TTGAAACAGA TAGTGAGGCT AGTGGGTCCA
GAGTCCCTTG CAGAAAAGGA TAAGCTAACG TTGGAGGTTG CAAGGCTGAT CAAGGAAGCC
TTCCTGAAAC AGAACGCATA TGATGACATA GATGCCTTCT CGTCACCCCA GAAGCAGGCT
AGAATCATGA AGTTGATCTA CCACTATAAT CAATATGCAA CTAACGCCGT GGAAAGGGGA
ATACCAGTGA AAAAGATTGT GGACAAGATA ACTGTGGTTC CTGACATAAT TAGATCAAAG
GCTACTATCA AGAACAACGA GCTTCAGAAA TACGACGAAT TAGAGAACAA GTTAAAAGCT
CAGTTCGATG AACTCTTGAA GGAGGCTGGT GCTTAA
 
Protein sequence
MAQGKVARVN GPLVVAEGMK DAQMFEVVEV GEPRLVGEIT RIEGDRAYIQ VYEDTSGIRP 
GEPVFGSGAP LSVELGPGLL GQLFDGLLRP LGEIKEITKS PFIKRGIKLP TLNREKEWHF
VPKMKKGDKV EPGDILGVVQ ETGLVEHRIL VPPYVHGKLK EVVAEGDYKV EDNVAIVDMN
GDEVPVKMMQ KWPVRVPRPF KEKLDPSQPL LTGVRILDTI FPIAKGGTAA IPGPFGSGKT
VTLQSLSKWS EAKIVIFVGC GERGNEMTDE LRSFPKLKDP WTGRPLLERT VMVANTSNMP
VAAREASIYV GVTLAEYFRD QGYDVLTVAD STSRWAEALR DLGGRMEEMP AEEGFPSYLS
SRIAEYYERA GRVRTLGNPE RYGSVTLASA VSPPGGDFTE PVTSTTLRFV RVFWPLDVSL
AQARHYPAIN WLQGFSSYVD LVSDWWIKNV DPDWRIMRDF MVRTLLREDE LKQIVRLVGP
ESLAEKDKLT LEVARLIKEA FLKQNAYDDI DAFSSPQKQA RIMKLIYHYN QYATNAVERG
IPVKKIVDKI TVVPDIIRSK ATIKNNELQK YDELENKLKA QFDELLKEAG A
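
The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch of how such a listing could be collapsed into a plain string and its GC content computed. The helper names are hypothetical, and only the first line of the gene sequence is used as input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a grouped sequence listing and uppercase it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as listed in the record.
first_line = "ATGGCGCAGG GTAAAGTTGC AAGGGTGAAC GGGCCACTGG TTGTGGCCGA GGGTATGAAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full gene sequence rather than a single line, the same function would be expected to give a value close to the GC content listed in the record, up to rounding.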