Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1917 |
Symbol | |
ID | 5103304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1863896 |
End bp | 1865671 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507805 |
Product | V-type ATP synthase subunit A |
Protein accession | YP_001191981 |
Protein GI | 146304665 |
COG category | [C] Energy production and conversion |
COG ID | [COG1155] Archaeal/vacuolar-type H+-ATPase subunit A |
TIGRFAM ID | [TIGR01043] ATP synthase archaeal, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000535851 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.804968 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGG GTAAAGTTGC AAGGGTGAAC GGGCCACTGG TTGTGGCCGA GGGTATGAAG GATGCGCAGA TGTTTGAAGT AGTCGAAGTG GGCGAGCCTA GGCTAGTGGG CGAAATAACA AGGATTGAGG GTGACAGGGC TTACATACAG GTATATGAGG ACACGAGCGG AATCAGACCT GGTGAGCCAG TTTTTGGGAG TGGGGCTCCC CTATCAGTGG AGTTAGGTCC AGGCCTTCTC GGTCAACTGT TTGATGGTCT ACTTAGACCT CTAGGAGAGA TCAAGGAAAT TACTAAGTCA CCTTTCATCA AGAGGGGGAT AAAGTTACCA ACACTAAATA GGGAGAAAGA ATGGCACTTC GTCCCAAAGA TGAAGAAGGG AGATAAGGTA GAGCCAGGTG ATATTCTAGG AGTGGTACAG GAGACAGGAC TTGTAGAGCA CAGAATACTC GTTCCTCCTT ACGTTCACGG AAAATTAAAG GAGGTTGTTG CAGAGGGCGA TTACAAGGTT GAAGACAACG TCGCAATTGT GGACATGAAC GGTGACGAAG TTCCAGTGAA AATGATGCAG AAGTGGCCAG TTAGGGTTCC AAGACCCTTC AAGGAAAAGC TAGATCCAAG CCAGCCGTTA CTGACTGGGG TTAGAATACT TGATACTATA TTCCCCATAG CAAAGGGAGG TACTGCGGCG ATTCCAGGGC CCTTCGGGAG TGGAAAGACA GTAACTCTAC AAAGCCTATC CAAGTGGAGT GAGGCTAAGA TTGTCATTTT CGTTGGATGT GGAGAGAGAG GAAACGAAAT GACTGACGAG CTAAGAAGCT TCCCCAAACT GAAGGATCCG TGGACTGGAA GACCCCTTCT AGAGAGGACA GTGATGGTGG CAAACACCAG TAACATGCCC GTAGCAGCTA GGGAAGCAAG CATTTACGTA GGTGTTACCC TAGCTGAGTA CTTCAGGGAC CAGGGATACG ACGTGCTTAC CGTGGCTGAC TCGACCTCCA GATGGGCTGA GGCGCTAAGG GACCTAGGAG GGAGAATGGA GGAGATGCCA GCCGAGGAAG GGTTCCCAAG CTACCTCTCC TCCAGAATCG CTGAGTACTA TGAGAGGGCT GGAAGAGTTA GAACTCTAGG TAATCCAGAG AGGTACGGCT CAGTTACCTT AGCGTCTGCA GTTTCACCAC CAGGCGGTGA CTTTACTGAA CCAGTTACTA GCACAACACT TAGATTCGTC AGGGTCTTCT GGCCATTGGA CGTCTCCTTA GCTCAGGCCA GACATTACCC TGCAATAAAC TGGCTCCAGG GGTTCTCATC CTACGTTGAC CTGGTATCTG ACTGGTGGAT CAAGAACGTC GACCCAGATT GGAGAATTAT GAGGGACTTC ATGGTTAGGA CACTTCTAAG AGAGGATGAG TTGAAACAGA TAGTGAGGCT AGTGGGTCCA GAGTCCCTTG CAGAAAAGGA TAAGCTAACG TTGGAGGTTG CAAGGCTGAT CAAGGAAGCC TTCCTGAAAC AGAACGCATA TGATGACATA GATGCCTTCT CGTCACCCCA GAAGCAGGCT AGAATCATGA AGTTGATCTA CCACTATAAT CAATATGCAA CTAACGCCGT GGAAAGGGGA ATACCAGTGA AAAAGATTGT GGACAAGATA ACTGTGGTTC CTGACATAAT TAGATCAAAG GCTACTATCA AGAACAACGA GCTTCAGAAA TACGACGAAT TAGAGAACAA GTTAAAAGCT CAGTTCGATG AACTCTTGAA GGAGGCTGGT GCTTAA
|
Protein sequence | MAQGKVARVN GPLVVAEGMK DAQMFEVVEV GEPRLVGEIT RIEGDRAYIQ VYEDTSGIRP GEPVFGSGAP LSVELGPGLL GQLFDGLLRP LGEIKEITKS PFIKRGIKLP TLNREKEWHF VPKMKKGDKV EPGDILGVVQ ETGLVEHRIL VPPYVHGKLK EVVAEGDYKV EDNVAIVDMN GDEVPVKMMQ KWPVRVPRPF KEKLDPSQPL LTGVRILDTI FPIAKGGTAA IPGPFGSGKT VTLQSLSKWS EAKIVIFVGC GERGNEMTDE LRSFPKLKDP WTGRPLLERT VMVANTSNMP VAAREASIYV GVTLAEYFRD QGYDVLTVAD STSRWAEALR DLGGRMEEMP AEEGFPSYLS SRIAEYYERA GRVRTLGNPE RYGSVTLASA VSPPGGDFTE PVTSTTLRFV RVFWPLDVSL AQARHYPAIN WLQGFSSYVD LVSDWWIKNV DPDWRIMRDF MVRTLLREDE LKQIVRLVGP ESLAEKDKLT LEVARLIKEA FLKQNAYDDI DAFSSPQKQA RIMKLIYHYN QYATNAVERG IPVKKIVDKI TVVPDIIRSK ATIKNNELQK YDELENKLKA QFDELLKEAG A
|
| |