Gene Msed_1914 details


Gene Information

Locus tag: Msed_1914
Symbol:
ID: 5103301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1860823
End bp: 1862940
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 48%
IMG OID: 640507802
Product: V-type ATP synthase subunit I
Protein accession: YP_001191978
Protein GI: 146304662
COG category: [C] Energy production and conversion
COG ID: [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I
TIGRFAM ID: [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes)
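
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1860823
end_bp = 1862940
gene_length_bp = 2118
protein_length_aa = 705

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("codon count matches protein length:",
      remainder == 0 and coding_codons == protein_length_aa)
```

Under these assumptions both checks agree with the listed values: the span is 2118 bp, which corresponds to 706 codons, one of which is the stop codon.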


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000116977
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.804968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATCTTTC CTGAAAACAT GGTAAAAGTA GAAATCATAT CGCCAGTTAA CCAAAAAGAC 
AAAGTAACCA CGGAGATACT CAAGTTCGGT AAAATGGAGG TAATAGAGCC AGCCCATCCT
ATCTCCAACT CGAGGTATGA AGATGCTAGG AGGGAATTGA CAGCAATCCA AGAACACGTG
AATAAGGTCA AGCTGATCAT GGAAATAGCT GGAACACCGC TGGAACCTAG GGGCAAGATG
AAGGTTGACT CCACATGGAC TGACGTGGCC AAGAAGCTTT CAGAGGAGGC ATCGCTTGAG
GAGTTCAGAT ACAAGGAGCT CCTCGAGGAA ATAGGGAAAC TGAAGGGTGA GATAGACCTC
TACCAGGCAC AACTCAGGGA ATTGGAACCA TTCAGCGGGA TTACCACCGA TCTATCCACC
TTGTACTCGC TAGAGACGTT GGATGTAGCA CTAGTAGTTC TCACGGAAGA CCAGTTAAAC
AAGGTGAAGT CTAACAAGAA CGTTGTGGTG GAGACAACTC CGCTGAAGGA TGGTAAATAT
GCCTCCGTGC TTATCTCAAG GAGAAACGTT GTGTCACTAG ATCAGGTACT TAAGGAACTG
GGAGTCAGGA GATTTGAGAC TAACGAGGGG AAATCTCCCT ACGAAGTGTA TAAGGTGTTA
AGCGAAAGGA TTGGCGAGCT TAGGAAAATC CTTGAGGAGA CCAGAAGTAG CCTCATCAAG
AAGATAAGGG AAAATGAGAG GGAGTTCAGC GAGTTATACG GAAAGCTTCT AACGGCGAGG
GACGCGCTTT CCCTCCTTTC CAAGGGGAGA ATCTCAGACC ACTTTTTCCA GATAGAGGGA
TACGTTCCCG AGAGGGACTG GAAAAAGCTG AAATCTGCCT TAGATCAGGT TGCTGTTATC
GAAAGTGCTA CCCCAAGGAG GTTCGGAGAG GAAGAGGAGC CCCCTACGTA CATCCGACTT
CCCAAGAGTA TCTCGGCCAT AGAGTCTGTT GTGGAAATCT ACGGTACACC GTCGTACTGG
GAGATATCAC CGCTGGTGTT TCTAGTCATT ACCTTCCCTC TCCTGTTCGG GCTAATGTTT
CCAGATGTTG GGAATGCCCT TGTGCTCCTG ATCTTTGCCA TCTTCTTCCA TAGATACGGC
GTCAAGAAGG GAAGCAACAA CATCAAGAAC CTTTCCCTCA TTCTAGGTTA CTCCTCAGTC
GTTGCCATGA TTACTGGTTT CCTAGCTAGA GACTTCTTCG GACCACTTCC CGTGGGAGGG
CTCAAGGAGA TGGGTATAGC CAACGTGAGC GGTCCCCTGG ATAGCGTATG GCCAGTACCA
GTGAGCGTAA CGGAGGCGTT ATCCCCACTA CTTCCATTTG GTGAATACAG CACTAGCACC
TCGATTGAAC ACACAATAAT CCTTTCGATC CTCCTGGGAG CAATAGCTCT CTTCGTAAGC
TCCCTTCTGG GAGTCATAAA TGCAGTGAAG AAGAGAGATC CTGAATTTCT TGTCTTCGAG
AAATTACCAC TCCTGGTGCT ATACACAGTT CCCCTGGTGA TATTTGGATA CGGTGTAACG
GATCCTTCCC ATTACTTCAG TACTGTGGGA GTCCTCCTAG GTGATATCCT CCAGAACTTG
CTAAGCGGGA TACATCCTGA CCTGAGCAAT CCTACGTCGG CCTTGGCGTA TGGGTTAATT
GTATGGGTAG AGATAGGACT AATATACAAC TGGATATCTA AGGCTATAAT CCTGAAGAGA
CACGATCACG CATCCACGGG AAGTGCCCTT GCCATGGGAT TCATTGAAGG AGGTTTCGAG
GCTGCTATCC TACTGTTATC CAACACAATC TCTTTCATAA GAATTCTAGT GTTTGCACTG
GCTCACTACT ACCTTCTGTA CGCGTTCTCA TACATGGCCT ATCTAGTGGT GGGGCATCCA
AGCCTCCTAG CCTTGGGGAC TAACCCGGCC TCGATCGTGA TCCTGATAAT AGGAAACCTC
CTTGCAATAG GACTAGAGGG ACTAGTGGTG TTCATACAGG ACATGAGGCT CCACTTCTAC
GAGATGTTCA GCAAGTTCTA TGAGGGAAGA GGAAAGAAGT TCGAACCCCT AGTAGCGTCA
GTGGAACTGG CTACTTAA
 
Protein sequence
MIFPENMVKV EIISPVNQKD KVTTEILKFG KMEVIEPAHP ISNSRYEDAR RELTAIQEHV 
NKVKLIMEIA GTPLEPRGKM KVDSTWTDVA KKLSEEASLE EFRYKELLEE IGKLKGEIDL
YQAQLRELEP FSGITTDLST LYSLETLDVA LVVLTEDQLN KVKSNKNVVV ETTPLKDGKY
ASVLISRRNV VSLDQVLKEL GVRRFETNEG KSPYEVYKVL SERIGELRKI LEETRSSLIK
KIRENEREFS ELYGKLLTAR DALSLLSKGR ISDHFFQIEG YVPERDWKKL KSALDQVAVI
ESATPRRFGE EEEPPTYIRL PKSISAIESV VEIYGTPSYW EISPLVFLVI TFPLLFGLMF
PDVGNALVLL IFAIFFHRYG VKKGSNNIKN LSLILGYSSV VAMITGFLAR DFFGPLPVGG
LKEMGIANVS GPLDSVWPVP VSVTEALSPL LPFGEYSTST SIEHTIILSI LLGAIALFVS
SLLGVINAVK KRDPEFLVFE KLPLLVLYTV PLVIFGYGVT DPSHYFSTVG VLLGDILQNL
LSGIHPDLSN PTSALAYGLI VWVEIGLIYN WISKAIILKR HDHASTGSAL AMGFIEGGFE
AAILLLSNTI SFIRILVFAL AHYYLLYAFS YMAYLVVGHP SLLALGTNPA SIVILIIGNL
LAIGLEGLVV FIQDMRLHFY EMFSKFYEGR GKKFEPLVAS VELAT