Gene Msed_1918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1918 
Symbol 
ID5103305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1865671 
End bp1867071 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content47% 
IMG OID640507806 
ProductV-type ATP synthase subunit B 
Protein accessionYP_001191982 
Protein GI146304666 
COG category[C] Energy production and conversion 
COG ID[COG1156] Archaeal/vacuolar-type H+-ATPase subunit B 
TIGRFAM ID[TIGR01041] ATP synthase archaeal, B subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0383296 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCAT CCATGAATGT AAGGGAATAT TCGAACATTT CAATGATTAA GGGACCCCTT 
CTGATGGTTC AGGGAGTGGC AGACTCAGCT TATAACGAAC TTGTAGAGGT CGAGATGCCC
AATGGGGAGA GAAGGAGAGG GATAGTTGTA GATAGCCAGA AGGGAATTTC CATAGTACAG
GTCTTTGAGG GAACCAGGGG AATATCGCCA GTAGGAACCA CCGTGAGGTT CCTAGGTAGA
GGATTAGAGG TTAAGATCTC TGAGGAAATG CTTGGCAGAA TTTTCAATCC CCTGGGAGAT
CCCCTTGACA ATGGTCCCAT GGTGATAAAG GGGGAGAAGA GGGACATCAA TGGCGAACCC
TTGAACCCTG CTATCAGGGA TTACCCAGAG GAGTTCATTC AGACTGGTAT ATCAGCAATA
GATGGTCTAA ATTCCCTTCT CAGGGGTCAG AAGCTTCCCA TCTTTAGCGG AAGCGGTTTA
CCCGCGAACA TACTTGCTGC ACAGATAGCT AAACAGGCCA CAGTGAGAGG AGAGGAGAGC
AACTTTGCCG TTGTATTCGG TGCCATCGGT GTAAGATATG ACGAGGCACT GTTCTTCAGG
AAGTTCTTTG AGGAAACCGG GGCAATCAAT AGGGTGGCCC TAATCATGAG TTTAGCTAAC
GAACCACCAG TGATGAAGAC ACTAACTCCC AAGACAGCAC TTACCCTCGC GGAATATTTG
GCCTTTGAGC AGGACATGCA CGTGTTGGCA ATCCTTATCG ATATGACAAA TTACTGTGAG
GCCCTTAGGG AGATCAGCGC ATCAAAGGAG GAAGTCCCCG GTAGAGGTGG TTACCCAGGT
TACATGTACA CTGACCTTGC CCAGACCTAC GAGAGAGCAG GAAAAGTGAT AGGAAAGAAG
GGTTCCATTA CTCAGATGCC CATTCTCACA ATGCCAAACG ACGACATTAC CCATCCAATT
CCAGACCTTA CAGGTTATAT CACGGAGGGG CAAATTACCT TAGACAGAAG CCTATACAAC
AAGGGTATCT ATCCACCAAT TAACGTCCTC ATGAGTTTGT CAAGGCTTGC CAAGGACGGA
ATAGGTGAGG GTAAGACCAG GGATGATCAC AAGGACTTAT CTAACCAGTT GTTTGCGGCC
TACGCAAAAG CAGTAGATAC TAGGGGATTA GCTGCAATCA TTGGAGAGGA TAGCCTATCT
GACACAGACA AGAAGTACCT AATGTTTGGG GATGCCTTTG AGAGGAAGTT CGTAAGTCAA
GGAGTGAACG AGAACAGGGA TATAGAGACG ACCCTAGATA TTGGGTGGGA GGTACTCTCC
ATTTTGCCAG AAAGGGAGCT CACGAATGTG AAGGTTGACT ACATCAAGAA GTATCATCCA
GCCTACCGTG GTAAGAAATG A
 
Protein sequence
MESSMNVREY SNISMIKGPL LMVQGVADSA YNELVEVEMP NGERRRGIVV DSQKGISIVQ 
VFEGTRGISP VGTTVRFLGR GLEVKISEEM LGRIFNPLGD PLDNGPMVIK GEKRDINGEP
LNPAIRDYPE EFIQTGISAI DGLNSLLRGQ KLPIFSGSGL PANILAAQIA KQATVRGEES
NFAVVFGAIG VRYDEALFFR KFFEETGAIN RVALIMSLAN EPPVMKTLTP KTALTLAEYL
AFEQDMHVLA ILIDMTNYCE ALREISASKE EVPGRGGYPG YMYTDLAQTY ERAGKVIGKK
GSITQMPILT MPNDDITHPI PDLTGYITEG QITLDRSLYN KGIYPPINVL MSLSRLAKDG
IGEGKTRDDH KDLSNQLFAA YAKAVDTRGL AAIIGEDSLS DTDKKYLMFG DAFERKFVSQ
GVNENRDIET TLDIGWEVLS ILPERELTNV KVDYIKKYHP AYRGKK