Gene Msed_2073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2073 
Symbol 
ID5105053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1990376 
End bp1991869 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content42% 
IMG OID640507963 
ProductATPase-like protein 
Protein accessionYP_001192137 
Protein GI146304821 
COG category[R] General function prediction only 
COG ID[COG1106] Predicted ATPases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.672394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.790362 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATACTGT TGAGGCTAGC TGAGTTCTAC GCTAATAACT TCAGGAGTTT GGAGTCAGTT 
GAATTAAGGG ATGTTGGAGG CTTCAATGTG ATTGTGGGGT TTAACGGTTA CGGCAAAACT
AACCTGCTGA CGGCAATCTA CCTATACATA AAGAACCTGT CAGCGGGTAT TGAGAAGAGG
TCAATAGAAG ATAAGAACCA GGAATATCTG TTAATGTGGA ACAGTTATGA TACGAGCAAA
CCCATTCTTT TAGGAGGCAG GTTACAGTTT AGTGAGAAGG AAGTTGAGAA AGTCGTGGGA
AAGAGTATGC CCATGAACAT CGACCTAGTG AACAAGTTGA GGTACATTAA TGGATACCTT
GAGTGGGACC TGGACCTCAT CAGAATAAAT GGATCTGTCC CATCCAAGGA CGAAATAGAT
CAGGCCAAAA AGCTGTTGGA CTACGCCTCA AGTCAAGTTG AGTATGTTCC CATTTTCGAT
CAGAACTACT TTGATGACGT ACTCAACAGA ATTATTGGAC TGAACAGATC GCCCATCAGC
TTGAGAAAGT ATTGGTATGA CTTTGCGAAC CTGGTGAGTA ATACTATTCC AGAGGTAAAG
GGAATAGAGA TCTGGGATTC AAAGAAGCTA GTTCTAAACG TGTATAATTT GCCAATCTAC
ATTGACTTAG CTGCTAGTGG ATTTCAGAGA ATAATTCTTA TCCTTTTCGT AATATGGTTG
AGCGGAAACA AGATACTCCT GATCGAGGAA CCAGAGGTGA ACATGCATCC CATCATGCAG
TATAAGATGG CGAAGCTCCT GAAGTCATGG ACTGAAAGCG ATGTTCTACA GGTTTTCATG
ACAACACACT CTCCCTTTAT TGTATCCTCC GATGTGGATA GCTTCATAGT TCTGAGAAGG
GGTCAGACCG CGTCTAAGGC TGTCAACTTC CAGCCCACAG AGGATGTAAA ATCGGCCTTC
TCCATTCTTA ACGTAAATAT CAGTGATCTT CTCTTTAACA AGACTATTAT AGTAACGAGT
GAAATGGCCG AGCCAAACGT AATCCTGAAT TGGCTCAGGA AACTGAACGT GAATCCAGAG
TACAACGGAA TTGTTATATA CACGGTGAGG AACGAACTGG AGTTGCAGAC CTGGCTTAAG
TTAAGGAACA TGCTGAAACT TGACATGCTA TTCCTGGGCC TTTGTGACAA AATAGACATA
GAGTTAAAGG ACTCCTGTCT TCCCCTTACC AAGGAGGTAG AGTCATTTTA CAGTAAGAGT
GGAATGTTAG AGGCACTCAA GAGAATAGGC ATTTACCCAG ATGAAAAGGA AATGAGGGAT
CTCTCTCGGG AAGACAACGC CAGATGGTTG ATAAACGTTC TTAAGAGGAG AGGATTGGAT
TACGGTACCA TGAGATCGTC TATAGGTGAC ATAATATCTA GAATAGATTC CATTGAGATC
CCCAAGGAGA TGGAAATCCT CGTGAATAAA ATTAAAACCG CGCAGGTTAT CTAG
 
Protein sequence
MILLRLAEFY ANNFRSLESV ELRDVGGFNV IVGFNGYGKT NLLTAIYLYI KNLSAGIEKR 
SIEDKNQEYL LMWNSYDTSK PILLGGRLQF SEKEVEKVVG KSMPMNIDLV NKLRYINGYL
EWDLDLIRIN GSVPSKDEID QAKKLLDYAS SQVEYVPIFD QNYFDDVLNR IIGLNRSPIS
LRKYWYDFAN LVSNTIPEVK GIEIWDSKKL VLNVYNLPIY IDLAASGFQR IILILFVIWL
SGNKILLIEE PEVNMHPIMQ YKMAKLLKSW TESDVLQVFM TTHSPFIVSS DVDSFIVLRR
GQTASKAVNF QPTEDVKSAF SILNVNISDL LFNKTIIVTS EMAEPNVILN WLRKLNVNPE
YNGIVIYTVR NELELQTWLK LRNMLKLDML FLGLCDKIDI ELKDSCLPLT KEVESFYSKS
GMLEALKRIG IYPDEKEMRD LSREDNARWL INVLKRRGLD YGTMRSSIGD IISRIDSIEI
PKEMEILVNK IKTAQVI