Gene Msed_2190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2190 
Symbol 
ID5105411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2101742 
End bp2103610 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content43% 
IMG OID640508084 
ProductAAA ATPase 
Protein accessionYP_001192253 
Protein GI146304937 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0020112 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000768998 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTTTCT TGTTCGGAAG TAAAAATGAC GGCCAAAAGA CAGCACAAAA AACTAGGTAC 
TATCGTTTAG AAGGGATCCC ATTCTTCTTA CTGAGTCCTG AGGATCGTGA TATGAAAATG
AAAGAACTAT CGACGCTCTT ATCACAAGCA GAGCGTGGCT ATATCTACAT TAGCAGAGCT
CCAGGTGAGT ATGAGTTCGA AGGGACGAAA TTCCCCATCA TTACATCTTC TTTTAATTTA
ATCACAGAGA AAGAGATGGA CCTCGAAACA GCAGAGCCTC CCAAGAGACC TAAGATAAAG
AAGGAACATG CTAAGTACTT GCAAACAATG GAAGGATATG CTAGAGTTTT AGTATCATAT
AGATATAGTA CCAGAATTTA CGAGGGAGCT CTTGGAAGGG TCGACTTCAA ACAGATGAAC
CCAGAGCTCT TTGAAACGAT TATTCAGTTT CAAAAGATTC CCCAACTCTC TGTTAGAAAT
TATCTCAATT CGCTAGAAAT GAAGAAGATG AAGATGGCCA AGTATGCTAA TGTCTCAACT
GCCGTTGAAG AAATGTTAAC AAGTGGAACC CAACTTAAGA AGGACATAGA CGAGGAAGCT
GCGGAGCCCA TCAAGTTCCG ATATATCTTT GTAATCCATG CAAAGACCCT ACAAGAGCTC
GAAAGTCTAA CAAGGGAATT AATGAAGACC GCACAAGAAA ACGGGGTGCT CCTCGATACT
CCATGTTGCG CTCAGAGTGA GTTGTATAAT TTTGAAGGAG GTGTAGGGTA TCGCATAAGC
AGCAACGTCA GTCTCGCTAA GTTCTACCCC CTGGTCGGGT TTAACTTAGT CGAGCCTAAT
GGCATCTTCT TAGGAACGGA TGAAAAAGGA GCTCCAATAT CAATCAATCC ATATTTGGCG
ACCAACGGAA GACAAAATCC TCATTGGGCG ATAACAGGAA CAACTGGAGC AGGAAAGACT
ACTACAGGTG CCGCTTTGAT TGATAGGCTA CGCAGGGCCC ATGGCGAGAT ATATGTTATT
ATCATCGATC CGATGAGCAA CTATAACCGT TTCTTCACTA ATGAAGCGGA TTTGAATATC
GCATTCAAGG ACGGGGACTA TGTTGGACTG GATCCGGTAG CTTTAGCAGC TGAAGGAGTG
GTCTCAAGCG GTGACATAGC AGACTTCCTC ATCGAGTCAT ATGGAATTCC ATTGGAGCTC
CGAGGAATCC TAGTGTCCCA GTTGGAGCAA AACAGGAGTT TGAAGGACCT AACAGACAAT
CTAGAGAGCT TGGCTAGTAA GAAGTTTGCG ACTGAATATA GAAAATTAGA GAACTTCTTA
CTCAATATGA CTAGCGGAGC TGACAAGTAT GTTTTCACTG GAACTCCTCC TAACCTTAAG
GGCAAACGTT TCATCATACT AGGGCTTCAA ACTGAGGATA CGAGAAAGAA GAGATTAGCA
GCTACAATGT TAATGCTTTA TGCCTATTCG TTAATCAATA AACTCCCTCG TTCAGTCGAG
AAATTGATAT TGATAGATGA AGCTCACTTC CTATTCGAAT ACCAGAGTGT GGCGAAAATC
ATAGCGATAA TTTACAGGAC TGCTAGAGCC CTTAAAACCA GTATGATTAC TATGACACAG
CTCATTCAAC ATTTTAATAT GAATCAATAC AGCAAGGAAG CATGGCAACT TGCAGATAAC
AAACTGATTC TTAAACAGGA AAAGGAGGCT AAGGACGACT TAGTTAACTT GGCTCACCTG
AGCGAAGAGG AGGTCGATTA CGTGCTCAAA TCTTCAAGGG GTAGGGGGAT ATTGAGGACA
GGCGCAATAA CGACCCATAT TCAGGTCCAG CTCACGGAAG AAGAGAAACA GCGCTGGAGG
ACTGAGTAA
 
Protein sequence
MSFLFGSKND GQKTAQKTRY YRLEGIPFFL LSPEDRDMKM KELSTLLSQA ERGYIYISRA 
PGEYEFEGTK FPIITSSFNL ITEKEMDLET AEPPKRPKIK KEHAKYLQTM EGYARVLVSY
RYSTRIYEGA LGRVDFKQMN PELFETIIQF QKIPQLSVRN YLNSLEMKKM KMAKYANVST
AVEEMLTSGT QLKKDIDEEA AEPIKFRYIF VIHAKTLQEL ESLTRELMKT AQENGVLLDT
PCCAQSELYN FEGGVGYRIS SNVSLAKFYP LVGFNLVEPN GIFLGTDEKG APISINPYLA
TNGRQNPHWA ITGTTGAGKT TTGAALIDRL RRAHGEIYVI IIDPMSNYNR FFTNEADLNI
AFKDGDYVGL DPVALAAEGV VSSGDIADFL IESYGIPLEL RGILVSQLEQ NRSLKDLTDN
LESLASKKFA TEYRKLENFL LNMTSGADKY VFTGTPPNLK GKRFIILGLQ TEDTRKKRLA
ATMLMLYAYS LINKLPRSVE KLILIDEAHF LFEYQSVAKI IAIIYRTARA LKTSMITMTQ
LIQHFNMNQY SKEAWQLADN KLILKQEKEA KDDLVNLAHL SEEEVDYVLK SSRGRGILRT
GAITTHIQVQ LTEEEKQRWR TE