Gene Msed_0162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0162 
Symbol 
ID5105015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp130836 
End bp132611 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content40% 
IMG OID640506065 
Producthypothetical protein 
Protein accessionYP_001190263 
Protein GI146302947 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.222598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0745617 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACAT TTGACAGATA TACGCCATAC ATAATTCTCT TGCTTCCAAT AATCTTGGCA 
TTAGTATTTA GGAACCCTTT TCTCCTCTCC CTATCTGGTA TATTATTGGT TCTAATGCTA
TATACTGGCT CAATAAAGAT TCCTGTAAAT ATTAGTTTTC CAATAAAGAG AGGAGATGGA
ATAGCGCATT CGATAGAAAA GGGGTTGATT ATTCGAGGTA ATTCTGCTAT TGGGGTTGTC
ATCGTAGACG ATATCCCCTA TGATTATAGG GATCTTTCGG ACTCATCATT AAGGGCCTCG
ATAAATGCCT TTCATAAAAT AACCAATATA GGGGAACATG TGGATATTAT TTTCAGGAAA
AAATATATCG ATCAGAGAAT ATATACCGAA AGATTACTTA ATAAATTACA AAATCTCAGA
ATAATAATAG AAAATGACCC ATCTAATGCC AAGGCAAAGA AGGAGATGGA ACTTCTTCAG
AGTATTTTAG ATAGGTTGGA GCAAGGAGAG AAACCTTTCT CGTATCAGGT AGTTCTGTTA
GTCCATGGAA AGGATAAGCA AGAGGCGAGA AGTCTCGGTG AGGTTCTCAT AAGGGGTTTA
GAGAGCCTCA ATATCAAATC GAGGTTCGCC ACAGTCAAGG AGATAGAGGA TATCCTATCT
CTTAACCTTA CTAGGTTTAG GAAAGAGGGA TTGCCTTCTC AAATACCATT TCTAACTCCA
TTCTCCCTAG AGAAGATGCC GGTGGTCGAG AAGTGGAGCG ATGGAGTATA TCTAGGAAAG
GATATGGAAA GGCAAGTTCC GGTATTCTGG AATGTGGAAA AGTCCGAGAA CCCTCACATC
ATGATAATAG GTCCAACTGG ATCTGGAAAA ACTGAGTTAC TGATTTGGCT CGGGAGCCTA
ATGTCCCTAC AATATTCAGT TCCAGTTGTA TTTTTTGACG TAAAGGGAGA TATTAGGACG
AGGTTGAGGA GATATGGCTT CAACTTCAAG GTTCTGAATC CTCTTTTCTA TTCACTGAAG
TTACTGGACT TTCCCTACGT GGCCCCGTCA ATAAAACCAT TGTTTATAGA AAAAATAGTA
GGAGTTTCAT TTAAGCTAAA TCGCGAAGAA AGGGCAATAC TTTTCAATGT GTTGAATAGG
CATTTGAAGG AAACCCATGG GAGGCCTGAG TGGAGAACCA TACTAAGGTA TGAGGAAATA
GGAGACAGAT ATTCCATAAG GAGATCCTTG GAACTAGTAG AGTCTTTTGA CTCTGACGGT
CCGTTTATCT TGGACGGTAT GACGCATGGA ATAAACGTAG TGGACCTAAC TCAATTAAAG
GATGAGACTC TGAGGAGATT TGTCATATAT TCCTTTATAT CAATGCTTTA TGCATACTAC
TCGTCCGATG CCGATGTCGG TCTCAGGGTA GGTTTGGTAG TGGATGAGGC GTGGACAATT
CTTAAGGACG ATGACGAATA CGGGATAATA GGTGATCTAA TAAAGAGAGG CAGGGGTCAT
GGTATATCCT TATTAATGGC TACTCAGAAC ATCCAGGATC TTGGACAAAA CGCGGACATA
TTCATGGATA ACATAGGTAC CCTTTGCTTC ATGAATAATG GGGATAAGAA TTTCTGGAAG
GATGTGGTTA AGAGGTATTC CAATATTCTG GATGGTGAAG TTGAGAATAA ACTGGCATTC
CTAGGTAGAG GGGAGATGTT GGTCAGATTC CTGGGAGATC CAAGGCCAAT CCTTGTGGCC
CACAAACCGT TAACTGGAAG CTCTTTCCAG AATTGA
 
Protein sequence
METFDRYTPY IILLLPIILA LVFRNPFLLS LSGILLVLML YTGSIKIPVN ISFPIKRGDG 
IAHSIEKGLI IRGNSAIGVV IVDDIPYDYR DLSDSSLRAS INAFHKITNI GEHVDIIFRK
KYIDQRIYTE RLLNKLQNLR IIIENDPSNA KAKKEMELLQ SILDRLEQGE KPFSYQVVLL
VHGKDKQEAR SLGEVLIRGL ESLNIKSRFA TVKEIEDILS LNLTRFRKEG LPSQIPFLTP
FSLEKMPVVE KWSDGVYLGK DMERQVPVFW NVEKSENPHI MIIGPTGSGK TELLIWLGSL
MSLQYSVPVV FFDVKGDIRT RLRRYGFNFK VLNPLFYSLK LLDFPYVAPS IKPLFIEKIV
GVSFKLNREE RAILFNVLNR HLKETHGRPE WRTILRYEEI GDRYSIRRSL ELVESFDSDG
PFILDGMTHG INVVDLTQLK DETLRRFVIY SFISMLYAYY SSDADVGLRV GLVVDEAWTI
LKDDDEYGII GDLIKRGRGH GISLLMATQN IQDLGQNADI FMDNIGTLCF MNNGDKNFWK
DVVKRYSNIL DGEVENKLAF LGRGEMLVRF LGDPRPILVA HKPLTGSSFQ N