Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0650 |
Symbol | |
ID | 5103810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 594885 |
End bp | 596603 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506554 |
Product | type II secretion system protein E |
Protein accession | YP_001190749 |
Protein GI | 146303433 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.200962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTAA GTCTACCGTT GCCATTCCGT AGCAAGCCTT CTCCTAAGAC CGAGCAGTTA ACCTTTGGGG ATCTACCTAT AAGCCTCTAT CCCATAACAT TGCCCTACGA GGAATTACCT GAGATTATGT CTGAGTATGA AGTTGACATG ATTAGCTTAA TGCCCCGATC AGTTAAATCA TCCTTTGAGT CTAACAACGT CGAGTTGTCA ATAGCTAATC CTCACACATT CATTGTCTTC GACCAGGACA AGGGGATATT CAGGTATGTC TTGGTAGAGC CACCAATAGA TCAAAACATT TTCAGTGCTT ACATTTATCT GATCAAAGAG ATAGAAAGGT CACTATTAAG CAAGGAAGAC ACAGTGGATA TAGCTAAAAT CCTGCTTCAG GCCAATGCCA AAATGCCCAG CATGGGCCTT GTCCAAGGTC AGGTTGGTGG AATAACCAAG CTTAGTACTA AGGGAAAAGT GGCCCTTTAT TACCTCTTAA GAAACATGTT TGGTTATAAC GTTCTTACTC CCCTCTTAGC TGATAACAAG GTAGAGGACA TCTCATGTAG CGGAATTGGT TTACCTGTAT ACGTTTATCA CAGGGAGTAT GATTACGTTC CCACAAATCT TACCATAACT GAGTCGATGA AGGTATTGGA CAAGGAGATT AGCGGATCAG AGTTATTGGA TCAAATAGTG TTGAGGCTGA TCTCCCTATC AGGTAAAACC GTGTCTATAG CTACTCCCAT AGCTGACGGT ATATTGCCGA AGGGAGACAG AATTGCAGCG ACCTTTAGAT ATGAAGTTAG TGCTAGAGGA TCCAGCTTCG TTATAAGGAG ATTCAGCGAA AGGCCGATAA CTATTCTAGA CCTGATAAAT AGTGGCGTCA TGAGTCCTGA GACTGCAGCC TACCTGTGGT ATTCCATCGA TCTCAGAATG TCGTTCATGG TTATAGGAGT CACAGGTGCT GGAAAGACCA CGGTGTTGGG GTCCATCCTT AACCTGGCAA AGGAATCCCT GAAGATAGTG TCCATTGAGG ATATCCCGGA GATAAAGCTG GCACAGGAGA ACTGGGTTCA GCTTTACGCC AGGCAGGCGT ACGGCGAGTC TAGCAAGGAA ATAACCCTCA TGGACCTGCT GAAGCTGTCC CTGAGGTACA GGCCTGACAT AATAGTGGTC GGTGAGATAA GGGGTGCAGA GTCCTATATC CTGTTCCAGG CCCTCTCCAC GGGTCACGGT GGTGCCACGA CTTTCCACGC CCACGACTCA GAAAGCGCGG TCAAGAGGTT AATGAACCCT CCACTCAATA TACCTGCAGA GTGGATACCC ATGAACAACA TTATCATTAG CGTGAGAAGA TTGCCTGTAC TCATAGGAGA TAAGATACAG TTAAAGAGAA GGGTCGTAGC CATAGACGAA CTGGTGACTG CCTCAGATTT CAGGAGGGTA GTTAATTGGG ATCCGAAAGT TGATAACCAC GTGGTAGACC TGGATAACGC TAAGGTTCTG AGAAACAGAC TGGAGGAGTC TGGAAGATCA CTTGAGGAGG TGAAAGAAGA GATACAGAGA AGGGCACTTT ACCTGAGGCT AATGGCTACC TCCAAGGACA TCGTTCAGAG CCCAGAGAGT TATAAGATGG TCAAGAAGTA CATTATAAAG TATAGCCTGA GACCTGATGA GGCCATGAAA GAAGTGGCCA GGATGTCCTC AATAAAGGTC ACGGTATGA
|
Protein sequence | MNLSLPLPFR SKPSPKTEQL TFGDLPISLY PITLPYEELP EIMSEYEVDM ISLMPRSVKS SFESNNVELS IANPHTFIVF DQDKGIFRYV LVEPPIDQNI FSAYIYLIKE IERSLLSKED TVDIAKILLQ ANAKMPSMGL VQGQVGGITK LSTKGKVALY YLLRNMFGYN VLTPLLADNK VEDISCSGIG LPVYVYHREY DYVPTNLTIT ESMKVLDKEI SGSELLDQIV LRLISLSGKT VSIATPIADG ILPKGDRIAA TFRYEVSARG SSFVIRRFSE RPITILDLIN SGVMSPETAA YLWYSIDLRM SFMVIGVTGA GKTTVLGSIL NLAKESLKIV SIEDIPEIKL AQENWVQLYA RQAYGESSKE ITLMDLLKLS LRYRPDIIVV GEIRGAESYI LFQALSTGHG GATTFHAHDS ESAVKRLMNP PLNIPAEWIP MNNIIISVRR LPVLIGDKIQ LKRRVVAIDE LVTASDFRRV VNWDPKVDNH VVDLDNAKVL RNRLEESGRS LEEVKEEIQR RALYLRLMAT SKDIVQSPES YKMVKKYIIK YSLRPDEAMK EVARMSSIKV TV
|
| |