Gene Information |
Locus tag | Msed_0651 |
Symbol | |
ID | 5103811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 596670 |
End bp | 598517 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640506555 |
Product | type II secretion system protein |
Protein accession | YP_001190750 |
Protein GI | 146303434 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.219631 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCAT TAAAGGGACT CAGAAGGAAT GCGGGTAATA AGAAAGAAGC GAAGCCATCA ATCGGGATAA TGGGAAACTT TTACGAATTA GGAATAGTTA AGTCTATTGC TAGAAGTATA GAGAAAAAAT TACTTTTAGC AGGTCTCAGT ACCGATCCTC AGCTCTTTGC CGCTCAAATG TTCTTCTATC TGATGGTATC ATCAGTCTTT TCGGCAATTC TTGCCTTTCT AGGAGTTTAC GTCCTCGTTA AGCTCTATCT AGTATTCAGG GTAGCTAAGT TTGCAGTCGC AGGTCTAATG TTTATCATTT TCGCAGCCAT AATACCTCCA GTTACGTATC TTCTGTTGAA TGTGAATATA TCTCAGAACA TAGAAAACAG GAGAATAGGT ATAGATGCCG AAACTGCCGC CTTCTCAGCA GTTTTTACAA TTTTCCTCAG ATCAGGCTTG AGTCCTAGGA TACTTTTCGA TAGAATCTCC AGAACCATAG CGTTCAATTA CATAAATCAG GTCCTTCTTT ACGTCTCTAA GAGGATAAAC TTTCTTGGAG AAAACGTGGA GGACGCCTTG CTTCACGCAA TAAGAATCTC TCCCTCTAAG ATTCTGAATG ATTTCTTTGT TAGCTATGTT GCAGCAGTGA GAAGCGGTGC GCCTGTCTTA GATGCAGTAT CTGCTAAGGC TAAGGATATC CTTAAACAAC TCGAGCTGGG CGCTGCCTTG GCTGCGGATA GGCTTTCAGG AGTTGGCGAA ACTTACGTAA TCTGGCTAGC CTCGGGTTAC ATCACGTTCT TTCTGATATT ATTGTTGCAG GCTCTCTTCC CCAGCATAGT GGGAAGTTCT ATCCCCCTTA ACGCCTTCGG AGCTATCCTG ATTCTGATAT TGCCCCTAGT AGATGGGGTT TTCATTTTAA TGGCAGAACA GTCACAACTT AGGTTTCCAG AGAGAAAGAT CTCATCGTAC AAGACGTTCT ATATCTCACT AGGTGTGGGT CTTGTTGTAA TGTTTGTTCT TCTAGGCGTA ACTAAGCAAC TTATTCCCTT TGTTACACTT ACAGGGAATA TTAGCAATGT CACGCCAGTA ACTATCATCA TACTAATTGG TTTCCTGATA GCGGCTATTC CGCCTGCAAT TGTCACGTCT AGAGAGTTGA AAAAGGGAAC AGGCTATGAC CCTTACGTGG TTAACTTGCT TCGAGCAATT TCTGAAGGCA TAAGGGCAGG ATTGTCACCA GAGACGATAA TTAAGAACAT AAAAGAGAGC CAGGAGATGG GGAAATTATC GTATATATTG AAAAGGATTT ACGCATATAT CTCGCTAGGT TACCCGCTTA GAGATGCATT CCTAAAGGGA GCCGAGGAAA TAGTCGATTT TACGTCAAGG ATTTCTCTAG TTTCTATGGC AGATATGATT GATATAGGTA GCCTCACCCC AGAAAGCATA GAAAGTCTAG CTGAACAGGT AGAGACGCAG ATAAAGATAA AGAGAGAATA TGAAAGTAAG GTCAAGATAC TTCTCTATAC TCCCTACATT GGTGTAATTA TCTCCATAAT AGCGGTAAAC CTGCTTTCGG CGGCAATACT AGGCCTTATA ACGGGCAACG CATATGCTTT CTCCTCGGGT GCACTTGGAG AGGCTAGAGT CCTCCTACCA CAGGCTGTTT ACATTACTGC AATAGCCTCA ATGATAAACG CGTTCTTTGC AGGACTACTG GTAGGAAAGT TGGGAAAGGG TAAAGTAGCA ACAGGTTTCA TTCACGCAGC AATTATGGTA GCAATAACTG CAATATTAAT GATTATAATA 
GTTCATGTTC ACTTTACTTT CGGACCAACT GTACCTCCTT CAGGATAA
|
Protein sequence | MMALKGLRRN AGNKKEAKPS IGIMGNFYEL GIVKSIARSI EKKLLLAGLS TDPQLFAAQM FFYLMVSSVF SAILAFLGVY VLVKLYLVFR VAKFAVAGLM FIIFAAIIPP VTYLLLNVNI SQNIENRRIG IDAETAAFSA VFTIFLRSGL SPRILFDRIS RTIAFNYINQ VLLYVSKRIN FLGENVEDAL LHAIRISPSK ILNDFFVSYV AAVRSGAPVL DAVSAKAKDI LKQLELGAAL AADRLSGVGE TYVIWLASGY ITFFLILLLQ ALFPSIVGSS IPLNAFGAIL ILILPLVDGV FILMAEQSQL RFPERKISSY KTFYISLGVG LVVMFVLLGV TKQLIPFVTL TGNISNVTPV TIIILIGFLI AAIPPAIVTS RELKKGTGYD PYVVNLLRAI SEGIRAGLSP ETIIKNIKES QEMGKLSYIL KRIYAYISLG YPLRDAFLKG AEEIVDFTSR ISLVSMADMI DIGSLTPESI ESLAEQVETQ IKIKREYESK VKILLYTPYI GVIISIIAVN LLSAAILGLI TGNAYAFSSG ALGEARVLLP QAVYITAIAS MINAFFAGLL VGKLGKGKVA TGFIHAAIMV AITAILMIII VHVHFTFGPT VPPSG
|
| |
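The Start bp, End bp, Gene Length, and Protein Length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence shown ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Msed_0651.
length_bp = gene_length(596670, 598517)
print(length_bp)                  # 1848
print(protein_length(length_bp))  # 615
```

Both results match the Gene Length (1848 bp) and Protein Length (615 aa) reported in the record. The strand value does not affect this check, since length is independent of orientation.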