Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2104 |
Symbol | |
ID | 5104398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2025982 |
End bp | 2027412 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640507994 |
Product | type II secretion system protein E |
Protein accession | YP_001192168 |
Protein GI | 146304852 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.609091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAA CACTCCTTGA GGAATATAAT GTAATGGGAA GTAAGGTCTC CATCATCGAC GAGGATGGAC AGGGACTCTA CATTGTTGAG GATCCAAGGA TTTCCCGTGA GGAGTCATCG GCCATATCCT CCATAATGGA TGAGATTTAC TTTTCCTCAA CATCCGTCTC AGACGCGGAG AATAAGCTAC TGGAAATACT GAAAACGAAA AATATATCTC CAGAATTAAA TGAAAAGATA CTGAATATAT TTAAAAAGAG AATACTTTAT GATGAAATAA CTGTGCCTGT GATGGATCCT GAGGTGGAGG AAATTGAGTG TATGGGTCCA GGCTTACCGC TTACCGTAAT TCATAGAAAA TATTCCAATT ATATGAGGTT ATACACCAAC ATAGTTTTGC CCAAGGAGGA AGATATACTT AGGATCATAG AAAAACTTGC AATAAAATCA AGTAAATCCG TTAATATTGC TAGACCCTAT CTCGAATTTT CTCTCCCAGA AGGTCACAGG GTTGCTGCCA CCGTATCCAA TGAGATCTCG AACCCAGGTT CCACATTCGA TATAAGGAAA TTCCCTGTGT CTCCCATATC CCTAATTAAG CTGATTAAAG GTAACTCTCT CAGCGTCGAG ATTGCATCCT ATCTATGGTT CTTGCTAGAT TACAAACCCT TTTATCTGAT AGTGGGGTCC ACAGGTTCAG GGAAAACCAC ATTTCTGAAT GCCTTGCTAA ACTTCGCTAA TCCAGACGCA AAGATCCTAA GTATTGAGGA TACGCCTGAG CTTAACCTTA GTGGGAAAAA TTGGATAAGA TTCTTTTCAA GGCAATCCCT AGTCTCGACG TATGACGTCA CCATTGGTGA GCTATCAAGA CTAGCCTTAA GATATAGGCC AGACTACCTG ATCATAGGTG AAGTGAGGGG GAAGGAGATT GAGACCCTCA TTCACGCCTC ATCCTCGGGT CACGCGTCAC TAAGCACTTT TCATGGGGGT AAACCCAGTG ACGTCGTAAC TAGGATTGTA AGCTTGCTTC CTAAGGAGCT TGCCATAATG TTCCTGAATA ATGTGTGGGG CTTCATTCTA GTGGGAAGGA GAGTCGATGA AAACGGTAGA ATAATTAAGG CTATAAATGC AATATATGAG ACGCAAAAGA TCAACGGCAA GACTAAATTC AGAAAAATAG TGTGGTGGTC GTTCAAGGAT AAAATATATA AGCCAAATAA TTTCAGTGAG CTATTAAAAA TTTCCACTAA ATTAAAGTTT ATCTCGGATT CGTATGGGTT AAGCAAGTCC GATATCCTTG ACGAACTGGA GAGAAGGAAA AATGTGATAG AGAAGTTGAT AGCTGAGAAC ATTGCAGACA ACGAGATGAT TCAAGCAGAG ATAGCCAAAT TCTACAGGGA GAGGGGTCTA AATGTCCAAA GTACGATTTG A
|
Protein sequence | MSRTLLEEYN VMGSKVSIID EDGQGLYIVE DPRISREESS AISSIMDEIY FSSTSVSDAE NKLLEILKTK NISPELNEKI LNIFKKRILY DEITVPVMDP EVEEIECMGP GLPLTVIHRK YSNYMRLYTN IVLPKEEDIL RIIEKLAIKS SKSVNIARPY LEFSLPEGHR VAATVSNEIS NPGSTFDIRK FPVSPISLIK LIKGNSLSVE IASYLWFLLD YKPFYLIVGS TGSGKTTFLN ALLNFANPDA KILSIEDTPE LNLSGKNWIR FFSRQSLVST YDVTIGELSR LALRYRPDYL IIGEVRGKEI ETLIHASSSG HASLSTFHGG KPSDVVTRIV SLLPKELAIM FLNNVWGFIL VGRRVDENGR IIKAINAIYE TQKINGKTKF RKIVWWSFKD KIYKPNNFSE LLKISTKLKF ISDSYGLSKS DILDELERRK NVIEKLIAEN IADNEMIQAE IAKFYRERGL NVQSTI
|
| |