Gene Msed_2104 details

Gene Information

Locus tag: Msed_2104
Symbol:
ID: 5104398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 2025982
End bp: 2027412
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 41%
IMG OID: 640507994
Product: type II secretion system protein E
Protein accession: YP_001192168
Protein GI: 146304852
COG category: [U] Intracellular trafficking, secretion, and vesicular transport; [N] Cell motility
COG ID: [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis
TIGRFAM ID:
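
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2025982
END_BP = 2027412
GENE_LENGTH_BP = 1431
PROTEIN_LENGTH_AA = 476


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1431
print(coding_codons(GENE_LENGTH_BP))   # 476
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of the record.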


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.609091
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGAA CACTCCTTGA GGAATATAAT GTAATGGGAA GTAAGGTCTC CATCATCGAC 
GAGGATGGAC AGGGACTCTA CATTGTTGAG GATCCAAGGA TTTCCCGTGA GGAGTCATCG
GCCATATCCT CCATAATGGA TGAGATTTAC TTTTCCTCAA CATCCGTCTC AGACGCGGAG
AATAAGCTAC TGGAAATACT GAAAACGAAA AATATATCTC CAGAATTAAA TGAAAAGATA
CTGAATATAT TTAAAAAGAG AATACTTTAT GATGAAATAA CTGTGCCTGT GATGGATCCT
GAGGTGGAGG AAATTGAGTG TATGGGTCCA GGCTTACCGC TTACCGTAAT TCATAGAAAA
TATTCCAATT ATATGAGGTT ATACACCAAC ATAGTTTTGC CCAAGGAGGA AGATATACTT
AGGATCATAG AAAAACTTGC AATAAAATCA AGTAAATCCG TTAATATTGC TAGACCCTAT
CTCGAATTTT CTCTCCCAGA AGGTCACAGG GTTGCTGCCA CCGTATCCAA TGAGATCTCG
AACCCAGGTT CCACATTCGA TATAAGGAAA TTCCCTGTGT CTCCCATATC CCTAATTAAG
CTGATTAAAG GTAACTCTCT CAGCGTCGAG ATTGCATCCT ATCTATGGTT CTTGCTAGAT
TACAAACCCT TTTATCTGAT AGTGGGGTCC ACAGGTTCAG GGAAAACCAC ATTTCTGAAT
GCCTTGCTAA ACTTCGCTAA TCCAGACGCA AAGATCCTAA GTATTGAGGA TACGCCTGAG
CTTAACCTTA GTGGGAAAAA TTGGATAAGA TTCTTTTCAA GGCAATCCCT AGTCTCGACG
TATGACGTCA CCATTGGTGA GCTATCAAGA CTAGCCTTAA GATATAGGCC AGACTACCTG
ATCATAGGTG AAGTGAGGGG GAAGGAGATT GAGACCCTCA TTCACGCCTC ATCCTCGGGT
CACGCGTCAC TAAGCACTTT TCATGGGGGT AAACCCAGTG ACGTCGTAAC TAGGATTGTA
AGCTTGCTTC CTAAGGAGCT TGCCATAATG TTCCTGAATA ATGTGTGGGG CTTCATTCTA
GTGGGAAGGA GAGTCGATGA AAACGGTAGA ATAATTAAGG CTATAAATGC AATATATGAG
ACGCAAAAGA TCAACGGCAA GACTAAATTC AGAAAAATAG TGTGGTGGTC GTTCAAGGAT
AAAATATATA AGCCAAATAA TTTCAGTGAG CTATTAAAAA TTTCCACTAA ATTAAAGTTT
ATCTCGGATT CGTATGGGTT AAGCAAGTCC GATATCCTTG ACGAACTGGA GAGAAGGAAA
AATGTGATAG AGAAGTTGAT AGCTGAGAAC ATTGCAGACA ACGAGATGAT TCAAGCAGAG
ATAGCCAAAT TCTACAGGGA GAGGGGTCTA AATGTCCAAA GTACGATTTG A
 
Protein sequence
MSRTLLEEYN VMGSKVSIID EDGQGLYIVE DPRISREESS AISSIMDEIY FSSTSVSDAE 
NKLLEILKTK NISPELNEKI LNIFKKRILY DEITVPVMDP EVEEIECMGP GLPLTVIHRK
YSNYMRLYTN IVLPKEEDIL RIIEKLAIKS SKSVNIARPY LEFSLPEGHR VAATVSNEIS
NPGSTFDIRK FPVSPISLIK LIKGNSLSVE IASYLWFLLD YKPFYLIVGS TGSGKTTFLN
ALLNFANPDA KILSIEDTPE LNLSGKNWIR FFSRQSLVST YDVTIGELSR LALRYRPDYL
IIGEVRGKEI ETLIHASSSG HASLSTFHGG KPSDVVTRIV SLLPKELAIM FLNNVWGFIL
VGRRVDENGR IIKAINAIYE TQKINGKTKF RKIVWWSFKD KIYKPNNFSE LLKISTKLKF
ISDSYGLSKS DILDELERRK NVIEKLIAEN IADNEMIQAE IAKFYRERGL NVQSTI
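
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is the standard genetic code, whose amino-acid assignments translation table 11 shares (the two differ only in permitted start codons). It assumes the sequence contains only A, C, G, and T.

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon.

    Raises ValueError on any base other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAGCAGAA CACTCCTTGA GGAATATAAT"
print(translate(first_blocks))  # MSRTLLEEYN
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` should yield the protein followed by `*` for the terminal TGA stop codon.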