Gene Msed_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0114 
Symbol 
ID5104967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp94737 
End bp96128 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content41% 
IMG OID640506013 
Productpreprotein translocase subunit SecY 
Protein accessionYP_001190215 
Protein GI146302899 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000441427 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000810769 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCACTTA CTGACGCGCT AGCCAAGTTG GGTCAGGTTC TCCCAGCAGT TACTAAGCCA 
GAGGAAAAAC CAACGTTAAA CAAGAAACTG CTATGGTCCA TAGTGGGAGT AGTAGTCTAT
CTATTAATGT CATCTGTTCC CCTTTATGGG ATCCAGAGTA CTGCCCTAAG TAACTTCCTC
TTGGAACAAG TAATATTTGC GTCTACCGCT GGCACGTTAG CCCAGCTTGG AATTGGACCC
ATAATCACTG CCGGACTAAT AATGCAAATA CTTGTTGGAT CTAAACTGCT CAATCTTAAC
TTAAACGATG AAGAAGATAA GGCAAAGTTC ACAGAAGCAC AGAAGGGGTT AGCCTTTCTT
TTCATCTTGT TGGAGTCATT TCTATTTGCA TTTGCATTGA CTAGGTCAAG TGGATTGTCC
AATATCAATA TTCCGTTAAT TGTCGCTGGG CAATTGATTG TTGCAACTTA CCTTATACTA
TTACTGGATG AATTAATTCA GAAAGGTTGG GGACTAGGCT CTGGAGTAAG CTTGTTCATC
CTCGCTGGAA CAATGAAAAT AATATTCTGG TATATGTTCG GAATTGTGAA CGTTCAATCT
CAAAATCTCC CTGTCGGATT CTTCCCGTCG CTCGTCACAA CCATAATCGA TCACGGCAAC
TTACTTAATC TGGTGGTCAA CACGACGAAA TCTTTTCAGC CTGACCTAGT GGGGCTAATT
ACTACAATAG GTCTAATATT TCTAATAATA TATCTGACTT CCATAAATGT TCAAATACCT
ATTACCTCTC AGAAACTAAG GGGAATAAGA AGAACGATTC CGCTCAACTT CCTTTATGTC
AGTAGCATAC CCGTTATATT TGTAAGTGTT CTTGGTGCAG ATATTGAACT TTTCTCTTCC
TTAACCTCTT ATATATCATC CTCTGCTAGC AGTGTTCTAA ACGCAATCCA ATCCGCATTT
ATATTTCCAC CACCTAGCAC CACAATACCT CACAGTGTCT ACGCTGTGGT ACTAGACCCA
GTAGGCGCAG TGATTTATTC TGTAGTTTTC ATCGTGTTAG GTATACTCTT TGGAATAGTA
TGGGTAGAGG TATCTGGTCT TGATCCTGCC ACTCAAGCTC AAAACCTTGT TGATGCTGGG
ATAGAGATCC CTGGCATGAG GAACAATCCA AAGATGATAG AGGCTGTATT GGCCAAGTAT
ATCTATCCTC TAGCCTTCTT TAGTTCCCTA ATAGTCAGTG TGATAGCGGT AGGGGCTACG
CTTTTAGGAG TATACGGAAC TGGTGTTGGA ATACTCTTGG CGGTGTCCAT AGCGATGCAG
TATTACAGTC TATTAGCATA CGAAAGATCT ATAGAGATGT ACCCCTTGTT AAAGAGATTG
ATAGGTGAAT AG
 
Protein sequence
MSLTDALAKL GQVLPAVTKP EEKPTLNKKL LWSIVGVVVY LLMSSVPLYG IQSTALSNFL 
LEQVIFASTA GTLAQLGIGP IITAGLIMQI LVGSKLLNLN LNDEEDKAKF TEAQKGLAFL
FILLESFLFA FALTRSSGLS NINIPLIVAG QLIVATYLIL LLDELIQKGW GLGSGVSLFI
LAGTMKIIFW YMFGIVNVQS QNLPVGFFPS LVTTIIDHGN LLNLVVNTTK SFQPDLVGLI
TTIGLIFLII YLTSINVQIP ITSQKLRGIR RTIPLNFLYV SSIPVIFVSV LGADIELFSS
LTSYISSSAS SVLNAIQSAF IFPPPSTTIP HSVYAVVLDP VGAVIYSVVF IVLGILFGIV
WVEVSGLDPA TQAQNLVDAG IEIPGMRNNP KMIEAVLAKY IYPLAFFSSL IVSVIAVGAT
LLGVYGTGVG ILLAVSIAMQ YYSLLAYERS IEMYPLLKRL IGE