Gene Msed_1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1197 
Symbol 
ID5104493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1168127 
End bp1169578 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content52% 
IMG OID640507089 
Producttype II secretion system protein E 
Protein accessionYP_001191282 
Protein GI146303966 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.585907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAGC CGAGAGACGT CAAGGTAATA AGACCCAAGT ATCAGGGCGA TATCATCCAG 
GAGTACGACG TCAAGGGGGT TTACCCCGAT TACGATGTTG CCTTCCCCAA GGTTAAGATC
ATCAGGAACC AAACTGACGT GTATTACGAC ATTGAGGAGC CCAACATAAA CCAGTCCCAA
CTTGAGCTCC TGAAGAAGGC GCACAAGGCA CTCTACTACA CCCTCAAGCC CTCTGAGGTG
GACCGCCTGG TCACCTCCTA CCTAGACTAC GTTGGTAGGG ACAAGGTTAT AGGCTATTAC
GTGGTCAGGG ACATCTTAAG ATATGACGTC CTCACAACCC TCCTAGACGA CGAGCAGATA
GAAGATGTGT CCTACTCTCC CACCGCCCCA TCCTCACCCG TGTTCGTGTT CCACAGAGGG
TACGGAACCT GGATCTCCAC TAACGTGATC CTTGACTCCA GCGAGGCCAA CTATCTGGTT
CAGAAGATCG CCTTCAAGAG CGGTAAATCC ATCACCCTAG CTAGGCCAAT CCTAGACGCC
ATGACCCCTG AGGGTTACCG TATTGCTCTG ACCTACGGAA GGGAGGTCTC TCCCACTGGC
TCAACTATCT CCATAAGGAA GCCGATCAAG GAGGTCTGGA CTCTCCCCTA CATGGTAGTA
AGGAGGAGGA TAATGCCACC CCTCGTAGCC TCAGCGCTCT GGTTCGTCCT TCAGAATCGC
GGTGTCATAC TCGTGGTCGG GAGATCTGGT TCGGGTAAAA CGACCCTGAT CAACGCCTTG
CTCACAGTTG CCCCACCCTC CTGGAAGATA GTCACGGTGG AGGAGATACC AGAGCTAAGG
GTAATCTACC CCAACTGGGT GAGGCTTGTG TCCAGGAAGC CAACCCTAAT GACCGAGTAC
AGTGAGTCTG CTGAGATTCC ACTGGATCGA CTCATCTCCC ACACCTTCAG GGTTAGGCCC
GACCTGGTCT CGGTGGGTGA GGTAAGGTCT AAGGAGGAGA TAAGGGAGTT CATTCACTCA
GTAGCTGCAG GTCACGGTGG AATCACTTCG CTACACGCTG AGGACTTTGC CTCCCTAAAG
GCTAGGTTCA ACTATGCTGG GGTAGACGAC TCCTTCTTCG CGATTGTGTC CATGGTGGTA
TTCGTGAACT CCTACAACGT TAGTGGTAAA CTAGTAAGGA GGGTCCAGGA GGTCGGTGAG
GTAGTCCTGA GGGACGGTGA GGCACATTAC GTTCCGCTTG CGACCTACTC TCCCGTGTCT
GACTCTTACG TGGTGGACAT ACTCCACAGC AAGAGGTTAA TGACCATGGC CTCACTCAAG
GGTTACACTG AGGGAGATCT TAAGGCAAAC TTAGAGAAGA AGGTTAAGTT TATGCTAGAG
ACCGCCGGTC TACCGCAACC TGAATACGAG AAAAGGATAA GGTCCTACTA CGAGGAGGAG
GGTATAGTAT GA
 
Protein sequence
MMEPRDVKVI RPKYQGDIIQ EYDVKGVYPD YDVAFPKVKI IRNQTDVYYD IEEPNINQSQ 
LELLKKAHKA LYYTLKPSEV DRLVTSYLDY VGRDKVIGYY VVRDILRYDV LTTLLDDEQI
EDVSYSPTAP SSPVFVFHRG YGTWISTNVI LDSSEANYLV QKIAFKSGKS ITLARPILDA
MTPEGYRIAL TYGREVSPTG STISIRKPIK EVWTLPYMVV RRRIMPPLVA SALWFVLQNR
GVILVVGRSG SGKTTLINAL LTVAPPSWKI VTVEEIPELR VIYPNWVRLV SRKPTLMTEY
SESAEIPLDR LISHTFRVRP DLVSVGEVRS KEEIREFIHS VAAGHGGITS LHAEDFASLK
ARFNYAGVDD SFFAIVSMVV FVNSYNVSGK LVRRVQEVGE VVLRDGEAHY VPLATYSPVS
DSYVVDILHS KRLMTMASLK GYTEGDLKAN LEKKVKFMLE TAGLPQPEYE KRIRSYYEEE
GIV