Gene Msed_1835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1835 
Symbol 
ID5104185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1779949 
End bp1781346 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content40% 
IMG OID640507729 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001191908 
Protein GI146304592 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.963545 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATATC AGTTTAAGAT CATGAATCCA TTAACTGGTT CATTGAAGTT TCTAGCGACT 
ACACTTCTAA ACTCAATCAC CGCTCTACTG TTCTTTCTCA TTGTCGCCCA TTTTTCTAGT
CCGTCATTTG TTGGGAAGGT AGCAATAATA CAGCTTATAG AGACTATTAC AGGATCGTTC
TTTGCTTTAC TACCGTTCAA TCTTGTCACG AGGGATATCT CACATAAATA TGCCTCCTCT
CAGGACCATA GGAAGGTAGT TTCTACTTCA CTCTCGTACT CCCTTTTGGT TTCACCTTTT
CTCCTTTTTC TGTTTCTATT TCCCTCATAC GTATGGTTGT CAATACCGTA CTTTGTTTTA
TATTTATTTT CCACTTATCA ATATCAGATT TTATCGGGAT TGGGAAAGTT CTCTGAAACG
AATTTAGGCA ACGTTATCTT TACCGTAACG AGATGGGGGA TATCCTCGGT CGCAGTGTTT
TATCACAGCA TATCACTCTT GATCCTAATT TGGACTTTAG GTGCCCTGGT AAGGGTTATC
TACTATAACC ACTATCTTCC GTTTAAATTT CACTTCGATT TTCAGGTTGC AAAGGAAATA
GCCAAGATAG GGGTTCCAAT TTATTTGTCA GGAATAGTGT CTTTTATTTC CGGACAAGGG
GATAGAGTTG TTACAGCATT TTTACTGGGA TCGTATAGTC TAGGTATTTA TCAATTGGTA
GCATTGATTT CTGTTGTACC AAATACGCTG ATTTGGTCCT TGACCTCTGC CCTACTACCT
TCCTCTACCT ACTATTACAC TAAGGGCGTC GAGATGAGGG AGATGGCCTC CGGTGCCTTT
AGACTCTTGA CCTTCCTCTC CCTTCTTCTA GGGGTATCTA GTTACGCAAT TGCCCCATAT
CTAGTTCTCA AGCTTTTCCC TGAGTATTCA CCTGGAGTCG AGGTGTTGAA GATCCTAGTT
CTATTCATTA CAGTTACAAT GCCCTTTCAA ATTCTCTCAA CGTTCTTGAT TGCACTCAAC
AAGAATTACA GACCCTTCCT GGTAATTGGG AGCGCGAGCG CCATTGAAGT GGTTCTGGTC
TCCTTCCTCC TGATCCCGCG AATGGGGATT TTGGGTGCGG GGATAGCCCA GGCTGGGAAT
GCCATAGTAA CCAGTATTCT TTATGTAATT TTCTCCCTAA AACAGGGAAT AATAACACTC
GATAGAAAGA CTATATATTC CATTTTGTTG ATATCTCTTT CTTCGATTTC CCTCTTCTCC
TGGGTGATTG GGGCACTTGT GATTATTCTA GGATTGAAGT TTCTTGGAAT CATAACTAAT
AAGGAGATGG CCTTAATACA AAAATTCATT CCACCTCAGC TCAGATTCTT TATAAGGATA
CTTAATCTCT TCATTTAA
 
Protein sequence
MIYQFKIMNP LTGSLKFLAT TLLNSITALL FFLIVAHFSS PSFVGKVAII QLIETITGSF 
FALLPFNLVT RDISHKYASS QDHRKVVSTS LSYSLLVSPF LLFLFLFPSY VWLSIPYFVL
YLFSTYQYQI LSGLGKFSET NLGNVIFTVT RWGISSVAVF YHSISLLILI WTLGALVRVI
YYNHYLPFKF HFDFQVAKEI AKIGVPIYLS GIVSFISGQG DRVVTAFLLG SYSLGIYQLV
ALISVVPNTL IWSLTSALLP SSTYYYTKGV EMREMASGAF RLLTFLSLLL GVSSYAIAPY
LVLKLFPEYS PGVEVLKILV LFITVTMPFQ ILSTFLIALN KNYRPFLVIG SASAIEVVLV
SFLLIPRMGI LGAGIAQAGN AIVTSILYVI FSLKQGIITL DRKTIYSILL ISLSSISLFS
WVIGALVIIL GLKFLGIITN KEMALIQKFI PPQLRFFIRI LNLFI