Gene Msed_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1098 
Symbol 
ID5103572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1024786 
End bp1026351 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content52% 
IMG OID640506993 
Producthypothetical protein 
Protein accessionYP_001191186 
Protein GI146303870 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTG AGGACGCTAA GAGTCGTGCG AGTAAGTACG GCGAGGTTGT GGGTCTCATT 
AGCAGGGTGA CCCCAATCTC CCACGGAAAG GACAATAACC AGATTAGGGC TGAGATCCCC
TACGAGGTCT ATCTCAAGAG GAAGTTCCTC ATAGGGAGTT ATGTGGGGAT ATCTATACCC
GTGTCAGGGA CTCTCATGCT GGGGAGGATT ACCTCGGTCG AGAGGGCAGA CATCCTTGCA
ATCTCCAGGA TCCCAGCTCT CTCCCCTGTT GAAGACGTGT CCGCTATTAC CACCCCTCTC
TCCCTGACCA TAGAGCTCCT GTCCGAGAAG GTGGAGAATG AGGTGGTTCC GCCCAGTTCA
CCTGTGGACC CGCAGAGTCC TATCTTCGTT CCAAGCCAGG AGTTCATAAA GGAAATGTTG
GGGTTACCGC AAGACGGAAT ACCCATAGGG AAGATCGTCG AGGGTTACAG GATCCTTGAC
GTACCTGTCA ACCTCACCGA GGAGGCGTTG AGACACCACG TCCTCGTAGT GGGAACTACG
GGTGCGGGGA AAACAAACCT TCTTCGCCTC CTGATCACCA GGAGCAGGAT CCCGGTCCTA
GGCTTCGACA TTCAGGGAGA TTACGTGAAG ACCATGGCGA AGATTGGGGG AACCGTCCTT
GTTCCTGTGA CGAGGGATAT GGGGAAGGTA ACTGAATTCG TTTCGCTCTT CTTGAAGAGG
AGCAACCTGC AGGACTTCAG GATATCCCAG GTTGACGGCC AAAGGATCAC GTTGACCAAC
GGTGAGAAAA CCTTTCACGT GGAGCTCTTG GGTTTCAGGT TGAGGGAGAC CTACAAGGAG
ATTCCAGATG TGTCGCCCCT CTTCTCTGGG CAGGGAGCCT ATTTCTTCAA GTTGATCACA
GAGCACTGCC TAACGGAAAT TGACAATTGG ATTGGGGAAT GTGAGGAACT CTTTTCTGAG
TTTCATGTTC ATAAGACCAC AGAGGACAAC ATAAGGAGGT CAGTTATTAT GCTGAAGGAG
ACTGGCATCC TGGACATACC CCTGGAAAAG GGGTTCCTTG GGGAACCCAA CTACGAGGAC
CTAGTAAGAA AGAAGGCAAT AGTGGACTTG AGATGGGTAA TGGAGAAAGG AATTTCCACC
GCAACCACCA CGGCCTTCCT CATTGTGGAC AGATTGTTCA GACTGATAGA CGCGAAATAC
AAGAATGAGG GGGTTGAGAC TCCTTACCTC CTCGTATTCG ATGAGGCTCA CGAGTACTTC
CCGCAGTCAA GGAGGGATGA GGAGAAGGAG GGGCTTGAGA GACTCATCAA CAGAATACTT
AGGCTGGGGA GGGTAAGGGG AATGGGCACG GTGTTGGCTA CCCACAGGCC GACGGACCTA
AACGACCTCA TCCTTACACT CACGAACACG AAGATCGCCA TGAGGGCTGA CGAGGATGCT
CTCGAGAAGA TAGGGATGGA GGAGTACGCG AACATACTGC AGGCCTCACC ACCTGGGTAC
GCAGTTATGA GGACTTTCTC CTTGAAGGTC CAGGACCTAG TGTTCAGGAC GGATAAGTAC
GAGTAA
 
Protein sequence
MSLEDAKSRA SKYGEVVGLI SRVTPISHGK DNNQIRAEIP YEVYLKRKFL IGSYVGISIP 
VSGTLMLGRI TSVERADILA ISRIPALSPV EDVSAITTPL SLTIELLSEK VENEVVPPSS
PVDPQSPIFV PSQEFIKEML GLPQDGIPIG KIVEGYRILD VPVNLTEEAL RHHVLVVGTT
GAGKTNLLRL LITRSRIPVL GFDIQGDYVK TMAKIGGTVL VPVTRDMGKV TEFVSLFLKR
SNLQDFRISQ VDGQRITLTN GEKTFHVELL GFRLRETYKE IPDVSPLFSG QGAYFFKLIT
EHCLTEIDNW IGECEELFSE FHVHKTTEDN IRRSVIMLKE TGILDIPLEK GFLGEPNYED
LVRKKAIVDL RWVMEKGIST ATTTAFLIVD RLFRLIDAKY KNEGVETPYL LVFDEAHEYF
PQSRRDEEKE GLERLINRIL RLGRVRGMGT VLATHRPTDL NDLILTLTNT KIAMRADEDA
LEKIGMEEYA NILQASPPGY AVMRTFSLKV QDLVFRTDKY E