Gene Msed_0906 details


Gene Information

Locus tag: Msed_0906
Symbol:
ID: 5103552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 837089
End bp: 838396
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 47%
IMG OID: 640506809
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001191002
Protein GI: 146303686
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
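
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 837089
end_bp = 838396

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1308
print(protein_length)  # 435
```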


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.598412
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTATTAT ATATGACACA AATCCTCGAG GCGAGAAGGG GAAATGTCAC CGAGGAGATG 
AGGAAGATCG CTGAAATCGA GGGAGTATCT CCGGAAAAGA TTAGGGATAG GGTCGCTACA
GGCAGAGTAG TAATTCTCAA GAACTTGAAA AGAAACTTGA GGAAGTACAC CGCAGTAGGG
GAGGGGCTCT CAACGAAGGT GAACGTTAAT ATTGGTGCCT CGACTGATCA TTACAATATA
GAAGAGGAAC TGAGGAAGGT GGAGATAGCG AATAGATACG GAGCGGATTC CCTAATGGAC
CTAACCGATG GCGGTGACAT AGATTCCATG AGGAGGATGG TAATAGAACA CGCAGAAATG
CCCGTGGGGA CGGTCCCCAT TTATCAGGTA TATTATGAGA TGGTCACAAG AAGGAAGTAC
GTAATCGACT TCACGCCTGA TGAACTTTTC AGGGTGATAA GGAAACAGTT AGATGACGGA
GTTGACTTCA TCACTGTTCA CACTGGAATT ACCCTCGAGC TCTCCAGAAA ACTAGTTGAG
AAGAAAAGAG TGGCAGGTAT CGTGAGCAGG GGTGGAACTA TCATGGCTGC ATGGTCCCTA
CATAATCAAA GGGAGAACCC GCTTTATTCA GAGTTCGATT ATCTCCTGGA AATAGCAAGG
GAATATGATG TTACCCTGAG CCTTGGAGAT GCGCTCAGAC CTGGAGGAAT TGCTGATGCG
CATGACGAGT TCCAGGTAGC TGAGCTCATA AATAACGCTA GGTTGGCCAG GAGGGCGGTC
GAAAAGGGAG TCCAGGTAAT GATAGAGGGA CCTGGTCACA TGCCCTTAGA TCAGATAGAG
ATGGACATAA AACTTGAGAA GGAACTCTCT GGAGGAGTTC CATATTACGT CCTAGGAATT
TTACCTACCG ATATAGCTGC AGGCTATGAT CACATTGCCG GAGCCATAGG AGGTGCAGTC
GCCTCAGCTC ACGGAGCTGA TATGTTATGT TATCTTACAC CTGCAGAACA TCTTTCCTTG
CCTACCCCAG AACAAGTGAA GGAGGGACTC ATAGCATTTA AGATAGCTGC CCATGTGGGG
GATACGATTA AACTCGGGGA GAGGGCTAGG GAAAAGGACA GAGAAATGAG CGTGGCTAGG
GCCTCTCTCA ATTGGTTGAA GATGTTCTCG CTCACCTTCG ATCAGGATAG GGCGAAACAG
ATCTACACTC AATACAAAGA CAAGCCTCTC GGTTCATGCA CCATGTGTGG AGACCTTTGC
GTCTACCTGG TACTACCCAG AGTTACCGAG AAAATGAAGA GAACTTGA
 
Protein sequence
MLLYMTQILE ARRGNVTEEM RKIAEIEGVS PEKIRDRVAT GRVVILKNLK RNLRKYTAVG 
EGLSTKVNVN IGASTDHYNI EEELRKVEIA NRYGADSLMD LTDGGDIDSM RRMVIEHAEM
PVGTVPIYQV YYEMVTRRKY VIDFTPDELF RVIRKQLDDG VDFITVHTGI TLELSRKLVE
KKRVAGIVSR GGTIMAAWSL HNQRENPLYS EFDYLLEIAR EYDVTLSLGD ALRPGGIADA
HDEFQVAELI NNARLARRAV EKGVQVMIEG PGHMPLDQIE MDIKLEKELS GGVPYYVLGI
LPTDIAAGYD HIAGAIGGAV ASAHGADMLC YLTPAEHLSL PTPEQVKEGL IAFKIAAHVG
DTIKLGERAR EKDREMSVAR ASLNWLKMFS LTFDQDRAKQ IYTQYKDKPL GSCTMCGDLC
VYLVLPRVTE KMKRT
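
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the following Python joins the blocks and translates the DNA codon by codon. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 permits are not modeled here. The helper names `clean` and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
head = "ATGTTATTAT ATATGACACA AATCCTCGAG"
# Last 48 bases of the gene sequence above (they begin in frame).
tail = "GTCTACCTGG TACTACCCAG AGTTACCGAG AAAATGAAGA GAACTTGA"

print(translate(head))  # MLLYMTQILE
print(translate(tail))  # VYLVLPRVTEKMKRT
```

Both outputs match the corresponding start and end of the listed protein sequence once its spaces are removed.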