Gene Msed_0515 details

Gene Information

Locus tag: Msed_0515
Symbol:
ID: 5103675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 471423
End bp: 472625
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 46%
IMG OID: 640506419
Product: radical SAM domain-containing protein
Protein accession: YP_001190614
Protein GI: 146303298
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
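
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can confirm. The function names are hypothetical. The sketch assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(471423, 472625)
print(length)                  # 1203
print(protein_length(length))  # 400
```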


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000107878
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000774187
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCCTAT CCAAGTTCAA TATTTTTATT GATAATATAA TTTTCAACAC ACTTACCGGT 
TATGCCATGG AACTAGAGCC GTGGGAGATA GAGAAGTTAA AGGGTGGAGA GGTACCTGAT
CACCTTAAGG AAGTAGTGGA GGAGGGATTT TCAACTCCTG GCGATTTGGA GAGCGTGTTG
GAGCCACTCC TCAACAAGCC TGTCCTCGAA CCTACCCTTC TCCTCACATA CAACTGCAAT
TTCAATTGTA CCTACTGTTT TCAGAAGGGA TTCAGGAAAG ATCTCACGGT CACGGAAGAG
GTGATGAAGG GTTTCATAAA CTACGTGAGG AAGAGGGAAA GAGGTAGAAA GGTAAGAGTC
ACGTTCTTTG GGGGCGAGCC TCTCCTAGAG CTCAAGAAGA TCGAGGAGAT ATCTAGGTCG
CTCTCTGATC TGAAGTACTC CTTTAGCGTT GTCACCAATG GTTCCCTCTT GACCAAAAGT
GTAACCCAAA GGCTGATATC CCACGGACTT TCGCATGTCC AGATAACCCT GGATGGACCC
CCGGAAGTTC ACGATAAGAG AAGGTTTTAT GTAGATGGTA GAGGTTCCTT CAACACGATA
ATACAAAACC TGAGAGAGGT TCAGGATCTA GTGAAGGTAG TTTTGAGAAT AAACATAGAC
GTGAATAACC TTAACGAGGT ATACACTCTT CTGGCCAAAT TGGTGGAGGA GGGGATAACT
AGGATCAGAT TGGATCCTCA CTTCGTACAT ACCAACCTAT TTAGGAACGA ATGGTGGGAA
AACGTGATTC CGAAGGACCT GGAATCAGAC GTCCTAGTCA AGTTCTGGGA AAAGGCCAGG
GGTTACGGAT TTGAGATTCC CCATGACATC TTTAGACTTG GGATCTGTGC AGCACATATA
GACGAAGACA TCGTGGTAGA TCCTGAGGGA AAGGTCTATC CATGTTGGGC TTTCACAGGG
AATCCCCTAT ACGTGAAGGG AAGGCTCACG CAGGAAGGTG AGGTGGAGCT ACTGAATCGG
TCCCTATCCG GAAGGAAATC CCTCATAATC CACGAGAAAT GTAAGTCATG CCCCTATCTT
CCCATGTGTA TGGGAGGGTG TAGGTTCCTC TCAGTCCTTG ACGGAAAAGG ATACCACGGT
CTAGATTGCA GGAAGGAAAC TTATGAAAAG CTAGTCAAGC TATTAAAGTT TCTAATGCGG
TAA
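
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The following Python sketch strips the blocks and computes a GC fraction. The helper names are hypothetical, and the sketch assumes the sequence contains only the unambiguous bases A, C, G and T. The example uses just the first line of the sequence, so its result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(blocked):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked):
    """Fraction of G and C bases in a block-formatted DNA sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as formatted in the record.
first_line = (
    "ATGGCCCTAT CCAAGTTCAA TATTTTTATT "
    "GATAATATAA TTTTCAACAC ACTTACCGGT"
)
print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 2))
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared against the GC content field of the record.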
 
Protein sequence
MALSKFNIFI DNIIFNTLTG YAMELEPWEI EKLKGGEVPD HLKEVVEEGF STPGDLESVL 
EPLLNKPVLE PTLLLTYNCN FNCTYCFQKG FRKDLTVTEE VMKGFINYVR KRERGRKVRV
TFFGGEPLLE LKKIEEISRS LSDLKYSFSV VTNGSLLTKS VTQRLISHGL SHVQITLDGP
PEVHDKRRFY VDGRGSFNTI IQNLREVQDL VKVVLRINID VNNLNEVYTL LAKLVEEGIT
RIRLDPHFVH TNLFRNEWWE NVIPKDLESD VLVKFWEKAR GYGFEIPHDI FRLGICAAHI
DEDIVVDPEG KVYPCWAFTG NPLYVKGRLT QEGEVELLNR SLSGRKSLII HEKCKSCPYL
PMCMGGCRFL SVLDGKGYHG LDCRKETYEK LVKLLKFLMR