Gene Msed_1466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1466 
Symbol 
ID5104713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1437827 
End bp1438954 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content52% 
IMG OID640507354 
Producthypothetical protein 
Protein accessionYP_001191547 
Protein GI146304231 
COG category[C] Energy production and conversion 
COG ID[COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain 
TIGRFAM ID[TIGR00273] iron-sulfur cluster-binding protein 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTGGG AAATCGCCAT TGAAAGGACG ATAAGGAACA ACGTTCCCAG AGTTTACAAT 
GTCTTAGAGA AGTATCCCTA CATACTAGAT CTTGCAAAGG AGCTGAGGAA GGCCAAGCTT
GAGGTCCTGA ACAATCTCGA GATGTACGTG GAACAGACGG TCGAGTCTAT AAAGAGAATT
GGTGGGGTTC CACACGTCGT TGGGGATTCT ACGGAGGCGA GAGAGGTCAT CTCCAAGATA
ATTGGGGATA GGAAGAGGGT TGTCATGGGC AAGTCCATGG TGGCCTTCGA AGTTGGATTA
AGGGAACATC TCAAGAGCCT AGGAAAGGAG GTGTGGGAAA CTGACCTAGG CGAGTTCCTG
ATACAGCTCG CCAACGAGCC ACCCTCTCAC ATCATAGCCC CTGCGGTTCA TATGTCAAAG
GAGAGGGCTG AGGAACTGGT TAGAGAGGCG CTCGGTGGTC TTCCTCCCAA TTCAACTCAC
GAACAGATCG TGGCAAGGGT GAGGGAGTTC CTGAGGGACA AGTTCGTCAA CGCTGAGGTG
GGAATAACGG GAGCAAACGC GATAGCTGCC GATACTGGGT CAATCATCCT CGTGGAAAAC
GAGGGAAACA TAAGGTTTAC CACAGTGTCT CCTCCTCTTC ATATTGCAGT GGCCGGTTTC
GAGAAAATCG TACCTACCCT TCCACACGCT ATGATGGAGG CCATGGTCCA AGCTGCATAT
GCGGGATTAT ATCCGCCCAC CTATGTTAAC CTGACCTCTG GACCCAGTTC CACAGGTGAT
ATTGAGATGA AGAGGGTTAG CCCAGCACAT GGGCCCAAGG AGTTCCACCT TGTCCTGGTG
GATAACGGGA GAGTGAAGGC GTCCAAAGAT CCTGACCTGA GGGAGGCCTT ACTTTGTATT
AGGTGTGGTA GATGCCATCT ACACTGCCCC GTGTATAGGG CGATGGATGG AAAATGGGGC
GTTCCTCCCT ACTCGGGTCC CATGGGCTCC ATGTGGTCAT ATGTCGTGTT CGGCGATCCT
AAACCCTCGC TACTCTGCAC ACACTCTGGG GGATGCAAGG AGGTTTGTCC CATGAAGATA
AACATACCGA GGGTTCTAGA GAAGATAAAG GCTCGGGCGT GGAGCTAA
 
Protein sequence
MTWEIAIERT IRNNVPRVYN VLEKYPYILD LAKELRKAKL EVLNNLEMYV EQTVESIKRI 
GGVPHVVGDS TEAREVISKI IGDRKRVVMG KSMVAFEVGL REHLKSLGKE VWETDLGEFL
IQLANEPPSH IIAPAVHMSK ERAEELVREA LGGLPPNSTH EQIVARVREF LRDKFVNAEV
GITGANAIAA DTGSIILVEN EGNIRFTTVS PPLHIAVAGF EKIVPTLPHA MMEAMVQAAY
AGLYPPTYVN LTSGPSSTGD IEMKRVSPAH GPKEFHLVLV DNGRVKASKD PDLREALLCI
RCGRCHLHCP VYRAMDGKWG VPPYSGPMGS MWSYVVFGDP KPSLLCTHSG GCKEVCPMKI
NIPRVLEKIK ARAWS