Gene Msed_2296 details


Gene Information

Locus tag: Msed_2296
Symbol:
ID: 5104247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 2189967
End bp: 2191196
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 52%
IMG OID: 640508195
Product: hypothetical protein
Protein accession: YP_001192357
Protein GI: 146305041
COG category: [R] General function prediction only
COG ID: [COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain
TIGRFAM ID:
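
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2189967, 2191196)
print(length_bp)                  # 1230
print(protein_length(length_bp))  # 409
```

Under these assumptions the computed values agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields.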


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.414005
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATGT ACGTAATTGG CGTTGACGAT CACGATTCCC CAGAGGGCGG ATGCACCACA 
CATTTTTCCT CGCTTTTATT GAAAGAGTTT AACAAGGCTA ACGTAAGGGT TGTGGGATAT
CCTAGGTTAA CTAGGCTGAA CCCCAACATA CCCTGGAAAA CCAGGGGGAA CGCCTCAGTC
TCATTCGTGG TGGAGACTGA GAGGGACCAG GCGGAACTCC TTGAGATGGT CTGGAGCGAG
TCCATGAACT ACGTGGAGAG GGTGTCCAGG GGATTACTGT ACAAGAGGTC TCCCGGCGTT
TCTGTGGGAA AGGTTGAGGT GATGGGGGAG CTGGAACACC TCTACTGGAA GGCGGTCAGC
GACGTGGTCA CGCTGGATTA CGTCAAGAAC GTTTCGGAAA GGCTTGGGAT CCTTACCACG
GGAGGTAGGG GGGTGATAGG GTCCATGGCC TCCATGGGAT TTTCAGGTAA TGGAACCTAT
GAACTAGTAA CGTACAGGGC CCAGGAAAAC TGGGGGAGGA GAAGGGAGTT GGACCTATCC
TCCCTGATAG AGTATGACGA GAGATATTTT CCAAGGGTGT ATGCGAATGT GGACTACGTG
GACATGGAAC CACTGGTCCT GTCTCACGGA AGGGATCCCG TGCTGTTTGG TCTTAGGGGA
ACCGATCCAG TCGCGTTGGT GGAAGGAATG AAGAGATTGA AGGTGAACGA GGAAGCGGAA
TCGTATGTGG TGTTCGTAAC TAACCAGGGA ACTGATCACC ATTTTCGGAA CCCTAAACTT
AGGCCGTACT CCAGTTTCGT GGGAGAAGTT ACCGTGAATT CTGTGAGGGT GGAAAGGGGA
GGAGACTGCG TCGTGATAGG GGATGACCTG GTGATGCTGG TGTACAAGGA AACCGGGGAG
TTAAACAGGG CCGTAAGGGA GTTATTGCCT GGAGATAGGA TCAGGGTCTA TGGAGCTGTC
AAGCCGTCGG TTAGGTACGG GGTTGTGATA GAACCGGAAA AGGTGGAGAT CCTGAACTTG
GTGCCAAAGG TGGAGGTAAA CAATCCTAGA TGTCCCATCT GCGGAGGCTC CTCCGAGTCT
GCCGGTAAGG GGAAGGGATT TAGATGCAGG AGGTGTGGGC ACAGGTTTGC GGGGGAGAAG
GTGGTGAGGG AAGTGGAAAG AGGAATCAGT CTAGGAGTGT TTCAGACGAG GAAGTACAGA
CACCTAACGA AGCCGATTTT TTATGAGTAG
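
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases, with the spaces between 10-base blocks ignored; `gc_content` is a hypothetical helper name. Only the first row of the sequence is used here as sample input, so the result is not the value for the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only (sample input, not the full gene).
first_row = "ATGAACATGT ACGTAATTGG CGTTGACGAT CACGATTCCC CAGAGGGCGG ATGCACCACA"
print(round(gc_content(first_row), 1))
```

Passing the full 1230 bp sequence instead of `first_row` gives a figure that can be compared with the GC content field of the record.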
 
Protein sequence
MNMYVIGVDD HDSPEGGCTT HFSSLLLKEF NKANVRVVGY PRLTRLNPNI PWKTRGNASV 
SFVVETERDQ AELLEMVWSE SMNYVERVSR GLLYKRSPGV SVGKVEVMGE LEHLYWKAVS
DVVTLDYVKN VSERLGILTT GGRGVIGSMA SMGFSGNGTY ELVTYRAQEN WGRRRELDLS
SLIEYDERYF PRVYANVDYV DMEPLVLSHG RDPVLFGLRG TDPVALVEGM KRLKVNEEAE
SYVVFVTNQG TDHHFRNPKL RPYSSFVGEV TVNSVRVERG GDCVVIGDDL VMLVYKETGE
LNRAVRELLP GDRIRVYGAV KPSVRYGVVI EPEKVEILNL VPKVEVNNPR CPICGGSSES
AGKGKGFRCR RCGHRFAGEK VVREVERGIS LGVFQTRKYR HLTKPIFYE
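
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a hand-rolled translator, not the database's own method. It assumes the reading frame starts at the first base and that translation stops at the first stop codon. The codon-to-amino-acid assignments are the standard ones, which translation table 11 shares with table 1 (table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAACATGT ACGTAATTGG CGTTGACGAT"))  # MNMYVIGVDD
print(translate("TTTTATGAGTAG"))                      # FYE
```

The two fragments translate to the first ten residues (MNMYVIGVDD) and the last three residues (FYE) of the protein sequence, with the terminal TAG read as a stop codon.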