Gene Msed_1775 details

Gene Information

Locus tag: Msed_1775
Symbol:
ID: 5104775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1717749
End bp: 1718693
Gene Length: 945 bp
Protein Length: 314 aa
Translation table: 11
GC content: 52%
IMG OID: 640507673
Product: 3,4-dihydroxyphenylacetate 2,3-dioxygenase
Protein accession: YP_001191854
Protein GI: 146304538
COG category: [R] General function prediction only
COG ID: [COG2514] Predicted ring-cleavage extradiol dioxygenase
TIGRFAM ID: [TIGR02295] 3,4-dihydroxyphenylacetate 2,3-dioxygenase
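
The length fields in this record follow from the coordinates by simple arithmetic. A minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (the record states neither explicitly; the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1717749, 1718693)
print(length_bp)                  # 945
print(protein_length(length_bp))  # 314
```

Both values agree with the Gene Length and Protein Length fields above.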


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0882804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGACA TATTACGCGT CTCCCATGTA GTTTACAGGG TCACTGATCT GGACAGGGCC 
CTCTTTTTCT ATAGGGACCT CCTGGGCTTC GTGGAGACGG AGAGAAATGG AAACGAGGCT
TATTTACGCG GAGTTGAGGA GGGACAACAT CACAGTCTTG TTTTAAGGAA AGCTGACTCC
CCCGGTTTAT CCTACGCCTC CTTGAGGGTG AGAAAACCCG AGGTTCTGGA TCAGGCCAGG
GAGAAGTTCG ATGAGATTGG CATAAGGTAC AGGAGAATGA AGGAAAGGGG AGTGGAGGAG
GCAATCCTCT TTGAGGACCC GCAGGGTCTA CCTATTCTCC TGTATCACGA CATGGAGTAC
GTGGGAGATA GAAGGCTCAA GTTCCACGAG TACAGGGGAG TGACCCCCGT AAGGATAGAT
CACATCAATT TCATGGTAAG GGACCTAGAC GTTGAGGTTG AGTTCTACAC CAAGGTCTTT
GGATTCACTG AGACCGAGAC GTTCCTGGAT AGGGATGGGA AAAAGATGGT CTCCTGGATG
ACCAAGATCG GTCACTCGCA CGAGATTGCC ATCGCCAGAA GTTCCAGGAA CGTTCCGGGG
TTTCATCACG CAACCTTCTA CGTTCATGAC GTGAGGGATA TCATAAGGGC TGCGGACCTA
GTCTCCTCGG CTCAACTTTG GGACAGCCTA GAGAGGGGAC CTGGAAGGCA CGGGGTTACC
CAGGGGTTTT ACGTTTACCT CAGGGATCAG GACAGGAATA GGATAGAGTT CTTCACGGGC
GATTACTTCG TTCTAGATCC CGATAAGTGG AAACCCATAG CCTGGACCTG GGACCAGCTG
AGGTACAGGT CAGACTTCTG GGGAAGGGAG GTGCCAGAGA CCTGGCTCAA GGAGTGGGTT
CCCGTGGAGG ATATCACGGG TAAATTACGG GGGTGGAATA ATTGA
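
The GC content field can in principle be recomputed from the gene sequence. A sketch, assuming the 10-base blocks and line breaks are display formatting only; `clean_sequence` and `gc_percent` are illustrative names, and the short excerpt is the first 20 bases of the sequence above. How the database rounds the reported percentage is not stated, so no rounding is applied here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


excerpt = clean_sequence("ATGCTGGACA TATTACGCGT")
print(len(excerpt))         # 20
print(gc_percent(excerpt))  # 45.0
```

Passing the full gene sequence through the same two functions gives the value to compare against the GC content field.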
 
Protein sequence
MLDILRVSHV VYRVTDLDRA LFFYRDLLGF VETERNGNEA YLRGVEEGQH HSLVLRKADS 
PGLSYASLRV RKPEVLDQAR EKFDEIGIRY RRMKERGVEE AILFEDPQGL PILLYHDMEY
VGDRRLKFHE YRGVTPVRID HINFMVRDLD VEVEFYTKVF GFTETETFLD RDGKKMVSWM
TKIGHSHEIA IARSSRNVPG FHHATFYVHD VRDIIRAADL VSSAQLWDSL ERGPGRHGVT
QGFYVYLRDQ DRNRIEFFTG DYFVLDPDKW KPIAWTWDQL RYRSDFWGRE VPETWLKEWV
PVEDITGKLR GWNN
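
The protein sequence can be checked against the gene sequence by translating it codon by codon. A sketch, assuming the amino acid assignments of NCBI translation table 11 (these match the standard code); the name `translate` is illustrative, and the example input is the last line of the gene sequence above, which starts in frame because the preceding 900 bases are a multiple of three.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


tail = "CCCGTGGAGG ATATCACGGG TAAATTACGG GGGTGGAATA ATTGA"
print(translate(tail))  # PVEDITGKLRGWNN
```

The output matches the last 14 residues of the protein sequence, and the final codon TGA is read as a stop.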