Gene Msed_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2154 
Symbol 
ID5104893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2068439 
End bp2070790 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content45% 
IMG OID640508045 
ProductDNA polymerase II 
Protein accessionYP_001192217 
Protein GI146304901 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID[TIGR00592] DNA polymerase (pol2) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000768017 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000641936 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAGTGG ACATTTTCAT CTTAGATTTC TCTTATGATG TGGAGGACGG AAAGCCAGAA 
ATTTACATAT GGGGGATAGA TAGGGAAGGA AATAGGGTAG TTATACTTGA GAAGAGTTTC
AGGCCTTACT TCTATGTTAC TCTGAAAGAA GATGCCAACG TTAACGAAGC CATCACCCAG
ATCAAGAGGC TGTCAAGGGA CCAGTCTCCC ATAACATCAG TTACGGAACA TGAGATGAAG
TATTTTGGAA GACCGCAGAA GGTCCTGCGG GTTGAGACTG TTATTCCTGC CCTTGTGAGG
ACTTACAGGG AGGAGATCGC TAAACTGAAA CCTGTGAAGG ATGTCCTTGA GGCCGATATT
AGATTTTACA TGAGGTATTC CATTGATACT GGAGTTAGAC CATTCTATTG GCTCTCGGCT
GAGGTATCTC AGGTAGAGAG GAAAGATCTT AGGGTAGCGA AGGTCTACGA GCTAAAGAGG
ATTGAGAGAG TCTATGAGGA CTCTCCTCCG CAGCTCAGGT TCATGGCTTT CGATATAGAG
GTCTTTAACA AGTATGGTTT TCCGAATCCT AGGAGGGACC CCATTATCGT TATTGGTGTC
TGGACCGATA ACGGATCGAA GCAGTTCGTC AACAACGATC AGGACGATCT AAAGATTTTG
AGGGATTTCT CAAAATTCGT TGTCGAGTAT GACCCAGACG TCATCCTGGG GTATAATTCC
AACGGGTTTG ATTGGCAGTA CATGCTGGAA AGAGCCTCAG TAAGGGGAAC AAAGCTAGAC
ATAGGGAGGA AGGTTAACTC TGAACCAAGC CAGGGAACAT ACGGTCACTA CTCCGTAGTA
GGAAGGCTTA ACGTTGATCT CTACGGCTTC GCGGAGAGTC TCGAAGGTGT GAAGGTTAAG
AGTCTAGACA ATGTGGCAGA CTATCTTGGA ATCTTGCCTA AAAATAAGAG AACCAACCTT
GAATGGTATC AAATTCCTGA GTATTGGGAA GACCCTAAGA GGAGAGAGGT TGTTCTTAAG
TACAATCTGG ACGACGTAAA GACTACATAC CTTCTCAGGG ACGTGTTCTT TAACTTTGGC
GAACAATTAA CTGTAATTTC AGGTCTTCCG CTGGATCAGT TATGCATGGC CAGTGTGGGA
CACAGGGTAG AATGGCTTTT AATGCGACAG GCAAAGCAGT TCAACGAGTT AATACCTAAC
AGGGTTGAGA GGAGATACGA GGGATATAAG GGCGGTCTGG TTATAGAGCC AAAGCCTGGA
CTTCATGAGA ACGTAGCTGT TCTGGATTTC AGCTCCATGT ATCCCTCCAT CATGATAAAG
TATAACATAG GTCCAGACAC CCTGGTACAG GGAGAGTGCA ACGATTGTTG GGTAGCTCCC
GAAGTTGGTT ACAAATTCAG AAAGGACGTG GATGGATTTT ACAGAAGTAT ACTTAACTTC
CTTCTTGAAG AGAGAAGGAA AACCAAAGAT CAGATGAGTC AGGCCAAAGA CGAGTATGAG
AAACGTAGGT TGGATGAGAG GCAGAGGGCT CTCAAGATCA TGGCCAACGC CATGTACGGT
TACATGGGCT GGTTAGGCGC AAGGTGGTAC AGCAAGGAGG GTGCGGAAGC GGTCACAGCG
TGGGGAAGAC AAACCATTAT GACGGCAGCT GAGATTGCCA AGAATTCAGG ATTTGAGGTA
ATCTATGGGG ATACGGATTC CATTTTCGTT AAGGGAGATA TGTCCTCTGT TGAGTTGTTG
ACTCAAAAAA TAGTCCAGGC CCTCGATCTA GATATAAAAG TAGATAAAAA ATACAAAAAA
GTATTCTTTA CAGAAAATAA AAAGAGATAT GCCGGTCTCA CATTCGATGG GAAGATAGAC
ATCGTTGGGT TTGAGGCCAT AAGGGGAGAT TGGTGTGAAT TAGCCAAGGA CACACAGAGG
ATGGTCATAG AGAGGATCCT GCTCAAGGGA GTGGACGACG CCGTTAAGGC AGCTAAGGAG
GTCATAATGA AGGTAAAGAG GAGAGAGTTC GAGCTACATG ATATAGTGAT ATGGAAGTCA
CTGGACAAGA GTCTAGACGA GTATGAGGTG GATGCGCCCC ACGTAATAGC TGCCAAGAAG
GCCATAAACG CGGGTTACGC CATAATGAGA AATGGCAAGA TAGGCTATGT GGTGGTAAAG
GGAGCAGGAA GAGTATCTGA TAGGGTTGAA CCCTATTTCA TGGTTAAGGA TAAGACAAAA
ATTGATATCG ATTACTACGT CGAGAAGCAG ATAATCCCAG CAGTAATGAG AATATTGGAA
CCTTTTGGGG TAAAAGAAAA TAATTTGAAG GGAGGAGGAA TAGATATAAT GGACTATTTT
AGGGACCGAT AA
 
Protein sequence
MRVDIFILDF SYDVEDGKPE IYIWGIDREG NRVVILEKSF RPYFYVTLKE DANVNEAITQ 
IKRLSRDQSP ITSVTEHEMK YFGRPQKVLR VETVIPALVR TYREEIAKLK PVKDVLEADI
RFYMRYSIDT GVRPFYWLSA EVSQVERKDL RVAKVYELKR IERVYEDSPP QLRFMAFDIE
VFNKYGFPNP RRDPIIVIGV WTDNGSKQFV NNDQDDLKIL RDFSKFVVEY DPDVILGYNS
NGFDWQYMLE RASVRGTKLD IGRKVNSEPS QGTYGHYSVV GRLNVDLYGF AESLEGVKVK
SLDNVADYLG ILPKNKRTNL EWYQIPEYWE DPKRREVVLK YNLDDVKTTY LLRDVFFNFG
EQLTVISGLP LDQLCMASVG HRVEWLLMRQ AKQFNELIPN RVERRYEGYK GGLVIEPKPG
LHENVAVLDF SSMYPSIMIK YNIGPDTLVQ GECNDCWVAP EVGYKFRKDV DGFYRSILNF
LLEERRKTKD QMSQAKDEYE KRRLDERQRA LKIMANAMYG YMGWLGARWY SKEGAEAVTA
WGRQTIMTAA EIAKNSGFEV IYGDTDSIFV KGDMSSVELL TQKIVQALDL DIKVDKKYKK
VFFTENKKRY AGLTFDGKID IVGFEAIRGD WCELAKDTQR MVIERILLKG VDDAVKAAKE
VIMKVKRREF ELHDIVIWKS LDKSLDEYEV DAPHVIAAKK AINAGYAIMR NGKIGYVVVK
GAGRVSDRVE PYFMVKDKTK IDIDYYVEKQ IIPAVMRILE PFGVKENNLK GGGIDIMDYF
RDR