Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1444 |
Symbol | |
ID | 5104814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1411502 |
End bp | 1413190 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507332 |
Product | DNA polymerase B region |
Protein accession | YP_001191525 |
Protein GI | 146304209 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0417] DNA polymerase elongation subunit (family B) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGGTT ACGTAATAGA TGCCGTACCA TACAGGGGTA GAATAAAGAT CGTGCTGGAC TCCTTCAGAG AAGCGTGGAT TAAGACAACC TATCCCATTT ACGTCATCAC CGATAGACCT GACCTCGTGA TGCAACATCC CTCGGTGGTG TCCCACGAAG AGGAGGAGTG GAGAACCCTA TCTGGCGAAA GGATTACCTT ACACAGGTAT GAGCTCCTTG ACCTTGAGGC GAGGTCCTAT ATTTCTAGGA GAACGACCGT GGTGAACCAG CTTCCCACCA CACTTTCCCT TGTCCTGGAC AGGTTAGATG CTAGACCCTT CAGAAGGGTC AGGATAGACG AAGGTAAAGT CCTCGAATTC TGGGATCTGG GCCTATCCTT TCCGCCCGTG AAATATGCCA CGGTGATAAC ACATGACTGG TATGGCCCTT CAGAGACGGG GAACATGTTT GAGGCTGACG TGAACGGAGA GAAGTGGAGG GGGTTACTGA GAGACCTGGA CCTCGAGGTA GACGTCGCTG GCTGTTTCGG ACGTGCGTGC GACCACGTGA AGGCGCCAGT TAAGATAAAA CTTGAACAAA AGAGGTCTCC GGTCTCCATT AAGGGGTTAA TTGAGTGGTC CTACACCTGC AAGACACCCG TAAGGGAATT GGTGGACGCT ACCATAGGGA AAGCGCTGAC CACAAACGAG GCCTGGGTCG CATTCGAGAA GAAAATAGTT GTTCCCAACA AGGTCCCGAG GGTAGAGAAG CTCAGGGATA TGGATGAACT GCTCCTCAAC GATAAGGGTG GACTGGTCCT GTTTCCTAGA ACCGGTTGCT TCGACGATGC CTGGCAAGTT GACTTCTCCT CTATGTATCC TTCTCTCATA GTTAAGCATA ACATCTCCGG GGAAACCGTT GACGCCTGCG ATGACGTAGT CACAGAGATA GGCCATACCA TCTGCAATAG CGAGAGGGGG ATAGTACCGG AGGCGCTATC CTGGTTAATA AGAAGGAAAG AGGCCCTAAA GCCCGTGGAC CCTGAAAGGG CAGAGGCGAT AAAATGGATA CTTGTTGCCT CCTTTGGGTA TCTAGGATAC AGGAACTCTA AGTTTGGGAA GATTGAGGCA TATGAACTGG TGACCTACTA CGCGAGGAAG ACATTGAGGA GAGCCTTGGA AATTGCGCAG GAAATTGGGG TTGAAGTGCT TCACGGGATA GTGGACTCCC TCGTGATTAG GGGGGATGCT CCACAGCTAG TGAGGAGACT GGAGGAGGAG ACTGGGTTAC ACCTTCGCTC CACCAGGCTA AAGTGGATAG TTCTTGGGGG GAGAAGGGAT GGACTTCCTT ACCCTATGAG GTACTTTGGA ATGACGGAGG AAGGAATGAA ATACAAAGGA ATCATCAGGA GAAACATGCC GAACCTAGTA AGGGATTTCC TCGAGAGCTC CATGAACATC CTTTCCAGGG CGGAGACGTG TGAGGACCTG AAAAGGCTTA GGTCAAACCT ACTCGAGAAC CTCAGGGAGT ATGAAGAGAG GGTAATTCAT GGGGAGCCCA GGGACTTCGT TATGTGGGTG AAGGGAGAGC CATACGTGAG GGGAGTGAGG GGGTTCTACA ACGCAAGGAG AGGGTTCATG GGGAGGGATA CGAAGTACTA TCTGGAATAT CTTAGGAGAA CGGCGAGGGA GGTCCTGGGA ATTGGATGA
|
Protein sequence | MEGYVIDAVP YRGRIKIVLD SFREAWIKTT YPIYVITDRP DLVMQHPSVV SHEEEEWRTL SGERITLHRY ELLDLEARSY ISRRTTVVNQ LPTTLSLVLD RLDARPFRRV RIDEGKVLEF WDLGLSFPPV KYATVITHDW YGPSETGNMF EADVNGEKWR GLLRDLDLEV DVAGCFGRAC DHVKAPVKIK LEQKRSPVSI KGLIEWSYTC KTPVRELVDA TIGKALTTNE AWVAFEKKIV VPNKVPRVEK LRDMDELLLN DKGGLVLFPR TGCFDDAWQV DFSSMYPSLI VKHNISGETV DACDDVVTEI GHTICNSERG IVPEALSWLI RRKEALKPVD PERAEAIKWI LVASFGYLGY RNSKFGKIEA YELVTYYARK TLRRALEIAQ EIGVEVLHGI VDSLVIRGDA PQLVRRLEEE TGLHLRSTRL KWIVLGGRRD GLPYPMRYFG MTEEGMKYKG IIRRNMPNLV RDFLESSMNI LSRAETCEDL KRLRSNLLEN LREYEERVIH GEPRDFVMWV KGEPYVRGVR GFYNARRGFM GRDTKYYLEY LRRTAREVLG IG
|
| |