Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1907 |
Symbol | |
ID | 5104195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1853580 |
End bp | 1856213 |
Gene Length | 2634 bp |
Protein Length | 877 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507795 |
Product | DNA polymerase I |
Protein accession | YP_001191971 |
Protein GI | 146304655 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0417] DNA polymerase elongation subunit (family B) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.488214 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATAA TGGCCAGACA GCTTACCCTT GCTGACTTCT CTGGGATCAA GAGAGAGGAA CCAGTTAAAC AGGAAGAGAA GACGCAGGAG GAAGAGAGGC CTCTGGAAAG GCCAGCGAGG CTAAGAAAGG ACACAGTTAA ACAGGCGCAG GAGGAGAGAA AGTACTTTCT TCTCTCCGTA GACTATGATG GTAAAATGGG GAAGGCTGTC TGCAAGCTTT ATGATCCTGA AACGGGTGAG CTACACGTCC TTTACGACAG CACGGGTCAC AAGTCATACT TCCTTGTGGA TTTAGAGCCA GATCAGATCC AAAAAATTCC AAAGATTGTT AAGGATGAGT CCTTTGTTAG GCTTGAGAAG ACCACTAAAA TAGACCCCTA CACTTGGAAA CCTATTAACC TAACCAAGAT TGTGGTGAAT GACCCCCTCG CTGTGAGACG CCTAAGAGAA TATGTCCCAA GGGCCTATGA AGCTCATATA AAATATTTTA ACAATTATAT TTACGATTTC AGCCTCATAC CAGGGATGCC CTACGTGGTA AAGAAGGGGA AGCTAGTCCC CCTTAAGCCG GAGGTTGACG TCAAAGAGGT AAAGGAAGCG TTCAAGGATG CTGACCAGAT AGCTCAAGAG ATGGCGCTAG ACTGGGCTCC CCTCTTTGAG TCCGAGATTC CGTCGGTGAA GAGGGTCGCA ATAGATATAG AGGTTTATAC TCCCATGATG GGTAGGGTAC CGGATCCAGT AAAGGCCGAG TACCCCGTGA TAAGCGTAGC CCTAGCAGGG AGCGATGGCC TGAAACTGGT CCTAGTCCTT GATAGGGGAG ATAGTCCGAT TCAAAGTAAG GATATCAAGG TTGAGGTCTT CCGCACAGAG AGGGAGCTTC TCTCCAGGTT GTTTGACATT CTTAAGGAAT ATCCCATGGT TCTGACCTTT AACGGAGACG ACTTCGATAT CCCATACCTG ATCTTCAGAG GTTTCAAGCT CGGGTTACTA CAGGATGAGA TACCCTTCGA GATCTCTAGT TTTGGCAGGA AACCTGACGC GAAGTTCAGA TATGGATTTC ACATAGATTT GTACAGGTTC TTCTTCAACA AGGCGGTCAG GAACTATGCA TTTGAGGGGA AGTACTCAGA GTACAACCTT GACACCGTAG CCCAGGCACT CTTGGGTCTC TCCAAGGTCA AGTTGGACGA GTCCATTAGC GACCTAAACA TGTCTAAACT CGTGGAGTAC AACTACAGGG ACTCGGAGAT CACGCTGAAG TTGACCACGT TCAACAACGA ACTAGTATGG AAGTTGATTG TACTCTTCTC CAGAATTTCC AAGCTTGGTA TAGAGGAGCT AACTAGGACA GAGATATCAG CCTGGGTAAA GAACCTGTAC TACTGGGAAC ATAGGAAAAG GAACTGGTTA ATCCCCCTCA AGGAGGAAAT CCTTGAACGC TCCTCTGGGT TGAAGACAGC TGCCATTATC AAGGGAAAGG GATACAAGGG CGCAGTGGTC ATAGACCCAC CTGTGGGGGT TTACTTTGAC GTAGTTGTTC TGGACTTCGC CTCACTGTAT CCCTCCATCA TCAGGAACTG GAACCTCAGT TATGAAACCG TTGATGTGAA GGAATGTAAC AAGAAAAGGG ATATAAGGGA TGAGAGTGGG GCGAAAATCC ATGAGGTGTG CGTGGACAGG CCCGGGATTA CTGCAGTGGT AACTGGCTTA CTTAGGGACT TCAGGGTCAA AATTTACAAG AAGAAAGGGA AACAGAGCAA CATAGACGAG GAGAGAAAGA TGTTGTACGA CGTGGTACAG AGGGGCATGA AGGTGTTCAT TAATGCGACC TATGGCGTCT TCGGTGCGGA GACCTTCCCC TTGTACGCCC CAGCAGTTGC AGAGAGCGTT ACAGCCCTAG GTAGGTACGT AATCACGTCC ACCAAGGAAA TGGCTAACAA GCTTGGGCTG AAGGTTGTGT ACGGGGATAC GGACTCGCTC TTCATTCACC AGCCTGATAA GAAGAAGCTG GAGGAACTGG TGGAGTGGAC CAGGCAGAAC TTCGGGCTTG ATCTAGAGGT GGACAAAACT TACAGGTTCA TTGCCTTCTC CGGTCTTAAG AAGAACTACT TCGGTGTGTT CAAGGATTCC AAGGTTGACA TAAAGGGCAT GTTGGCAAAG AAGAGGAACA CCCCAGAGTT TCTGAAGCAG GCCTTCAATG AGGCTAAGGA GAGGCTAGCG AAGGTTCAGA ACCAGGAGGA GCTCGAAAAG GCAATTCAAG ACTTAACGGC GCAGGTTAAG GAGGTGTACA GGAAGCTTAA GATGAAGGAA TATAACTTGG ATGAGCTCGC CTTCAGGGTC ATGTTATCCA GGGACGTGAA GTCCTATGAG AAGAACACCC CACAGCACGT TAAGGCTGCG GCACAGCTGG CGGAGATGAA CGTACAAGTG ATGTCAAGGG ATATAATTAG CTTCGTAAAG GTAAAGACTA AGGAGGGAGT TAAACCTGTC CAGCTAGCTA AGCTTTCAGA GATTGATGTG GATAAATACT ATGAGAGCGT GAGAAGTACC TTCGAACAGT TATTGAAAAG CTTCAATGTG AGCTGGGATA GAATAGAGTC CACGACATCA ATCGACTCGT TCTTCAAGAC TTAG
|
Protein sequence | MSIMARQLTL ADFSGIKREE PVKQEEKTQE EERPLERPAR LRKDTVKQAQ EERKYFLLSV DYDGKMGKAV CKLYDPETGE LHVLYDSTGH KSYFLVDLEP DQIQKIPKIV KDESFVRLEK TTKIDPYTWK PINLTKIVVN DPLAVRRLRE YVPRAYEAHI KYFNNYIYDF SLIPGMPYVV KKGKLVPLKP EVDVKEVKEA FKDADQIAQE MALDWAPLFE SEIPSVKRVA IDIEVYTPMM GRVPDPVKAE YPVISVALAG SDGLKLVLVL DRGDSPIQSK DIKVEVFRTE RELLSRLFDI LKEYPMVLTF NGDDFDIPYL IFRGFKLGLL QDEIPFEISS FGRKPDAKFR YGFHIDLYRF FFNKAVRNYA FEGKYSEYNL DTVAQALLGL SKVKLDESIS DLNMSKLVEY NYRDSEITLK LTTFNNELVW KLIVLFSRIS KLGIEELTRT EISAWVKNLY YWEHRKRNWL IPLKEEILER SSGLKTAAII KGKGYKGAVV IDPPVGVYFD VVVLDFASLY PSIIRNWNLS YETVDVKECN KKRDIRDESG AKIHEVCVDR PGITAVVTGL LRDFRVKIYK KKGKQSNIDE ERKMLYDVVQ RGMKVFINAT YGVFGAETFP LYAPAVAESV TALGRYVITS TKEMANKLGL KVVYGDTDSL FIHQPDKKKL EELVEWTRQN FGLDLEVDKT YRFIAFSGLK KNYFGVFKDS KVDIKGMLAK KRNTPEFLKQ AFNEAKERLA KVQNQEELEK AIQDLTAQVK EVYRKLKMKE YNLDELAFRV MLSRDVKSYE KNTPQHVKAA AQLAEMNVQV MSRDIISFVK VKTKEGVKPV QLAKLSEIDV DKYYESVRST FEQLLKSFNV SWDRIESTTS IDSFFKT
|
| |