Gene Information |
Locus tag | Mbur_0095 |
Symbol | |
ID | 3998864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 83208 |
End bp | 85439 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637957936 |
Product | DNA topoisomerase I |
Protein accession | YP_564864 |
Protein GI | 91772172 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01057] DNA topoisomerase I, archaeal |
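The coordinate and length fields in this table can be cross-checked against each other. The sketch below is not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 83208
END_BP = 85439
STATED_GENE_LENGTH_BP = 2232
STATED_PROTEIN_LENGTH_AA = 743


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    length_aa = protein_length(length_bp)
    print(length_bp, length_bp == STATED_GENE_LENGTH_BP)    # 2232 True
    print(length_aa, length_aa == STATED_PROTEIN_LENGTH_AA)  # 743 True
```

Under these assumptions the stated gene length (2232 bp) and protein length (743 aa) both agree with the coordinates.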
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000317173 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCTTA TCATAGCGGA AAAGCATATT GCAGCCAAAA GGATAGCAAA GATACTGGCC CCGAAGGAAC CAAAACAGGT ACGTGTCAGT GGAATGGATA CGTTCGAATA TAAAAACCCT GACGCAGAGG ACAGAATAAT CGTAATGGGA CTTAGCGGAC ATATCGTACA GCTCGATTTT CCAAAAGCCT ATAACAACTG GCAGAAGATC GATGCAAATG AACTTATCGA TGCTGAGGTA ATTACTACAC CTACCCATGT GAAGATAGTG GCAGCCCTCA GGAAACTAGG AAAGGAAGCT ACGCATGTTA CCATTGCTAC TGACTACGAC CGTGAAGGAG AGCTTATTGG TGTCGAAGCA CTGAACATTA TAAGGAAAGT GAACCCTGAT GTTGAGTTCG ATCGTGTTTT TTATAGTGCC ATAACCAAGA CGGAGATAGA GGGAGCTTTT GCGAACCCTG CTTCAATTGA TTTCGATCTT GCGGATGCAG GTCATTCCAG ACAGGTCATC GATCTTGTGT GGGGAGCTTC CCTTACAAGA TACCTTTCTA TTTCAGCAGG CAGGCTGGGA AAGCTTTTCC TGTCAGTGGG AAGAGTGCAG TCACCAACAC TTGCATTGAT AGTAGATAAG GAAAAGGAAA GGCAAGCATT TGTCTCGAAG CCATACTGGG AAGTCAACGC GACCCTTAAG GATAACGATG ATGCACTGGT CAATGTACAG CACAAGAACT CCAAGTTCTG GGAAAAGGAA GAAGCTGATG CTGTAATGGA AGTTATAGGA GATAAGGCAT CCGTATCATC CGTGGATAGC TCGAAGAAGA CGGATAAGGT ACCAACTCCT TTCAATACCA CAGAATTCAT CGGTGCTGCA AGTTCCATCG GTTTTACAGC ATCCAATGCC ATGAGGATAG CAGAATCCCT TTACACCAAT GGTTTTATTT CATATCCAAG GACGGATAAT ACCGTTTATC CTGCATCCAT CGACCTGAGG GCACAGATAG AGATATTCAG CAAAGGTCCT TTTAGGGAAT ATGCCCAAAA GTTGCTTGCG AAAGATGAAC TTGTGCCTAC ACGTGGCAAA AAAGAGACCA CGGACCACCC CCCTATCTAT CCTGCATCCC TTGCAAAGAA GAGTGAACTT AATGAACAGG ATTGGAAGCT TTACGAGCTT GTGGTCAGAA GATTCTTCGC AACCTTTGCA GATGAGGCCG AATGGGAGAC CATGAAGGTA AGGTTCGATA TCTCTGGTGA GGAATTCAAG GCAAATGGTG CAAGACTTGT GCACCAGGGA TGGAGATGGT ATTACCCATA CAATGCTCCA CAGGACAGGC TTCTTCCGGC ACTTTCTGAA GGAGATATAC TGGATGTGAC CGGCAAGGAA ATGCTTGACA AGGAAACTCA GCCACCTGGA AGGTATGGAC AGGGACGTCT GATAAAGCTC ATGGAAGACC TTGGTTTAGG TACAAAGGCC ACACGCCATG AGATCATCAG TAAGCTATAT TCAAGGGCAT ACGTCCACGG AAATCCGCTT CAGCCTACCA AGACATCTTT TGCAGTTATC GAGGCCCTTG AAAAGTATGC ACCGACCATT ACCAAGCATG ACATGACCAG TCAGCTTGAA GAGGATATGG ATAGTATCGC TGAAGGGAAG GTTAAGGAAG ACAATGTTCT GAAAGAATCA AGAACAATGC TGCATGAAGT ATTTACAGAG CTTGCCAATA ACAGCGAAAA TATCTCAGAA TCCCTGCGTG CAGGCCTCCG CGAAGATAAA GTAATAGGAG AGTGTTCCGA ATGCGGTTCG 
AAGCTCATGG TCAGAAGGTC GAAAAGAGGA TCCCGTTTTA TCGGTTGTAA TGGATATCCG GATTGTAATT TTTCACTTCC ACTTCCAAAG AGCGGACAGA TCATCGTTAC TGAAAAAACA TGTGAAGAAC ATGGAATTAA TCATGTGAAG ATAATCAATC CGGGAAAACG TCCATGGGAA TTAGGATGTC CACAATGCAA TTTTATCGAA TGGAAGAAAA CTCAGGAAGA AGAAAAGGCC AAGCAGCCAA AGGTGCCTAT TCCTGATAAG ATCACGGATG TTCCCGGTAT CGGAAAGGTC ACTGCAGAAA AACTTAAGAA TGCAGAGATC AATACTATTG ATGAGTTAAG GCAAGCGAAT GCAATTGAGC TTTCAAAGGC CACAAGCTTG CCTGCAGGCA AGATAATGAA ATGGCAGGAA CTTGTTACAT AA
|
Protein sequence | MHLIIAEKHI AAKRIAKILA PKEPKQVRVS GMDTFEYKNP DAEDRIIVMG LSGHIVQLDF PKAYNNWQKI DANELIDAEV ITTPTHVKIV AALRKLGKEA THVTIATDYD REGELIGVEA LNIIRKVNPD VEFDRVFYSA ITKTEIEGAF ANPASIDFDL ADAGHSRQVI DLVWGASLTR YLSISAGRLG KLFLSVGRVQ SPTLALIVDK EKERQAFVSK PYWEVNATLK DNDDALVNVQ HKNSKFWEKE EADAVMEVIG DKASVSSVDS SKKTDKVPTP FNTTEFIGAA SSIGFTASNA MRIAESLYTN GFISYPRTDN TVYPASIDLR AQIEIFSKGP FREYAQKLLA KDELVPTRGK KETTDHPPIY PASLAKKSEL NEQDWKLYEL VVRRFFATFA DEAEWETMKV RFDISGEEFK ANGARLVHQG WRWYYPYNAP QDRLLPALSE GDILDVTGKE MLDKETQPPG RYGQGRLIKL MEDLGLGTKA TRHEIISKLY SRAYVHGNPL QPTKTSFAVI EALEKYAPTI TKHDMTSQLE EDMDSIAEGK VKEDNVLKES RTMLHEVFTE LANNSENISE SLRAGLREDK VIGECSECGS KLMVRRSKRG SRFIGCNGYP DCNFSLPLPK SGQIIVTEKT CEEHGINHVK IINPGKRPWE LGCPQCNFIE WKKTQEEEKA KQPKVPIPDK ITDVPGIGKV TAEKLKNAEI NTIDELRQAN AIELSKATSL PAGKIMKWQE LVT
|
| |
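The gene and protein sequences above can be checked against each other by translating the nucleotide sequence, and the GC content field can be recomputed the same way. The sketch below is illustrative and not part of the source record. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the gene here starts with ATG, so alternative start codons do not matter). The helper names `clean`, `translate` and `gc_percent` are hypothetical. Only the first 30 and last 12 bases of the gene are embedded; the full sequence can be pasted in the same spaced format.

```python
# Codon table in NCBI's compact form: first, second and third base each
# run over T, C, A, G, and the string lists the amino acids in that order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    first_30_bases = "ATGCATCTTA TCATAGCGGA AAAGCATATT"
    last_12_bases = "CTTGTTACAT AA"
    print(translate(first_30_bases))  # MHLIIAEKHI
    print(translate(last_12_bases))   # LVT
```

The two outputs match the first ten and last three residues of the protein sequence in the record. Calling `gc_percent` on the full gene sequence is one way to check the reported GC content.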