Gene Mbur_0095 details


Gene Information

Locus tag: Mbur_0095
Symbol:
ID: 3998864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcoides burtonii DSM 6242
Kingdom: Archaea
Replicon accession: NC_007955
Strand:
Start bp: 83208
End bp: 85439
Gene Length: 2232 bp
Protein Length: 743 aa
Translation table: 11
GC content: 45%
IMG OID: 637957936
Product: DNA topoisomerase I
Protein accession: YP_564864
Protein GI: 91772172
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01057] DNA topoisomerase I, archaeal
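
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in one stop codon which is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


print(gene_length_bp(83208, 85439))  # 2232
print(protein_length_aa(2232))       # 743
```

Both results agree with the Gene Length and Protein Length fields of this record.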


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000317173
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATCTTA TCATAGCGGA AAAGCATATT GCAGCCAAAA GGATAGCAAA GATACTGGCC 
CCGAAGGAAC CAAAACAGGT ACGTGTCAGT GGAATGGATA CGTTCGAATA TAAAAACCCT
GACGCAGAGG ACAGAATAAT CGTAATGGGA CTTAGCGGAC ATATCGTACA GCTCGATTTT
CCAAAAGCCT ATAACAACTG GCAGAAGATC GATGCAAATG AACTTATCGA TGCTGAGGTA
ATTACTACAC CTACCCATGT GAAGATAGTG GCAGCCCTCA GGAAACTAGG AAAGGAAGCT
ACGCATGTTA CCATTGCTAC TGACTACGAC CGTGAAGGAG AGCTTATTGG TGTCGAAGCA
CTGAACATTA TAAGGAAAGT GAACCCTGAT GTTGAGTTCG ATCGTGTTTT TTATAGTGCC
ATAACCAAGA CGGAGATAGA GGGAGCTTTT GCGAACCCTG CTTCAATTGA TTTCGATCTT
GCGGATGCAG GTCATTCCAG ACAGGTCATC GATCTTGTGT GGGGAGCTTC CCTTACAAGA
TACCTTTCTA TTTCAGCAGG CAGGCTGGGA AAGCTTTTCC TGTCAGTGGG AAGAGTGCAG
TCACCAACAC TTGCATTGAT AGTAGATAAG GAAAAGGAAA GGCAAGCATT TGTCTCGAAG
CCATACTGGG AAGTCAACGC GACCCTTAAG GATAACGATG ATGCACTGGT CAATGTACAG
CACAAGAACT CCAAGTTCTG GGAAAAGGAA GAAGCTGATG CTGTAATGGA AGTTATAGGA
GATAAGGCAT CCGTATCATC CGTGGATAGC TCGAAGAAGA CGGATAAGGT ACCAACTCCT
TTCAATACCA CAGAATTCAT CGGTGCTGCA AGTTCCATCG GTTTTACAGC ATCCAATGCC
ATGAGGATAG CAGAATCCCT TTACACCAAT GGTTTTATTT CATATCCAAG GACGGATAAT
ACCGTTTATC CTGCATCCAT CGACCTGAGG GCACAGATAG AGATATTCAG CAAAGGTCCT
TTTAGGGAAT ATGCCCAAAA GTTGCTTGCG AAAGATGAAC TTGTGCCTAC ACGTGGCAAA
AAAGAGACCA CGGACCACCC CCCTATCTAT CCTGCATCCC TTGCAAAGAA GAGTGAACTT
AATGAACAGG ATTGGAAGCT TTACGAGCTT GTGGTCAGAA GATTCTTCGC AACCTTTGCA
GATGAGGCCG AATGGGAGAC CATGAAGGTA AGGTTCGATA TCTCTGGTGA GGAATTCAAG
GCAAATGGTG CAAGACTTGT GCACCAGGGA TGGAGATGGT ATTACCCATA CAATGCTCCA
CAGGACAGGC TTCTTCCGGC ACTTTCTGAA GGAGATATAC TGGATGTGAC CGGCAAGGAA
ATGCTTGACA AGGAAACTCA GCCACCTGGA AGGTATGGAC AGGGACGTCT GATAAAGCTC
ATGGAAGACC TTGGTTTAGG TACAAAGGCC ACACGCCATG AGATCATCAG TAAGCTATAT
TCAAGGGCAT ACGTCCACGG AAATCCGCTT CAGCCTACCA AGACATCTTT TGCAGTTATC
GAGGCCCTTG AAAAGTATGC ACCGACCATT ACCAAGCATG ACATGACCAG TCAGCTTGAA
GAGGATATGG ATAGTATCGC TGAAGGGAAG GTTAAGGAAG ACAATGTTCT GAAAGAATCA
AGAACAATGC TGCATGAAGT ATTTACAGAG CTTGCCAATA ACAGCGAAAA TATCTCAGAA
TCCCTGCGTG CAGGCCTCCG CGAAGATAAA GTAATAGGAG AGTGTTCCGA ATGCGGTTCG
AAGCTCATGG TCAGAAGGTC GAAAAGAGGA TCCCGTTTTA TCGGTTGTAA TGGATATCCG
GATTGTAATT TTTCACTTCC ACTTCCAAAG AGCGGACAGA TCATCGTTAC TGAAAAAACA
TGTGAAGAAC ATGGAATTAA TCATGTGAAG ATAATCAATC CGGGAAAACG TCCATGGGAA
TTAGGATGTC CACAATGCAA TTTTATCGAA TGGAAGAAAA CTCAGGAAGA AGAAAAGGCC
AAGCAGCCAA AGGTGCCTAT TCCTGATAAG ATCACGGATG TTCCCGGTAT CGGAAAGGTC
ACTGCAGAAA AACTTAAGAA TGCAGAGATC AATACTATTG ATGAGTTAAG GCAAGCGAAT
GCAATTGAGC TTTCAAAGGC CACAAGCTTG CCTGCAGGCA AGATAATGAA ATGGCAGGAA
CTTGTTACAT AA
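
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before the sequence is used elsewhere. The following minimal sketch does that and computes a GC percentage; the helper names are hypothetical, and how the page rounds its reported GC content is an assumption that is not verified here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as printed on the page.
blocks = "ATGCATCTTA TCATAGCGGA"
seq = clean_sequence(blocks)
print(len(seq), gc_percent(seq))  # 20 40.0
```

Passing the complete gene sequence through the same two functions gives a value to compare against the GC content field.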
 
Protein sequence
MHLIIAEKHI AAKRIAKILA PKEPKQVRVS GMDTFEYKNP DAEDRIIVMG LSGHIVQLDF 
PKAYNNWQKI DANELIDAEV ITTPTHVKIV AALRKLGKEA THVTIATDYD REGELIGVEA
LNIIRKVNPD VEFDRVFYSA ITKTEIEGAF ANPASIDFDL ADAGHSRQVI DLVWGASLTR
YLSISAGRLG KLFLSVGRVQ SPTLALIVDK EKERQAFVSK PYWEVNATLK DNDDALVNVQ
HKNSKFWEKE EADAVMEVIG DKASVSSVDS SKKTDKVPTP FNTTEFIGAA SSIGFTASNA
MRIAESLYTN GFISYPRTDN TVYPASIDLR AQIEIFSKGP FREYAQKLLA KDELVPTRGK
KETTDHPPIY PASLAKKSEL NEQDWKLYEL VVRRFFATFA DEAEWETMKV RFDISGEEFK
ANGARLVHQG WRWYYPYNAP QDRLLPALSE GDILDVTGKE MLDKETQPPG RYGQGRLIKL
MEDLGLGTKA TRHEIISKLY SRAYVHGNPL QPTKTSFAVI EALEKYAPTI TKHDMTSQLE
EDMDSIAEGK VKEDNVLKES RTMLHEVFTE LANNSENISE SLRAGLREDK VIGECSECGS
KLMVRRSKRG SRFIGCNGYP DCNFSLPLPK SGQIIVTEKT CEEHGINHVK IINPGKRPWE
LGCPQCNFIE WKKTQEEEKA KQPKVPIPDK ITDVPGIGKV TAEKLKNAEI NTIDELRQAN
AIELSKATSL PAGKIMKWQE LVT
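
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is one way to do this in plain Python and is not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which does not matter for a CDS that begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored; a codon with characters other than
    A, C, G, T raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence on this page.
print(translate("ATGCATCTTA TCATAGCGGA AAAGCATATT"))  # MHLIIAEKHI
print(translate("CTTGTTACAT AA"))                     # LVT
```

The two fragments match the first ten and last three residues of the protein sequence shown above.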