Gene MmarC7_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmarC7_0203 
Symbol 
ID5328995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcus maripaludis C7 
KingdomArchaea 
Replicon accessionNC_009637 
Strand
Start bp214101 
End bp216545 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content35% 
IMG OID640792724 
ProductDNA topoisomerase I 
Protein accessionYP_001329424 
Protein GI150402130 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01057] DNA topoisomerase I, archaeal 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.698596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGAC TAATTATCTG TGAGAAACCA AACGTTGCTA AAAAAATTGC AGAAGCTCTT 
GGAAAACCAA AGAAAAAATC CCATAATTCT GTGCCATATT ATGAAGTGGA ACGAAGTGGG
GAAAACGTAA TTGTAGCTTC AGCAGTAGGT CACCTCTACA CGCTCCAAGA AAAAACCAAA
ACCAAGTTTG GAGACTACCC TGTTTATGAT ATAGATTGGG TTCCAGCATC CACTCTCGAT
GAAAAAAAAT ATATTCAAAA ATACATCGAT GCACTGAAAA AAGTTTCAAA AGAAGCTAAC
AAGTTTTACG TTGCAACGGA CTGGGATATA GAAGGAGAAT TAATTGGATA CCACGCATTA
AATTTCTCAT GTGGACAAAA AAACGCTCAC AGATTAAGAT TTTCAAGTTT AACAAAAAAA
GAAATTGTAA AAGCTTATGA AACTCCGAAT GAAATAGATT TTGGGCTCGT TGATGCAGGA
GACAGTAGAC ACAAAATCGA CTGGTATTTT GGTATAAACG TTTCAAGAGC ACTCATGCAG
GCCGTAAGTT CTGTTAAACG ATGGAAAACC CTCAGTACAG GTAGGGTTCA AGGTCCAGCA
CTTGCTTTTT TAGTAGATAA AGAAATGGAA ATTAAAAATT TCGTTCCGAC ACCATACTGG
GTGCTTGAAG CTCTTTTGGA TAATGAATTA ATTGCAATTC ATGAACTTGA AAAATTCTGG
GATGAAGAAC AGGCGAATGA AGCTTTTGAA AAAGTAAAAG GCCAAAAGGA AGCTACTGTT
TCAGACGTTA AAAAAACCAT GAAAAGCATT CCTCCAAATC CGCCTTTCGA TTTAGGTGCA
CTTCAAAGAG AAGCACACAA CATGTTTAGA TTTTCACCTA AAAAAACACA GGAAATTGCA
CAAAAACTCT ATGAAAAAGG ATACTGTTCA TATCCAAGAA CGAGTTCCCA AAAACTTCCT
GATGATAAAG CGTACATGGA CGAAATATTG AAAAATCTTT CAAAAAATAA AAATTACAAG
CCATACATTG AAAGAATTTT AAGCGAAAAC AGAAAACCAA TTTCAGGTAA AAAGGACGAT
CCTGCGCACC CTGCGGTACA CGCAGTAGAC GTTCCAAAAG AACACCTGCC TGATGATGAG
TTAAAATTGT ATGAACTAAT TGCAAGAAGG ACTATTGCTC TTTACTGGGA TAATACTGAA
AGGGAATATT CAAAAATTAA TCTCGATATA AACGGTGAAC CATTCAAATT AAGCGGTTCG
AGAACCGTTA AGGAAGGCTG GCACGAAATT TACTACTACA CTAAATTTGA TGAGATAGAA
CTTCCCGATT TAAAGGAAAA AGACGTAATA AATGTAAATA ATATTAATTT CGAAGCAAAA
GAGACGAAGC CCCCAAAAAG ATACACAATG GCTTCAATTA TCAAGGAACT CGAAAAAAGA
AAACTCGGAA CAAAAGCAAC AAGGGCGGAC ATTCTTGAAA AACTGACCAA AAGAGGTTAC
GTGATTGAAG ATGGTTCTTT AACGGTTACC GACCTTGGAA TCGGAGTTAC CGAAACTTTG
AGAAAGTACT GTCCCGAAAT TGTTGAAGAA ACCCTTACAA GAGATCTTGA AGATAAATTG
GAACTTATTC AGGATAAAAA AATTAAAAAA GAAGATGTCA TAACAGAAAC ACAAAACAAG
CTTACCAAAA TCCTTGGAGA ATTCCGTTTA AAAGAAAAAG AAATTGGAAA TGAACTTGTA
GATAAACTCG ATTCGACAAA CCGATCTTTA CAGATAATTG GGAAGTGTAA ATGTGGTGGT
GACCTGATTA TCATCCGTAC AAAAGGCAAA AAAAGATTTG TTGGATGCAG CAATTACCCA
GACTGTGATG TTACATTCCC ACTACCTCAA AAAGGAAGAA TTAAAGTTTT AAATGAAGTC
TGTGAAACAT GCCATAACCC AATAATTGGA CTTGATAGGG TAAAAATTTG TGTAAATCCT
GATTGTACCA CAAGAATTTC AGAAGAAGAT AAAAAAGAAA TTGAAAAAGC AGAAAAAGAA
GAAAAAATAT GTCCAAAATG TGGTGGTAAG CTTTTAATCA AAAAAGGCCC TTACGGCGTA
TTCAGGGGCT GTGAAAATTA CCCAAAATGT AAATACACGG AAAAATTAAA CGGGGAATCC
AATGAAAAAG AAATCGTTGG AAAATGTCCA AAATGTGGAA GCGACTTATT TAAAAGAAGA
GGCAGATTTG GGGAATTTAT CGGATGTAGT AATTACCCAA AATGCAGGCA CACTGAAAAA
ATAGAAAAGA AAAAAGACGA TAACGCAGAT GGAACTAAAG CAGAAAAAAA TGAAGAACCA
AAACCAAAAA CCACTAAAAA ATCAGCTCCA AAAAAGACCG AAACTAAAAA AACTACCAAA
TCGACTGCTA AAAAAACAAC TGCAAAGAAA ACCACTAAAA AATGA
 
Protein sequence
MTGLIICEKP NVAKKIAEAL GKPKKKSHNS VPYYEVERSG ENVIVASAVG HLYTLQEKTK 
TKFGDYPVYD IDWVPASTLD EKKYIQKYID ALKKVSKEAN KFYVATDWDI EGELIGYHAL
NFSCGQKNAH RLRFSSLTKK EIVKAYETPN EIDFGLVDAG DSRHKIDWYF GINVSRALMQ
AVSSVKRWKT LSTGRVQGPA LAFLVDKEME IKNFVPTPYW VLEALLDNEL IAIHELEKFW
DEEQANEAFE KVKGQKEATV SDVKKTMKSI PPNPPFDLGA LQREAHNMFR FSPKKTQEIA
QKLYEKGYCS YPRTSSQKLP DDKAYMDEIL KNLSKNKNYK PYIERILSEN RKPISGKKDD
PAHPAVHAVD VPKEHLPDDE LKLYELIARR TIALYWDNTE REYSKINLDI NGEPFKLSGS
RTVKEGWHEI YYYTKFDEIE LPDLKEKDVI NVNNINFEAK ETKPPKRYTM ASIIKELEKR
KLGTKATRAD ILEKLTKRGY VIEDGSLTVT DLGIGVTETL RKYCPEIVEE TLTRDLEDKL
ELIQDKKIKK EDVITETQNK LTKILGEFRL KEKEIGNELV DKLDSTNRSL QIIGKCKCGG
DLIIIRTKGK KRFVGCSNYP DCDVTFPLPQ KGRIKVLNEV CETCHNPIIG LDRVKICVNP
DCTTRISEED KKEIEKAEKE EKICPKCGGK LLIKKGPYGV FRGCENYPKC KYTEKLNGES
NEKEIVGKCP KCGSDLFKRR GRFGEFIGCS NYPKCRHTEK IEKKKDDNAD GTKAEKNEEP
KPKTTKKSAP KKTETKKTTK STAKKTTAKK TTKK