Gene MCA2839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2839 
SymboltopA 
ID3104042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp3029974 
End bp3032280 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content62% 
IMG OID637171968 
ProductDNA topoisomerase I 
Protein accessionYP_115233 
Protein GI53803091 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAGC ATCTCGTCAT CGTCGAATCG CCGGCCAAGG CCCGGACGAT CGAAAAATAT 
CTCGGCAAGT CTTTCCAGGT CTATGCCTCC TACGGGCATG TGAGGGACTT GATTCCCAAG
GAAGGCGCGG TCGACCCCGA CCACGACTTT TCGATGAAGT ACGCAATCAT CGAGAAGAAC
AAGAAACACG TGCAGGCGAT TTCCAAGGCC ATGGAAAAAG CCGATGCGCT GTATCTCGCG
ACCGACCCCG ACCGCGAGGG CGAGGCGATT TCCTGGCATC TCTATGAATT GCTGAAGGAA
CAGCAGGTTC TCGACAGCAA GCCGGTCCAT CGCGTGGTGT TCCACGAAAT CACCAAGCGC
GCCATCACCG AAGCCGTCGA AAACCCCAAA ACGCTTTCGA CCGATCTCAT CCACGCCCAG
CAGGCCCGGC GGGCGCTCGA TTACCTGGTC GGTTTCAAGC TTTCACCGCT GTTGTGGAAG
AAGATCCGGC GCGGATTGTC TGCGGGGCGC GTGCAAAGCC CGGCGTTGCG GATGATCGTG
GAGCGCGAAC TCGAAATCGA GCGCTTCGAG GTCCAGGAGT ACTGGACCAT CGAGGCGGCG
ATCCAGACCG AAGACCAGGC ACTTTCGGCC CGACTGACCC ATCTGGCCGG CGAGAAGCTG
GAGCAGTTCA GCATCGTCAA CGAGGCGCAG GCGCAGGCCA CGCGGCAGAG TCTGCTGGAC
CAGGCCCGCG GCAAATTGCA GGTGGTCCGG GTCGAGGAGA GGGAACGCAA GCGCAATCCT
GCCGCGCCTT TCACGACCTC GACCCTGCAG CAGGAAGCGG CCAGGAAGCT CGGCTTCACC
ACCCGCCGGA CCATGACCGT GGCCCAGCAG CTTTATGAGG GTATCGACCT GGGCGGCGAG
GCGGTGGGGC TCATCACCTA CATGCGTACC GATTCGGTCA ATCTCGCTCA GGAGGCCGTC
CAGGAATTGC GCGCGCTGAT CGAGTCGCGC TATGGCAAGG ACAATTTGCC CGCGCAGCCG
CGGCTTTACA AAACCAAGAG CAAAAATGCC CAGGAAGCAC ACGAGGCGAT CCGTCCGACT
TCGGCGTTCC GGACGCCGGA GTCGGTCAAG GCGCATCTGA CGCCGGATCA GTTCAAGCTC
TACAGCCTGA TCTGGAAGCG CAGCGTGGCC TGTCAGATGG TCCATGCCAC CTTGAACACC
GTCACCGTGG ACTTCGCCTG CGGCAGCGCC GACAACCTGT TCCGCGCCAC CGGTTCGACG
GTGATCCATC CGGGCTTCAT GTCGGTGTAC CGGGAAGGAC GGGACGACAT GCCGGAGGAA
AGCGACGAAA TCTATCTGCC CAAGCTGGTG GAAGGGCAGG AAGTCGACCT GAAGGACGTG
ACTCCTTCGC AGCATTTCAC CGAGCCGCCG CCCCGCTACA CCGAGGCCAG TCTGGTCAAG
GCGCTGGAGG AATACGGCAT CGGCCGGCCG TCGACCTACG CGACCATCAT TTCGACGCTG
CAGCAGCGGC ATTACGTCGA GCTGGAAAAC AAACGCTTCC GTCCCACCGA TCTGGGGCGG
GTGGTGAACA AGTTCCTGAC GGAGCATTTC AACCGCTACG TGGACTACAA CTTCACCGCC
AACCTCGAAG ACGACCTGGA TGCGGTGTCG CGGGGTGAGA AGGACTGGAT TCCGCTGATG
CGGGAATTCT GGGGACCCTT CCATGCCCTG ATCGGCGAGA AGGATGAAAG CCTGAAACGG
GCCGACGTGA CGCACGAGGC GATCGACGAG AAATGCCCGG AATGCGGAAG CCCGCTTTCG
ATCCGGCTGG GGCGCAACGG CCGGTTCGTC GGCTGCACCA ACTATCCGCA ATGCAAATAC
ACGCGGAATC TGGCCGGCAA AGAGTCGGAG CAAACCGAGC CCGAGGTCGT GGAGGGAAGG
CAGTGTCCCA AATGCCATTC GCCGCTCGTC ATCAAGACCG GGCGCTATGG CCGCTTCATC
GGCTGCAGCG GCTACCCCGC CTGCCGCCAC ATCGAACCGC TGGAGAAACC CACCGACACC
GGCGTGCCCT GCCCCGAATG CGGACAGGGG ACCCTGACCA AGCGCAAGTC CCGCTTCGGC
AAGCTGTTTT ACTCCTGTTC GACTTATCCC AAATGCAGCT ATGCGGTGTG GAATCCGCCG
ATCGCCGAGG CCTGCCCCGC CTGTCAGTGG CCCGTGCTCA CACTGAAGAC CACCAAGCGC
CGCGGCACCG AGAAAGTCTG CCCGCGCAAG GAGTGCGGTT ATGCGGCGCC CTACGAGGGT
GAGCCGTTGA CCGATTTCGC TGCCTGA
 
Protein sequence
MAQHLVIVES PAKARTIEKY LGKSFQVYAS YGHVRDLIPK EGAVDPDHDF SMKYAIIEKN 
KKHVQAISKA MEKADALYLA TDPDREGEAI SWHLYELLKE QQVLDSKPVH RVVFHEITKR
AITEAVENPK TLSTDLIHAQ QARRALDYLV GFKLSPLLWK KIRRGLSAGR VQSPALRMIV
ERELEIERFE VQEYWTIEAA IQTEDQALSA RLTHLAGEKL EQFSIVNEAQ AQATRQSLLD
QARGKLQVVR VEERERKRNP AAPFTTSTLQ QEAARKLGFT TRRTMTVAQQ LYEGIDLGGE
AVGLITYMRT DSVNLAQEAV QELRALIESR YGKDNLPAQP RLYKTKSKNA QEAHEAIRPT
SAFRTPESVK AHLTPDQFKL YSLIWKRSVA CQMVHATLNT VTVDFACGSA DNLFRATGST
VIHPGFMSVY REGRDDMPEE SDEIYLPKLV EGQEVDLKDV TPSQHFTEPP PRYTEASLVK
ALEEYGIGRP STYATIISTL QQRHYVELEN KRFRPTDLGR VVNKFLTEHF NRYVDYNFTA
NLEDDLDAVS RGEKDWIPLM REFWGPFHAL IGEKDESLKR ADVTHEAIDE KCPECGSPLS
IRLGRNGRFV GCTNYPQCKY TRNLAGKESE QTEPEVVEGR QCPKCHSPLV IKTGRYGRFI
GCSGYPACRH IEPLEKPTDT GVPCPECGQG TLTKRKSRFG KLFYSCSTYP KCSYAVWNPP
IAEACPACQW PVLTLKTTKR RGTEKVCPRK ECGYAAPYEG EPLTDFAA