Gene Mext_1974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1974 
Symbol 
ID5833856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2206017 
End bp2207219 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content69% 
IMG OID641367775 
Productputative DNA topoisomerase I 
Protein accessionYP_001639444 
Protein GI163851401 
COG category[L] Replication, recombination and repair 
COG ID[COG3569] Topoisomerase IB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.169196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTCGG AGGAGGTCGG AGCCGGCGTG GTCGATCCCC GCGAGGCCGC GCGGGATGCA 
GGCCTGCGCT ACGTGGACGA TTCCAAGCCC GGCCTGCGGC GCAAGCGCAA CGGCAAGGGC
TTCCGCTACA TCGACCCGAA AGGCGCCCCG GTCCGCGACG CGGAGGAGAT CGCCCGGCTC
AAAAGCCTCG CGATCCCGCC GGCCTACACC GAGGTGTGGA TCTGCCCGCA CCCGAACGGC
CATATCCAGG CGACCGGGCG CGACGAGAAG GGGCGCAAGC AGTACCGCTA CCATCCCCGC
TTCCGCGAGG CGCGGGAGGC CTCGAAGTTC CACCGCATCA TGGCCTTCGC TGAGGCGCTG
CCGGGCATCC GCGCGCGGAT CGACGCCGAT ATGGGCAAGC GCGGCCTGCC GCGCGAGAAA
GTGCTGGCCA CCGTGGTCCA CCTCCTGGAG ACCACGCTGA TCCGCGTCGG CAACGACGAT
TACGCCCGCT CCAACAAGAG CTACGGCCTC ACCACCCTGC GCGATCCGCA TGTGAAGGTG
GCCGGCTCCG AGATGCGCTT CCGCTTCAAG GGCAAGAGCG GCAAGGAATG GTCGGTCTCG
GTGCGCGACC GCCGCGTGGC CAAGATCGTC AAGGCCTGCC AGGACCTGCC CGGCCAGGAG
CTGTTCCAGT ATCTCGACGA GGAGGGCGAG CGGCGCGACG TCACTTCCTC GGACGTGAAC
GCCTACCTGC GCGAGATCAC GGGCGAGGAT TTCACCGCCA AGGATTTCCG CACCTGGGCC
GGCACGGTGC TGGCGGCCCT GGCGCTGCGG GAGTTCGAGG CGTTCGACAA CGCGGCCAAG
GCCAAGAAGA ACCTGCGCGC GGCGATCGAG TCGGTGTCGT CCCGGCTCGG CAACACGCCG
ACCATCTGCC GCAAGTGCTA CATCCACCCG CAGATCCTCG ACTGCTACCT CGAAGGCGGG
ATGCTGCTGC AGGTGAAGGA GGCGGTCGAG GGCGAACTCA AGAACGAACT CGATGTGCTG
CGCCCGGAGG AGGCGGCGGT GCTGAGCCTG CTTCGGGCCC GTCTGGAACG GGCGACGAAG
GCTGCCTCCA AGGGTACGAC GAGCGAGAGC ACGACGAAGA TCGAGCCGCC GCGTCAGACC
GGGGGCCGTA AGGCCAGGGC GACCGGCACG AAGCGCACGT CTGGGGGCAG ACGGGCGGCG
TGA
 
Protein sequence
MLSEEVGAGV VDPREAARDA GLRYVDDSKP GLRRKRNGKG FRYIDPKGAP VRDAEEIARL 
KSLAIPPAYT EVWICPHPNG HIQATGRDEK GRKQYRYHPR FREAREASKF HRIMAFAEAL
PGIRARIDAD MGKRGLPREK VLATVVHLLE TTLIRVGNDD YARSNKSYGL TTLRDPHVKV
AGSEMRFRFK GKSGKEWSVS VRDRRVAKIV KACQDLPGQE LFQYLDEEGE RRDVTSSDVN
AYLREITGED FTAKDFRTWA GTVLAALALR EFEAFDNAAK AKKNLRAAIE SVSSRLGNTP
TICRKCYIHP QILDCYLEGG MLLQVKEAVE GELKNELDVL RPEEAAVLSL LRARLERATK
AASKGTTSES TTKIEPPRQT GGRKARATGT KRTSGGRRAA