Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1974 |
Symbol | |
ID | 5833856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2206017 |
End bp | 2207219 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641367775 |
Product | putative DNA topoisomerase I |
Protein accession | YP_001639444 |
Protein GI | 163851401 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3569] Topoisomerase IB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.169196 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTCGG AGGAGGTCGG AGCCGGCGTG GTCGATCCCC GCGAGGCCGC GCGGGATGCA GGCCTGCGCT ACGTGGACGA TTCCAAGCCC GGCCTGCGGC GCAAGCGCAA CGGCAAGGGC TTCCGCTACA TCGACCCGAA AGGCGCCCCG GTCCGCGACG CGGAGGAGAT CGCCCGGCTC AAAAGCCTCG CGATCCCGCC GGCCTACACC GAGGTGTGGA TCTGCCCGCA CCCGAACGGC CATATCCAGG CGACCGGGCG CGACGAGAAG GGGCGCAAGC AGTACCGCTA CCATCCCCGC TTCCGCGAGG CGCGGGAGGC CTCGAAGTTC CACCGCATCA TGGCCTTCGC TGAGGCGCTG CCGGGCATCC GCGCGCGGAT CGACGCCGAT ATGGGCAAGC GCGGCCTGCC GCGCGAGAAA GTGCTGGCCA CCGTGGTCCA CCTCCTGGAG ACCACGCTGA TCCGCGTCGG CAACGACGAT TACGCCCGCT CCAACAAGAG CTACGGCCTC ACCACCCTGC GCGATCCGCA TGTGAAGGTG GCCGGCTCCG AGATGCGCTT CCGCTTCAAG GGCAAGAGCG GCAAGGAATG GTCGGTCTCG GTGCGCGACC GCCGCGTGGC CAAGATCGTC AAGGCCTGCC AGGACCTGCC CGGCCAGGAG CTGTTCCAGT ATCTCGACGA GGAGGGCGAG CGGCGCGACG TCACTTCCTC GGACGTGAAC GCCTACCTGC GCGAGATCAC GGGCGAGGAT TTCACCGCCA AGGATTTCCG CACCTGGGCC GGCACGGTGC TGGCGGCCCT GGCGCTGCGG GAGTTCGAGG CGTTCGACAA CGCGGCCAAG GCCAAGAAGA ACCTGCGCGC GGCGATCGAG TCGGTGTCGT CCCGGCTCGG CAACACGCCG ACCATCTGCC GCAAGTGCTA CATCCACCCG CAGATCCTCG ACTGCTACCT CGAAGGCGGG ATGCTGCTGC AGGTGAAGGA GGCGGTCGAG GGCGAACTCA AGAACGAACT CGATGTGCTG CGCCCGGAGG AGGCGGCGGT GCTGAGCCTG CTTCGGGCCC GTCTGGAACG GGCGACGAAG GCTGCCTCCA AGGGTACGAC GAGCGAGAGC ACGACGAAGA TCGAGCCGCC GCGTCAGACC GGGGGCCGTA AGGCCAGGGC GACCGGCACG AAGCGCACGT CTGGGGGCAG ACGGGCGGCG TGA
|
Protein sequence | MLSEEVGAGV VDPREAARDA GLRYVDDSKP GLRRKRNGKG FRYIDPKGAP VRDAEEIARL KSLAIPPAYT EVWICPHPNG HIQATGRDEK GRKQYRYHPR FREAREASKF HRIMAFAEAL PGIRARIDAD MGKRGLPREK VLATVVHLLE TTLIRVGNDD YARSNKSYGL TTLRDPHVKV AGSEMRFRFK GKSGKEWSVS VRDRRVAKIV KACQDLPGQE LFQYLDEEGE RRDVTSSDVN AYLREITGED FTAKDFRTWA GTVLAALALR EFEAFDNAAK AKKNLRAAIE SVSSRLGNTP TICRKCYIHP QILDCYLEGG MLLQVKEAVE GELKNELDVL RPEEAAVLSL LRARLERATK AASKGTTSES TTKIEPPRQT GGRKARATGT KRTSGGRRAA
|
| |