Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1299 |
Symbol | |
ID | 7271159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 1330702 |
End bp | 1333518 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643569933 |
Product | DNA topoisomerase type IA central domain protein |
Protein accession | YP_002466356 |
Protein GI | 219851924 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01057] DNA topoisomerase I, archaeal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.596273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCATCTGA TAATCACTGA AAAGAACATT GCGGCTGACA GAATAGCCCA TATCCTCGCA GGAAAGACCT ACGTCGAGGT GAAGAAGGAC GGGGGGGTCA GTACCTACTC GTTCAACGAC ACGGTCGTGG TCGGTCTTCG GGGGCACGTG GTGGAGGTGG ACTTCGAGCC AGGATACACC AACTGGCGGA GCGAGGTGAA CACTCCGAGA TCCCTGATCT CGGCCAGGAC CATCAAAGCG CCGACCGACA AGAAGATCGT CACGCTGATC CAGAAACTGA CCAAAAAAGC CGACCATGTC ACGATCGCGA CTGATTTTGA TACCGAGGGG GAACTGATCG GGAAGGAGAC CTATGACCTC GTCAAGGCTG TCAAGCCGGA TGTCAGGGTC GACCGGTCCC GGTTCAGTGC GATCACCGAG GAGGAGATCC AGACTGCGTT CGCTGCACCG GCCGAGCTTG ATTTTGCTCT GGCTGCAGCC GGCGAGGCAC GGCAACTGAT CGATCTGATC TGGGGAGCCT CGCTCACCCG GTTCATCAGC CTGGCCGCCC ACCGCGGCGG CAAGAACATC CTCTCTGTCG GCAGGGTTCA GAGCCCGACC CTTGCGATGA TGGTCGACCG TGAGAAGGAG ATCGAGGCGT TCGTGCCTGA ACCCTACTGG GTGCTCACCG TCGATTCAAA CAAGGACGGG GAGCCGGTGC TCGCCCGTCA TACCACAGCA CGGTTCACTG ATGTGGCGAT AGCAGAGGAG GCGAAAGAAG CCACCCGCGC CCCACTGATG GTGACCGAGG TGAAGGAGGG ATCCAAGGTC GACCGTGCTC CGACTCCGTT CGATACCACA GGTTTCATCG TGGCCGCCGG CCGGCTCGGT TTCTCTGCCG CGAACGCGAT GCGGATCGCA GAGGATCTGT ACATGCACGG GTACATTTCG TATCCCCGTA CAGACAACAC CATCTATCCC AGGTCGCTCG ATCTGAATGG CGTCCTCAAG ACGCTGCAGA AGACCGAACT CTCGTCTGCT GTCATATGGG TTATGGCCAA TCGGAGACCA GTGCCGACCC AGGGTAAGAA GTCCACCACC GACCACCCGC CAATCCACCC AAGTGGAGCG GCGACCAGGG CCGAACTCGG CGACGAGCGG TGGAAGATCT ATGAACTGGT AGTCCGCCGG TTCCTCGCGA CTCTCTCCCC TGATGCCCGC TGGATGACGA TGAAGGTCCT CTTTACGGCT GGAAATGAGC CGTACACCAC CACCGGAGCG ACTCTGCTCG AGGCTGGATG GCGGACTGTC TACCCCTACA GCGATGCGAC CGAACACCCG CTCCCGCGCT TTGCGGTCGG CGATAGCCTG CCGATCGAGC AGGTGAACCT CGACCGGAAG GAGACCCAGC CGCCGCCCCG GTACACCCAG AGCAGGCTGA TCCAGCAGAT GGAAGAACTC GGGCTCGGAA CCAAGAGTAC CCGGCACGAG GTGATCGGGA AACTGGTCGG CCGGAAATAT ATCGAAGGGA ATCCGCTCAG GCCGACGCTG GTCGGACGGG CCGTGATCGA GTCGCTCGAA GACCATGCGG CTGCCATCAC CCGACCGGAC ATGACCCAGA CGATCGATGC ACATATGCAG CAGATCAAAG AGCGGAAGCG GACACGCGAT GATGTGGTGA CCGAATCGCG GGCCATGCTG AACCGTGCGT TCGACGAACT GGAGGAGCAC CAGAGTCAGA TCGGTGAGGA TATCATGGGG AGGACCGTCG AGGAGATGAT CCTCGGCCCC TGTCCTGTCT GTGGGAGCGA TCTCCGGATC CATCATATCC GGAACAGCAG CCAGTTCATC GGATGCACCC GGTATCCGGA CTGCCGGTTC AACATCGGAC TGCCCCTGAC CCAGTGGGGA TGGGCGATCA GGACCGATAC GGTCTGCCCA ACCCATCACC TGAACCATGT CAGGTTGATC GCCAAAGGGT CCAGGCCCTG GGATATCGGC TGCCCGCTCT GTCACCATAT CGAATCCAAT CAGGAGACGA TGGTGCTGAT CCCCTCGATG ACCGAGGAGA TACTCGGACG GTTGCAGCAG CATCACATCT ATACCGTTCA TGAAGTCGCC GATGCACCCC CTGAGGCGCT CGCCTCCGCG GCAGAGATTT CATCAACAGC GGCGGAACAC CTGAAGTCTG AAGGAGAAGC GGTCCTTGGA CTCCTCCGGC TCCGCTCAGA ACTGCGAAAA TTTGTCAGAA AGCAGGTTCC ACCCCGGCGC GGCAGGAGTC ATGCCAAGAT CATGAAGCAT CTCCATGCGA ACGGTATCAA TACGATCGCC GACCTCGCAA AGGCGGACCC GACCCTGCTT CGGACAGCAG GGGTCGGGGA GAAGGAGGTG ACATCCCTCC TTATGCAGGC GAAGGAGTAC TGCAACGACA AGACGTTGCG TGCCATCGGA GTGCCGGCGA TCAGCCTCAA AAAGTACTAT GCCGCAGGAA TCCAGAGTCC CGAGGATTTC TGCAGGTATC ATCCGGTCTA TCTGAGTGTC AAGACCGGAA TCAGTCCGGA CACCACGTTC CGGCATGCAG AGATGGTCTG CATCGCCCAG AACAGACCGG TGCCCCGCAA AGTGACCAGG GCAATGCTCG AACGGGGGCG TGCCGAACTA TTGACGATTC CCGGGCTTGG GGAGACGACG ATCGAGAAGC TGTACAGCGG CGGCGTGATC GACGGCATGA CCCTTGCCTC TGCAGATCCT GCGGCGCTGG CCAGCCATTC CGGGATCCCG CTCAAGAAGG TACAGGAATT TCAATCGCGG CTCCCTGGTT CCTCTCAGGC CAGTTGA
|
Protein sequence | MHLIITEKNI AADRIAHILA GKTYVEVKKD GGVSTYSFND TVVVGLRGHV VEVDFEPGYT NWRSEVNTPR SLISARTIKA PTDKKIVTLI QKLTKKADHV TIATDFDTEG ELIGKETYDL VKAVKPDVRV DRSRFSAITE EEIQTAFAAP AELDFALAAA GEARQLIDLI WGASLTRFIS LAAHRGGKNI LSVGRVQSPT LAMMVDREKE IEAFVPEPYW VLTVDSNKDG EPVLARHTTA RFTDVAIAEE AKEATRAPLM VTEVKEGSKV DRAPTPFDTT GFIVAAGRLG FSAANAMRIA EDLYMHGYIS YPRTDNTIYP RSLDLNGVLK TLQKTELSSA VIWVMANRRP VPTQGKKSTT DHPPIHPSGA ATRAELGDER WKIYELVVRR FLATLSPDAR WMTMKVLFTA GNEPYTTTGA TLLEAGWRTV YPYSDATEHP LPRFAVGDSL PIEQVNLDRK ETQPPPRYTQ SRLIQQMEEL GLGTKSTRHE VIGKLVGRKY IEGNPLRPTL VGRAVIESLE DHAAAITRPD MTQTIDAHMQ QIKERKRTRD DVVTESRAML NRAFDELEEH QSQIGEDIMG RTVEEMILGP CPVCGSDLRI HHIRNSSQFI GCTRYPDCRF NIGLPLTQWG WAIRTDTVCP THHLNHVRLI AKGSRPWDIG CPLCHHIESN QETMVLIPSM TEEILGRLQQ HHIYTVHEVA DAPPEALASA AEISSTAAEH LKSEGEAVLG LLRLRSELRK FVRKQVPPRR GRSHAKIMKH LHANGINTIA DLAKADPTLL RTAGVGEKEV TSLLMQAKEY CNDKTLRAIG VPAISLKKYY AAGIQSPEDF CRYHPVYLSV KTGISPDTTF RHAEMVCIAQ NRPVPRKVTR AMLERGRAEL LTIPGLGETT IEKLYSGGVI DGMTLASADP AALASHSGIP LKKVQEFQSR LPGSSQAS
|
| |