Gene Information |
Locus tag | Hhal_2326 |
Symbol | |
ID | 4709283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2551145 |
End bp | 2553544 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639856801 |
Product | DNA topoisomerase I |
Protein accession | YP_001003891 |
Protein GI | 121999104 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
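The length fields above follow from the coordinates. Below is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed 2400 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2551145, 2553544)
print(length, protein_length_aa(length))  # prints: 2400 799
```

Both values agree with the Gene Length and Protein Length rows of the record.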
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.240337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAAAA GCCTCGTCAT CGTCGAGTCG CCTGCCAAGG CGCGCACGAT CAACAAATAC CTGGGTTCCG ATTACGAGGT CATGGCCTCC TACGGCCATG TCCGCGACCT CGTCCCCAAG GAGGGGGCGG TGGATCCGTC CAGCGGCTTC GCCATGAAGT ACGCACCCAT CGACAAGAAC CAGAAGCACG TCGATGCCAT CGCCAAGGCG GCCCGCAAGG CCGACGCCCT CTACCTGGCC ACTGACCCGG ACCGCGAGGG TGAAGCCATC TCCTGGCACC TGGTGGAGCT GCTGCGCGAC AAGGGCACCC TGGACGACAA ACCGGTCTAT CGGGTGGTCT TCCACGAGAT CACCAAGGGC GCCATCCAGG AGGCGATGAA CAACCCGCGG GACATCTCCG AGGAGCTGGT CAACGCCCAG CAGGCGCGCC GCGCCCTCGA CTACCTGGTC GGCTTCAACC TCTCGCCCCT GCTCTGGCGC AAGATCACCA GCGGCCTCTC CGCGGGCCGG GTGCAGAGCC CCGCGCTGCG GATGATCTGC GAGCGCGAGA CCGAGATCGA GCAGTTCGAG CCCCAGGAGT ACTGGAGCGT CGAGGCCGAT GCCGCCAAGG CGCAGCAGCC CTTCATGGCC AAGCTCTCGC AGCTCCACGG CGAGAAGGTC CGGCAGTTCA CCATCACCGA CGAGACCCAT GCCCAGGAGG TCGACCGCAC CCTGCGCGAG GCCGCGCGTG CCCAACCCGA CCCCGCCCGG ATCGGCCCCA CCGGCGATGG CGAGACCGAG GTCATCGGCA CGCTGCGGGT CGCCTCGGTG GAGCGCAAGC AACGCCGGCG CAACCCGGCA GCGCCATTCA TCACCTCGAC CCTGCAGCAG GAGGCCTCGC GCAAGCTCGG CTTCACCGCC AGCCGGACCA TGCGCATCGC CCAGCAGCTC TACGAGGGCA TCGACGTCGG CGAAGGCAGT GCCGTCGGTC TGATCACCTA CATGCGAACC GACTCGGTGA ACCTCTCCGG CGAGGCGATC ACTGAGATGC GCCAGGCCAT CACCGACCGC TACGGCGCCG ACAAGCTCCC GGGCCAGGCC CAGGTCTACA AGACCCGCTC GAAGAACGCC CAGGAGGCCC ACGAGGCCAT CCGGCCCACC TCGGCGTCGC GCCACCCGGA CGATGTCCGC GCCTACCTCA ACGAGGAGCA GCGCAAGCTC TACGATCTCA TCTGGAAGCG CGCCGTCGCC TCACAGATGA AGCACGCCAC CATCCACACG GTGGCCGTCG ATCTGGCCGC CGACGCCGAC GCCCGCCATC TGCTGCGGGC CACCGGCTCC ACGGTGGCCG ACCCGGGCTT CATGGTCGTC TACCGCGAGG GCAACGACGA GGGCAAAGAC GACTCCGGCG AGAAGTTCCT GCCCGAACTC GAGGAGGGCG AGCAGGTGGA CCTGCACGCC ATCCGCGCCG AACAGCACTT CACCGAGCCG CCGCCGCGCT ACACCGAGGC GAGCCTGGTC CGCGCCCTGG AGGAGTACGG CATCGGCCGG CCGTCGACCT ACGCCTCGAT CATCTCCACG CTGCAGAACC GCAACTATGT GGAGATGGAC GGCAAACGCT TCATCCCCAC CGACATCGGG CGCACAGTCA ACAAGTTCCT GACCGAGCAC TTCGATCGGT ACGTGGACTA CGACTTCACC GCCCGACTCG AGGACGACCT AGACGCCATC TCCCGCGGCG AGCAGGACTG GGTCCCGGTC CTGGAAGCGT TCTGGGAGCC CTTCCGGGAG CGGGTTGAGG AGAAGAAGAA CGTCTCGCGC 
CAGGAGGCGG TCCAGGCGCG GGAACTGGGC ACGGACCCGA AGACCGGCAA GCCGGTGACG GTGCGCATCG GTCGCTACGG CCCCTTCGCC CAGCTCGGCT CCCGCGACGA CGACGAGAAG CCGCGTTTTG CCGGCCTGCG CCCGGGACAG AGCATCGACA CCATCACCCT CGACGAGGCC CTGCAGCTGT TCAAGCTGCC GCGGGACATG GGCGAGACCG ACGAGGGCGA AGACGTCCAG GTCAGCATCG GGCGCTTTGG CCCCTACGTG CGCTACGGCA AGAAGTTCGT CTCCATCCCC AAGGACGAGG ACCCGTACAC CATCACCAAG GAACGGGCCC ACGAACTGGT GCGGGAGAAG AAACAGGCCG ACGCCAACCG GATCATCCAC GACTTCGGCG ACGGCATTCA GATCCTGCGC GGACGCTACG GGCCGTACAT CACCAACGGC GAGAAGAACG CCAAGGTGCC CAAGGACCGG GAGCCGGACT CGCTCACCCA TGAGGAGTGC CAGGACCTGA TCGCCAAGGC GCCGGCGCGC AAGGGGCGCC GCGGCGGGGC GGCCAAGGGT GGCCGCGGCC GCAGCAAGGC CACTAGCTGA
|
Protein sequence | MGKSLVIVES PAKARTINKY LGSDYEVMAS YGHVRDLVPK EGAVDPSSGF AMKYAPIDKN QKHVDAIAKA ARKADALYLA TDPDREGEAI SWHLVELLRD KGTLDDKPVY RVVFHEITKG AIQEAMNNPR DISEELVNAQ QARRALDYLV GFNLSPLLWR KITSGLSAGR VQSPALRMIC ERETEIEQFE PQEYWSVEAD AAKAQQPFMA KLSQLHGEKV RQFTITDETH AQEVDRTLRE AARAQPDPAR IGPTGDGETE VIGTLRVASV ERKQRRRNPA APFITSTLQQ EASRKLGFTA SRTMRIAQQL YEGIDVGEGS AVGLITYMRT DSVNLSGEAI TEMRQAITDR YGADKLPGQA QVYKTRSKNA QEAHEAIRPT SASRHPDDVR AYLNEEQRKL YDLIWKRAVA SQMKHATIHT VAVDLAADAD ARHLLRATGS TVADPGFMVV YREGNDEGKD DSGEKFLPEL EEGEQVDLHA IRAEQHFTEP PPRYTEASLV RALEEYGIGR PSTYASIIST LQNRNYVEMD GKRFIPTDIG RTVNKFLTEH FDRYVDYDFT ARLEDDLDAI SRGEQDWVPV LEAFWEPFRE RVEEKKNVSR QEAVQARELG TDPKTGKPVT VRIGRYGPFA QLGSRDDDEK PRFAGLRPGQ SIDTITLDEA LQLFKLPRDM GETDEGEDVQ VSIGRFGPYV RYGKKFVSIP KDEDPYTITK ERAHELVREK KQADANRIIH DFGDGIQILR GRYGPYITNG EKNAKVPKDR EPDSLTHEEC QDLIAKAPAR KGRRGGAAKG GRGRSKATS
|
| |
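The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. Below is a minimal sketch, with hypothetical helper names, that strips the display whitespace and computes a GC fraction. The sample string holds only the first two blocks of the gene sequence, so its GC fraction is not expected to match the 68% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove display whitespace (block spaces, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two display blocks of the gene sequence, used only as a small sample.
sample = "ATGGGTAAAA GCCTCGTCAT"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))  # prints: 20 0.45
```

Pasting the full Gene sequence field into `clean_sequence` should yield a 2400-character string if the field is complete.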