Gene Hhal_2326 details


Gene Information

Locus tag: Hhal_2326
Symbol:
ID: 4709283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2551145
End bp: 2553544
Gene Length: 2400 bp
Protein Length: 799 aa
Translation table: 11
GC content: 68%
IMG OID: 639856801
Product: DNA topoisomerase I
Protein accession: YP_001003891
Protein GI: 121999104
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.240337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAAAA GCCTCGTCAT CGTCGAGTCG CCTGCCAAGG CGCGCACGAT CAACAAATAC 
CTGGGTTCCG ATTACGAGGT CATGGCCTCC TACGGCCATG TCCGCGACCT CGTCCCCAAG
GAGGGGGCGG TGGATCCGTC CAGCGGCTTC GCCATGAAGT ACGCACCCAT CGACAAGAAC
CAGAAGCACG TCGATGCCAT CGCCAAGGCG GCCCGCAAGG CCGACGCCCT CTACCTGGCC
ACTGACCCGG ACCGCGAGGG TGAAGCCATC TCCTGGCACC TGGTGGAGCT GCTGCGCGAC
AAGGGCACCC TGGACGACAA ACCGGTCTAT CGGGTGGTCT TCCACGAGAT CACCAAGGGC
GCCATCCAGG AGGCGATGAA CAACCCGCGG GACATCTCCG AGGAGCTGGT CAACGCCCAG
CAGGCGCGCC GCGCCCTCGA CTACCTGGTC GGCTTCAACC TCTCGCCCCT GCTCTGGCGC
AAGATCACCA GCGGCCTCTC CGCGGGCCGG GTGCAGAGCC CCGCGCTGCG GATGATCTGC
GAGCGCGAGA CCGAGATCGA GCAGTTCGAG CCCCAGGAGT ACTGGAGCGT CGAGGCCGAT
GCCGCCAAGG CGCAGCAGCC CTTCATGGCC AAGCTCTCGC AGCTCCACGG CGAGAAGGTC
CGGCAGTTCA CCATCACCGA CGAGACCCAT GCCCAGGAGG TCGACCGCAC CCTGCGCGAG
GCCGCGCGTG CCCAACCCGA CCCCGCCCGG ATCGGCCCCA CCGGCGATGG CGAGACCGAG
GTCATCGGCA CGCTGCGGGT CGCCTCGGTG GAGCGCAAGC AACGCCGGCG CAACCCGGCA
GCGCCATTCA TCACCTCGAC CCTGCAGCAG GAGGCCTCGC GCAAGCTCGG CTTCACCGCC
AGCCGGACCA TGCGCATCGC CCAGCAGCTC TACGAGGGCA TCGACGTCGG CGAAGGCAGT
GCCGTCGGTC TGATCACCTA CATGCGAACC GACTCGGTGA ACCTCTCCGG CGAGGCGATC
ACTGAGATGC GCCAGGCCAT CACCGACCGC TACGGCGCCG ACAAGCTCCC GGGCCAGGCC
CAGGTCTACA AGACCCGCTC GAAGAACGCC CAGGAGGCCC ACGAGGCCAT CCGGCCCACC
TCGGCGTCGC GCCACCCGGA CGATGTCCGC GCCTACCTCA ACGAGGAGCA GCGCAAGCTC
TACGATCTCA TCTGGAAGCG CGCCGTCGCC TCACAGATGA AGCACGCCAC CATCCACACG
GTGGCCGTCG ATCTGGCCGC CGACGCCGAC GCCCGCCATC TGCTGCGGGC CACCGGCTCC
ACGGTGGCCG ACCCGGGCTT CATGGTCGTC TACCGCGAGG GCAACGACGA GGGCAAAGAC
GACTCCGGCG AGAAGTTCCT GCCCGAACTC GAGGAGGGCG AGCAGGTGGA CCTGCACGCC
ATCCGCGCCG AACAGCACTT CACCGAGCCG CCGCCGCGCT ACACCGAGGC GAGCCTGGTC
CGCGCCCTGG AGGAGTACGG CATCGGCCGG CCGTCGACCT ACGCCTCGAT CATCTCCACG
CTGCAGAACC GCAACTATGT GGAGATGGAC GGCAAACGCT TCATCCCCAC CGACATCGGG
CGCACAGTCA ACAAGTTCCT GACCGAGCAC TTCGATCGGT ACGTGGACTA CGACTTCACC
GCCCGACTCG AGGACGACCT AGACGCCATC TCCCGCGGCG AGCAGGACTG GGTCCCGGTC
CTGGAAGCGT TCTGGGAGCC CTTCCGGGAG CGGGTTGAGG AGAAGAAGAA CGTCTCGCGC
CAGGAGGCGG TCCAGGCGCG GGAACTGGGC ACGGACCCGA AGACCGGCAA GCCGGTGACG
GTGCGCATCG GTCGCTACGG CCCCTTCGCC CAGCTCGGCT CCCGCGACGA CGACGAGAAG
CCGCGTTTTG CCGGCCTGCG CCCGGGACAG AGCATCGACA CCATCACCCT CGACGAGGCC
CTGCAGCTGT TCAAGCTGCC GCGGGACATG GGCGAGACCG ACGAGGGCGA AGACGTCCAG
GTCAGCATCG GGCGCTTTGG CCCCTACGTG CGCTACGGCA AGAAGTTCGT CTCCATCCCC
AAGGACGAGG ACCCGTACAC CATCACCAAG GAACGGGCCC ACGAACTGGT GCGGGAGAAG
AAACAGGCCG ACGCCAACCG GATCATCCAC GACTTCGGCG ACGGCATTCA GATCCTGCGC
GGACGCTACG GGCCGTACAT CACCAACGGC GAGAAGAACG CCAAGGTGCC CAAGGACCGG
GAGCCGGACT CGCTCACCCA TGAGGAGTGC CAGGACCTGA TCGCCAAGGC GCCGGCGCGC
AAGGGGCGCC GCGGCGGGGC GGCCAAGGGT GGCCGCGGCC GCAGCAAGGC CACTAGCTGA
 
Protein sequence
MGKSLVIVES PAKARTINKY LGSDYEVMAS YGHVRDLVPK EGAVDPSSGF AMKYAPIDKN 
QKHVDAIAKA ARKADALYLA TDPDREGEAI SWHLVELLRD KGTLDDKPVY RVVFHEITKG
AIQEAMNNPR DISEELVNAQ QARRALDYLV GFNLSPLLWR KITSGLSAGR VQSPALRMIC
ERETEIEQFE PQEYWSVEAD AAKAQQPFMA KLSQLHGEKV RQFTITDETH AQEVDRTLRE
AARAQPDPAR IGPTGDGETE VIGTLRVASV ERKQRRRNPA APFITSTLQQ EASRKLGFTA
SRTMRIAQQL YEGIDVGEGS AVGLITYMRT DSVNLSGEAI TEMRQAITDR YGADKLPGQA
QVYKTRSKNA QEAHEAIRPT SASRHPDDVR AYLNEEQRKL YDLIWKRAVA SQMKHATIHT
VAVDLAADAD ARHLLRATGS TVADPGFMVV YREGNDEGKD DSGEKFLPEL EEGEQVDLHA
IRAEQHFTEP PPRYTEASLV RALEEYGIGR PSTYASIIST LQNRNYVEMD GKRFIPTDIG
RTVNKFLTEH FDRYVDYDFT ARLEDDLDAI SRGEQDWVPV LEAFWEPFRE RVEEKKNVSR
QEAVQARELG TDPKTGKPVT VRIGRYGPFA QLGSRDDDEK PRFAGLRPGQ SIDTITLDEA
LQLFKLPRDM GETDEGEDVQ VSIGRFGPYV RYGKKFVSIP KDEDPYTITK ERAHELVREK
KQADANRIIH DFGDGIQILR GRYGPYITNG EKNAKVPKDR EPDSLTHEEC QDLIAKAPAR
KGRRGGAAKG GRGRSKATS
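
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal illustration, not the database's own pipeline: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons, and translates the first 60 bases of the gene sequence. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments and differs from table 1 only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as listed in this record.
first_line = "ATGGGTAAAA GCCTCGTCAT CGTCGAGTCG CCTGCCAAGG CGCGCACGAT CAACAAATAC"
print(translate(first_line))  # MGKSLVIVESPAKARTINKY
```

The result matches the first 20 residues of the protein sequence. This gene starts with ATG, so the alternative start codons permitted by table 11 do not come into play here; a sequence starting with one of them would need its first residue set to M explicitly.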