Gene Hhal_1982 details

Gene Information

Locus tag: Hhal_1982
Symbol:
ID: 4710449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2183639
End bp: 2185891
Gene Length: 2253 bp
Protein Length: 750 aa
Translation table: 11
GC content: 70%
IMG OID: 639856455
Product: DNA topoisomerase IV subunit A
Protein accession: YP_001003548
Protein GI: 121998761
COG category: [L] Replication, recombination and repair
COG ID: [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit
TIGRFAM ID: [TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial
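
The coordinate and length fields in this record can be cross-checked against one another. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 2183639
end_bp = 2185891
gene_length_bp = 2253
protein_length_aa = 750

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match the stated gene length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match the protein length
```

Under those assumptions all three checks agree with the values listed in the record.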


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0348571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGAAA ACGCTGCCTC CGACTTCGAG ACGCAACCGC TCCGCGAGTT CACCGAGCGG 
GCCTACCTCG ACTACTCGAT GTACGTCATC CTCGACCGTG CGCTGCCCCA CGTCTCCGAC
GGGCTCAAGC CGGTGCAGCG GCGGATCGTC TACGCCATGT CCGAACTCGG GTTGTCGGCG
GCGGCGAAGT ACAAGAAATC GGCCCGTACC GTCGGCGACG TGCTCGGCAA GTACCACCCG
CACGGCGACG CGGCCTGCTA CGAGGCCATG GTCCACATGG CCCAGCCGTT CACCTATCGC
TATCCGTTGA TCGACGGCCA GGGCAACTGG GGCTCTCCGG ACGATCCGAA ATCCTTCGCC
GCCATGCGCT ACACCGAGGC GCGCCTGACC CCCTACGCCC GCGTGCTGCT CAACGAGTTG
GGGCAGGGGA CAGTGGAGTG GATTCCCAAC TTCGACGGCG CCCTGCAGGA GCCGGAACGC
CTGCCGGCGC GGCTGCCGAA CGTGTTGCTC AACGGCGGTT CGGGGATCGC CGTGGGCATG
GCCACGGATA TCCCGCCCCA CAACCTGCGC GAGGTGGTCG ATGCCTGCGT CCACCTCCTC
GATCGGCCCG AGGCCACCAC GGCGGAGCTC TGTGAGCACG TGCCGGCGCC GGACTTCCCC
ACCGCCTCGG ACGTGATCAC GCCGCGCGAG GAGCTGCGCA CCATCTACGA GCGGGGCCAC
GGTACCGTCC GCGCGCGGGC CCGCTGGGAG TATGAGCGGG AGCAGGCGCA GATCGTCATC
CACGCACTCC CTTACCAGGT GGCCGGCGCC AAGGTCATGG AGCAGATCGC CGCCCAGATC
CAGTCGCGCA AGCTGCCCAT GGTCGAGGAT CTGCGCGATG AGTCGGACCA CGACGCGCCG
GTGCGCATCG TCGTCCAGCT CCGCTCCCGG CGCCAGGATC CGGAGCGGGT CATGGATCAC
CTGTTCGCCA CCACGGATCT CGAGCGCGGC TACCGCGTCC ACCTGAACGT CATCGGCCTC
GACGGTCGGC CGCGGGTCTT CTCCCTGCGC GACCTTCTGG CCGAGTGGCT GAGCTTCCGC
ACCGAGACGG TCCGCCGTCG CCTGACTTGG CGCTTGGAGA AGGTGGAGGA TCGGCTCCAT
ATCCTCGAGG GCCTGCTCAC CGCGTATCTG AACATCGATG AGGTCATCGC CATCATCCGC
GAGGAGGACG AGCCCAAGCC GGTGCTCATG GAGCGCTTCG GGCTCAGTGA TGCCCAGGCC
GAGGCGATCC TCGAGCTGCG CCTGCGCCAC TTGGCCCGTC TGGAGGAGAT GAAGATCCGC
GGCGAGCAGG GCGATCTGGA GCGTGAGCGC GACGAGCTGC GCGCCACCCT TGGCGACGCC
GCCCGGCTGC GCGAGCAGGT CAAGAGCGAG CTCCTCGGTG ATGCCGAGGC CCACGGGGAC
GAGCGCCGCT CGCCGCTGGT CCAGCGCGGT CCGGCGCGGG CCATGGACGA GACCGAGCTG
ATGCCCAGCG AGCCGGTCAC CGTGGTGCTC TCGGAGAAGG GCTGGGTCCG CGCCGCCAAG
GGCCACGAGG TCGATGCGAC CGGGCTCTCC TACAAGGCCG GCGACGGCTA CCTGGATCAC
GCCCTGGGCC GGTCGAACCA GCCGGCCATC TTCCTCGACT CCACCGGTCG GGCCTACGCG
GTCCCGGCGC ATACCCTGCC GTCGGCGCGC AGCCAGGGGG AGCCGCTGAC CAAGCGGCTC
ACGCCCCCGG AGAAGGCCCG CTTCCAGTCG GTCCTCGCCG GCGATTCGGA GTCGCGCTGG
TTGCTCGCCT CCGATGCCGG CTACGGCTTC CGTGTCCCCC TGCGCGAGCT CTTCTCGCGC
AATCGCTCCG GGAAGGCGGT GCTGACCCTG CCGGATGGGG CGGCTGTGCT CCCGCCCGTA
ACGGTGCCCG CGGAGGCCGA TGGTGCCGAG GTGGTGGTGG CCAGTTCCGA TGGCCGACTG
CTGGTCTTCC CCCTTGAGGA GCTGCCGGAG ATGGCGCGAG GCAAGGGCAA CAAGCTCATT
GGCATCCCTG CCCAGCGCCT GCGCGATCGC GAGGAGGTGG TGGTCGCCGT GGCCGTGCTG
CCCGCCGGCG CAGCGCTGCG TGTGGAGGCC GGCAAGACCG GCAAGCGGCT GAGCCACAGC
GATCTGGAGG CGTTCCGCGC CGCCCGGGGC CGGCGCGGTG TTCAGTTGCC GCGCGGGCTG
CGCCAGATCC GTGGGCTCCA CGTGGAGAAT TGA
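
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be joined before any computation such as a GC fraction. The following is a minimal sketch of that step. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first and last lines of the sequence are used here for brevity; the full sequence would be passed the same way.

```python
def join_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCCGAAA ACGCTGCCTC CGACTTCGAG ACGCAACCGC TCCGCGAGTT CACCGAGCGG"
last_line = "CGCCAGATCC GTGGGCTCCA CGTGGAGAAT TGA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

# Stop codons of the standard bacterial code (translation table 11).
STOP_CODONS = {"TAA", "TAG", "TGA"}

print(head.startswith("ATG"))    # does the CDS open with an ATG start codon?
print(tail[-3:] in STOP_CODONS)  # does it close with a stop codon?
print(gc_fraction(head))         # GC fraction of the first 60 bases only
```

The GC fraction of a single line will generally differ from the whole-gene figure given in the record, since it covers only a small window of the sequence.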
 
Protein sequence
MSENAASDFE TQPLREFTER AYLDYSMYVI LDRALPHVSD GLKPVQRRIV YAMSELGLSA 
AAKYKKSART VGDVLGKYHP HGDAACYEAM VHMAQPFTYR YPLIDGQGNW GSPDDPKSFA
AMRYTEARLT PYARVLLNEL GQGTVEWIPN FDGALQEPER LPARLPNVLL NGGSGIAVGM
ATDIPPHNLR EVVDACVHLL DRPEATTAEL CEHVPAPDFP TASDVITPRE ELRTIYERGH
GTVRARARWE YEREQAQIVI HALPYQVAGA KVMEQIAAQI QSRKLPMVED LRDESDHDAP
VRIVVQLRSR RQDPERVMDH LFATTDLERG YRVHLNVIGL DGRPRVFSLR DLLAEWLSFR
TETVRRRLTW RLEKVEDRLH ILEGLLTAYL NIDEVIAIIR EEDEPKPVLM ERFGLSDAQA
EAILELRLRH LARLEEMKIR GEQGDLERER DELRATLGDA ARLREQVKSE LLGDAEAHGD
ERRSPLVQRG PARAMDETEL MPSEPVTVVL SEKGWVRAAK GHEVDATGLS YKAGDGYLDH
ALGRSNQPAI FLDSTGRAYA VPAHTLPSAR SQGEPLTKRL TPPEKARFQS VLAGDSESRW
LLASDAGYGF RVPLRELFSR NRSGKAVLTL PDGAAVLPPV TVPAEADGAE VVVASSDGRL
LVFPLEELPE MARGKGNKLI GIPAQRLRDR EEVVVAVAVL PAGAALRVEA GKTGKRLSHS
DLEAFRAARG RRGVQLPRGL RQIRGLHVEN