Gene Hhal_1983 details


Gene Information

Locus tag: Hhal_1983
Symbol:
ID: 4710330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2185927
End bp: 2187837
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 69%
IMG OID: 639856456
Product: DNA topoisomerase IV subunit B
Protein accession: YP_001003549
Protein GI: 121998762
COG category: [L] Replication, recombination and repair
COG ID: [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit
TIGRFAM ID: [TIGR01055] DNA topoisomerase IV, B subunit, proteobacterial
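
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below is only an illustrative consistency check, not part of the record. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2185927
end_bp = 2187837

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1911 636
```

Both results agree with the Gene Length (1911 bp) and Protein Length (636 aa) listed in the record.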


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.267905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACCA AGACCCAGCA GACCTACGAC GCCTCCGACA TCGAAGTCCT GACCGGACTG 
GAGCCGGTAC GCCGGCGTCC CGGTATGTAC ACCGAAACCA ACCGCCCCGA CCACCTCGCC
CAGGAGGTGA TCGACAACAG TGTCGACGAG GCGGTCGCCG GACACGCCCG GCGCATCGAG
GTGACCGTGC ACGCTGATGG CTCCCTGGAG GTCAGCGACG ACGGCCGCGG CATGCCGGTG
GATATGCATC CCACCGAGGG GGTGCCCGCG GTCGAGGTGA TCCTCGGCCG GCTCCACGCC
GGCGGCAAGT TCTCGAGCAA GAGCTACCGC TATTCCGGCG GCCTGCACGG GGTCGGTGTC
TCCGTGGTGA ACGCGCTATC GACGCGGCTG CAGGTGCATA TCCGCCGCGA CGGCGCCGAG
CACACCATCG CCTTCTCGGC GGGGGAGCGG ATCGAGCCGC TGGAGCGGGT GGGCAAGACG
CGGGAGACCG GCACCACGCT GCGCTTCTGG CCGGATACCA GCTACTTCGA CAGCCCGCGC
TTCTCTCGCG CCCGGCTCGA GCACCTGCTG CGCGCCAAAG CGGTGCTCTG CCCGGGGCTG
ACGGTCATCC TGCGCGACGC CACCGGGGGC GAGGGGCAGG AGCCCGAGGT GCTGACCTGG
TACTACGAGG ATGGGCTGCG CGATTACCTG GCCGGTGCCG TGGCCGACTA CGAGCATCTG
CCGGATCCGC CGTTCACTGC CCGGCTCGGC TCCGAGCACG AGGAGATGGA GTGCGCCCTG
GTGTGGTTGC CGGAGGGCAG CGGGCCGGCG GAGAGCTACG TCAACCTCAT CCCCACTCCT
CAGGGCGGGA CCCACGTCAA CGGCCTGCGC ACCGGCCTGA CCGAGGCCAT GCGCGAGTTC
TGCGAGCTGC GTAACCTGCT GCCGCGCGGG GTGCGCATCG CCCCGGAGGA CGTCTGGGAG
CACATCAGCT ACGTGCTGTC GGTGAAGATG CATGAGCCGC AGTTCGCCGG GCAGACCAAG
GAGCGGCTCT CCTCGCGCAA CTGTGCCGCC TTCGTCTCCG GGGCCGTGAA GGACGCCTTC
AGCCTGTGGC TCAACGAGCA CACCCAGAGC GCCGAGCAGA TTGTCGATCT GGTGGTGCGC
GCCGCTCAGC GCCGCCAGCG GGCCGCCAAG AAGGTGACGC GCAAGCGGGT GGGCAGTGGG
CCGGCGCTGC CCGGCAAGCT GGCCGACTGC ACCGGGCAGG ATCCGGCGCG CAGCGAGCTC
TTCCTGGTCG AGGGGGATTC CGCCGGCGGC TCGGCCAAGC AGGCCCGCGA ACGGGAATTC
CAGGCGGTCA TGCCGCTGCG CGGTAAGATC CTCAATACCT GGGAGGTGGC TCCCGACGAG
GTCATGGCCT CCCAGGAGGT CCACGATATC GCCGTGGCCC TGGGGGTCGA CCCGGGCAGC
GAGGACCTCT CCGGGCTGCG CTACCACAAG GTCTGCGTCC TGGCCGACGC TGACCCCGAC
GGCGCGCACA TTGCCACGCT GCTCTGCGCC CTGTTCCAGC GCCACTTCCC GGCCCTGGTG
GCCGGCGGCC ACGTCTACGT GGCGATGCCG CCGCTGTATC GCATCGATGT TGGCAAGCAG
ACCTTCTACG CCCTCGACCG CGACGAGCGC CAGGGTATCC TCGACCGGAT CGAGGCCGAG
CGCATCAAGG GCAAGGTGCA GGAGACGCGC TTCAAGGGCC TTGGTGAGAT GAACCCGGTC
CAGCTGCGCG AGACCACCAT GGCCCCGGAT ACCCGCCGCC TGGTTCAGCT CACCGTCGAT
GACCCCGACG AGACCGAACG CCTGCTCGGC ATGCTGCTCG GCCGTGGCGC CGCAGCCCAG
CGCCGCGAGT GGCTGGAGAG CAAGGGCAAC CTGGCGGAAA TCGTCCTTTG A
 
Protein sequence
MTTKTQQTYD ASDIEVLTGL EPVRRRPGMY TETNRPDHLA QEVIDNSVDE AVAGHARRIE 
VTVHADGSLE VSDDGRGMPV DMHPTEGVPA VEVILGRLHA GGKFSSKSYR YSGGLHGVGV
SVVNALSTRL QVHIRRDGAE HTIAFSAGER IEPLERVGKT RETGTTLRFW PDTSYFDSPR
FSRARLEHLL RAKAVLCPGL TVILRDATGG EGQEPEVLTW YYEDGLRDYL AGAVADYEHL
PDPPFTARLG SEHEEMECAL VWLPEGSGPA ESYVNLIPTP QGGTHVNGLR TGLTEAMREF
CELRNLLPRG VRIAPEDVWE HISYVLSVKM HEPQFAGQTK ERLSSRNCAA FVSGAVKDAF
SLWLNEHTQS AEQIVDLVVR AAQRRQRAAK KVTRKRVGSG PALPGKLADC TGQDPARSEL
FLVEGDSAGG SAKQAREREF QAVMPLRGKI LNTWEVAPDE VMASQEVHDI AVALGVDPGS
EDLSGLRYHK VCVLADADPD GAHIATLLCA LFQRHFPALV AGGHVYVAMP PLYRIDVGKQ
TFYALDRDER QGILDRIEAE RIKGKVQETR FKGLGEMNPV QLRETTMAPD TRRLVQLTVD
DPDETERLLG MLLGRGAAAQ RREWLESKGN LAEIVL
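
The gene and protein sequences above can be cross-checked against each other, and a GC fraction can be computed from the gene sequence. The following is a minimal sketch with hypothetical helper names (`clean`, `gc_fraction`, `translate`), not the method used to generate this record. It assumes the space-separated 10-base blocks are layout only, and it uses the amino acid assignments of translation table 11, which match the standard genetic code (the two tables differ in which start codons are permitted, which this sketch ignores).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino acid assignments are those of translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGACAACCA AGACCCAGCA GACCTACGAC"
print(translate(first_blocks))  # prints: MTTKTQQTYD
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein sequence and the reported GC content to be compared in the same way.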