Gene Hhal_0542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0542 
Symbol 
ID4709620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp610723 
End bp612753 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content68% 
IMG OID639854999 
Productexcinuclease ABC subunit B 
Protein accessionYP_001002130 
Protein GI121997343 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.291443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAGC GGTTCAAGCT GAACGCCCGT TTCCAGCCCG CTGGGGATCA GCCCCAGGCG 
ATCGACGAGC TCGTCGACGG GATCGAGTCG GGGCTCTCGG ACCAGACCCT GCTTGGGGTG
ACCGGTTCGG GCAAGACCTT CACCGTCGCC AACGTGATCG AGCGGCTGCA GCGTCCGGCG
ATCCTGCTGG CGCCGAACAA GACCCTGGCC GCGCAGCTCT ACGGCGAGAT GCGCGAGTTC
TTGCCGGAGA ACCGCGTCGA GTACTTCGTC TCCTACTACG ACTACTACCA GCCCGAGGCG
TACGTGCCCT CCTCGGATAC CTTCATTGAG AAGGACGCCT CGGTGAACGA GCACATCGAG
CAGATGCGCC TGTCCGCCAC CAAGGCGGTG CTCGAGCACA AGGATACGGT CATCGTCGCC
TCGGTCTCCT CCATCTACGG GCTGGGCGAC CCGCAGGCGT ATATGTCGAT GCTCCTGCAC
CTGGTGCGCG GTGAGCAGAT CGACCAGCGG GCGATCCTGC GCCGGCTGGC CGAACTGCAG
TACACCCGCA ACGACACCGA GCTGACCCGG GGCACCTACC GCGCCAGGGG CGAGGTGATC
GACATCTTCC CCGCCGAGTC CCCCGAAGAG GCGGTCCGGG TGCAGCTCTT CGACGACGAG
ATCGAGGAGA TCAGCTGGTT CGATCCGCTC ACCGGCGAGG TGCTGCGCCG GGTGCCGCGT
GTGACCATCT ACCCCAAGAC CCACTACGTC ACCCCGAAGG AGCAGATCCA CCAGGCCGTG
GAGCAGATCA AGGAGGAGCT GCGCGAGCGC CTGGATGAGC TGCGGGCTGC CGACAAGCTG
GTGGAGGCGC AGCGGCTGGA AGAGCGGACC CGCTTCGATA TGGAGATGAT GCTCGAGCTC
GGCTACTGCA ACGGCATCGA GAACTACTCC CGCTATCTCT CCGGCCGCGG GCCGGGAGAG
CCGCCGCCGA CTCTGTTCGA CTACATCCCG CCGGAGGCCG TGCTGTTCAT CGATGAGTCC
CACGTCACCG TTCCGCAGAT CGGCGGCATG TACAAGGGCG ACCGCTCGCG CAAGCAGACC
CTGGTCGAAT ACGGGTTCCG GCTGCCCTCG GCGCTGGACA ACCGGCCGCT GAAGTTCGAG
GAGTTCCGCC GCCTGGCGCC GCAGACGGTC TACGTCTCGG CCACGCCGGG CCCCTTCGAA
GAGGAGCACG CCGGGCAGGT GGTGGAGCAG GTGGTCCGCC CCACCGGGCT AGTGGACCCG
GAGGTGGAGG TGCGTCCGGC TACTGCCCAG GTCGACGATG TCTACGGCGA GATCCGCGCG
CGCGCCGAGC GGGACGAGCG GGTGTTGGTC ACCACGCTGA CCAAGCGTAT GGCCGAGGAC
CTGACCGAAT ACCTGGAGGA GAACGGCGTC CGGGTCCGCT ACCTCCACTC CGACGTGGAC
ACGGTCGAGC GCACCGAGAT CATTCGTGAC CTGCGCCTGG GCCACTTCGA TGTGCTGGTC
GGCATCAACC TCCTGCGCGA GGGGCTGGAC ATCCCCGAGG TCTCGCTGGT GGCTATCCTC
GATGCCGATA AGGAGGGCTT TCTGCGTTCT ACCCGCTCGC TGATCCAGAC CATCGGTCGG
GCCGCCCGCA ATATCGATGG ACGGGCGATC CTCTACGCTG ATCAGATGAC GGACTCCATG
CGCCGCGCCA TCGACGAGAC GGAGCGGCGG CGGGCCAAGC AGATCGCCCA CAATGAGGCC
CACGGCATCA CCCCGCAGGG GATCCGCAAG GAGGTGCCCG ACATCATGGA GCGCGGTGGT
GTGCCGGCGC CGGGGGCCCC GCAGCGGGCG GCCCGGGTGG CCGAAGAGGC GGGCGAGTAC
GCCGGGCTGT CCCCCGCCGA GGCGGTGCGG CGGATCCGCG AGCTAGAGAA GCGTATGCAG
GAGCACGCCC GGGAGCTGGA GTTCGAGGAG GCGGCCCGGG TGCGCGACGA GATCCGCCGC
ATCGAGCAGC ACGCCCTGGG CGGAGGCGGC GCCGCGGCTG CCGCGTCCTG A
 
Protein sequence
MTERFKLNAR FQPAGDQPQA IDELVDGIES GLSDQTLLGV TGSGKTFTVA NVIERLQRPA 
ILLAPNKTLA AQLYGEMREF LPENRVEYFV SYYDYYQPEA YVPSSDTFIE KDASVNEHIE
QMRLSATKAV LEHKDTVIVA SVSSIYGLGD PQAYMSMLLH LVRGEQIDQR AILRRLAELQ
YTRNDTELTR GTYRARGEVI DIFPAESPEE AVRVQLFDDE IEEISWFDPL TGEVLRRVPR
VTIYPKTHYV TPKEQIHQAV EQIKEELRER LDELRAADKL VEAQRLEERT RFDMEMMLEL
GYCNGIENYS RYLSGRGPGE PPPTLFDYIP PEAVLFIDES HVTVPQIGGM YKGDRSRKQT
LVEYGFRLPS ALDNRPLKFE EFRRLAPQTV YVSATPGPFE EEHAGQVVEQ VVRPTGLVDP
EVEVRPATAQ VDDVYGEIRA RAERDERVLV TTLTKRMAED LTEYLEENGV RVRYLHSDVD
TVERTEIIRD LRLGHFDVLV GINLLREGLD IPEVSLVAIL DADKEGFLRS TRSLIQTIGR
AARNIDGRAI LYADQMTDSM RRAIDETERR RAKQIAHNEA HGITPQGIRK EVPDIMERGG
VPAPGAPQRA ARVAEEAGEY AGLSPAEAVR RIRELEKRMQ EHARELEFEE AARVRDEIRR
IEQHALGGGG AAAAAS