Gene Haur_4967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4967 
Symbol 
ID5736803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6300864 
End bp6302960 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content49% 
IMG OID641282134 
Productexcinuclease ABC subunit B 
Protein accessionYP_001547725 
Protein GI159901478 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACCAT TAAAAGTCCA TGCACCCTAC GAACCGCGTG GCGATCAACC ACAGGCGATT 
GCCCAATTAG TCAATGGATT AAACAGCGGG CTTGTCCACC AAACCTTATT GGGGGCAACA
GGGACTGGCA AAACTCATAC AATTGCGCGG GTTATTGAGC AAGTTCAACG GCCTACCTTG
GTGATGGCAC ATAATAAAAC CCTCGCCGCA CAGTTGTATG CTGAGTTTAA AGAATTTTTT
CCTGAGAATG CCGTAGGCTA TTTCGTTTCC TACTACGATG CCTATACCCC CGAAGCCTAC
GTGCCATCAA AGGATTTGTA TATTGAAAAA GAAGCGCAAA TCAATGAAGA AATTGATCGT
TTGCGCCACG AAGCAACCCA AGCGCTGTTT ACCCGCAGCG ATGTCATTAT CGTGGCCTCG
GTCTCGGCGA TCTACGGGCT TGGCTCGCCC ACGGATTACG GTCAAGTGGC ACTCAAGCTT
AAAACTGGTG AGATTCGCAA CCGCGATAAA GTGCTGCGAA CGCTGATCGA TCTACAATTT
GAGCGCAACG ATCTTGATTT TCACCGTGGC ACCTTCCGTG TGCGTGGCGA TACACTCGAA
ATCTTTCCAG CTAATGCCGA AAGCGCCTTT CGGATCGAGA TGTGGGGCGA TGAAATTGAG
CGCATGGTCG AGGTTGATCC CTTGACAGGT GAGATTTTGA CCCAAAAAGA TCACATCGAA
GTCTTTCCGG CCAAGCACTT TATTCCCAAC GCCGACAAAA TGCAGGCAGC AATTGGCGAT
ATTCGGCTTG AGCTTGAACA GCAATTAGCC CATCTCGAAG GCGAAGGCAA AGTGCTTGAA
GCAGCGCGGC TCAAACAGCG CACACTCTAC GATCTCGAAA TTATGGAAGA ATTGGGCTAC
TGCTCAGGAA TTGAAAATTA TAGTCGGCAT ATGGATCGGC GTAGCGAAGG CCAAACACCG
TGGACATTGC TCGATTACTT TCCTGATGAT TTTCTTTTGG TGATCGACGA ATCGCATATT
TCAGTGCCGC AAATTCGTGG GATGTTCAAT GGCGACCGTT CGCGCAAGCA AACCTTGGTC
GATTTTGGCT TCCGTTTGCC CTCGGCGCTC GATAACCGAC CCTTGATGTT TGACGAATTT
TCCAAGCATG TGCATCAAGC AATTTATGTT TCAGCTACGC CTGGGGTCTA CGAATATCAA
CATCATGAAC AGGTAGTTGA GCAAATTATT CGGCCAACTG GCTTGCTCGA CCCTATGGTT
GAAGTGCGAC GCACCCGTGG CCAAATCGAT GATTTACTTG GTGAAATCAA ACGGCGGGTT
GATACTGGCT CGCGGGTATT GGTCACAACC CTCACCAAGC GCATGGCCGA AGATCTGACC
GATTATCTCA AAGAAATGGG CGTGCGCACT CAATATCTCC ACTCCGATGT TGATACGATT
GAGCGCATCG ACATTCTGCG TGATTTACGT TTAGGGGTTT TTGATGTGCT GGTGGGGATC
AACCTCTTGC GCGAAGGCTT AGACTTGCCT GAAGTATCGT TGGTGGCAAT TCTTGATGCT
GATAAAGCGG GCTTCTTACG CTCCGAATCG TCGTTGGTGC AAATTATTGG GCGGGCTGCC
CGCCACATCG ATGGTACGGT GCTGATGTAT GCCGATACGA TTACCCCAGC CATGGATTAT
GCGATCAGCG AAACCCGTCG CCGTCGCCAA ATTCAAGAGC GTTATAATCA ACAACATGGC
ATTGAACCCA AAGGGATCGT CAAAGCAGTA CGCGATTTGA CCGAAGGTAT GAAAAAAGTT
GCTGAAAAAC CAGCAGCCTA CCAAACCGCC GCCAACCCCG ACAGCATGAC CAAAGAAGAA
CTCTTCAAAG TGATCAACGC GCTTGAAAAA CAGATGAAAC AAGCCGCCAA AGACCTAGAG
TTTGAAAAAG CGGCCTTGCT GCGTGACCAA CTGACCGAAA TGCGCCAAAC CCTAGCCTTG
ATCGACAACA CGGCCTTGCT TGAGAGCGTC AATCGCAAAC CACGAGCCAA AGCAGCTATG
GTGGTTGAAG ATACTGGTAA ACGCAAAGTC AAAGGCCGTT CACGCGGCAA ACTTTAA
 
Protein sequence
MPPLKVHAPY EPRGDQPQAI AQLVNGLNSG LVHQTLLGAT GTGKTHTIAR VIEQVQRPTL 
VMAHNKTLAA QLYAEFKEFF PENAVGYFVS YYDAYTPEAY VPSKDLYIEK EAQINEEIDR
LRHEATQALF TRSDVIIVAS VSAIYGLGSP TDYGQVALKL KTGEIRNRDK VLRTLIDLQF
ERNDLDFHRG TFRVRGDTLE IFPANAESAF RIEMWGDEIE RMVEVDPLTG EILTQKDHIE
VFPAKHFIPN ADKMQAAIGD IRLELEQQLA HLEGEGKVLE AARLKQRTLY DLEIMEELGY
CSGIENYSRH MDRRSEGQTP WTLLDYFPDD FLLVIDESHI SVPQIRGMFN GDRSRKQTLV
DFGFRLPSAL DNRPLMFDEF SKHVHQAIYV SATPGVYEYQ HHEQVVEQII RPTGLLDPMV
EVRRTRGQID DLLGEIKRRV DTGSRVLVTT LTKRMAEDLT DYLKEMGVRT QYLHSDVDTI
ERIDILRDLR LGVFDVLVGI NLLREGLDLP EVSLVAILDA DKAGFLRSES SLVQIIGRAA
RHIDGTVLMY ADTITPAMDY AISETRRRRQ IQERYNQQHG IEPKGIVKAV RDLTEGMKKV
AEKPAAYQTA ANPDSMTKEE LFKVINALEK QMKQAAKDLE FEKAALLRDQ LTEMRQTLAL
IDNTALLESV NRKPRAKAAM VVEDTGKRKV KGRSRGKL