Gene Haur_4372 details

Gene Information

Locus tag: Haur_4372
Symbol:
ID: 5736929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5583474
End bp: 5587040
Gene Length: 3567 bp
Protein Length: 1188 aa
Translation table: 11
GC content: 52%
IMG OID: 641281534
Product: XRE family transcriptional regulator
Protein accession: YP_001547132
Protein GI: 159900885
COG category: [L] Replication, recombination and repair
COG ID: [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit
TIGRFAM ID: [TIGR01443] intein C-terminal splicing region
            [TIGR01445] intein N-terminal splicing region
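
The coordinate and length fields in this record are related by simple arithmetic, which makes a quick consistency check possible. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5583474, 5587040)
print(length_bp)                  # 3567
print(protein_length(length_bp))  # 1188
```

Both results agree with the Gene Length and Protein Length fields of this record.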


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00730956
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACAA ATTCCAATCA GAGTACATAC GATGCCGCCC AGATTCAAAT GTTGCGCGGC 
TTGGAAGCAG TACGCGAAAA CATGGGGATG TATCTCGGTG GCCAAGACAC GTCAGCATTA
CATCACTTGG TCTATGAAGT TGTCGATAAC TCGGTTGACG AGGCCTTGGC TGGCTTCTGC
GATACGATCA TCGTCGAGAT GCGGACTGAT GGGTCAATCG CTGTCGTCGA TAATGGGCGC
GGGATTCCCA CCGATATTCA CCCAGTCGAG GGGCGTTCGG CCTTGGAAAT TGTGCTGACC
GAGCTGCACG CTGGTGGTAA GTTCAAAGGC TCACAGGGCT ACAAGGTTTC TGGTGGTTTG
CACGGGGTCG GGGTTTCGGC AGTTAACGCA GTTTCCGAAT TTTTGCGGGC TGAAGTTAAA
CGCGATGGCA AGCTGTGGGC GCAAGATTTT CGCCTTGGCA TGCCTCAGGC TCCAGTCAAA
GCGGTTGGCG ATGCTGAAGG CACGGGCACA ACGATTATCT TCAAGCCCGA TGCCCAATTA
TTTACCACGG TTGATTTCAA CTATCGCACC TTGGCTAACC GTTTACGCGA TATGGCCTAC
CTCAACAAGA GCCTGCGCTT CAAGCTCGTC GATTATAACA ACGACCGCGA AGTAACCTAT
TATTTTGATG GTGGGATTGT CTCGTTTGTG CGCCATCTGA CGCGCGAAAA AGGCCCAGTG
CTGGCTCAGC CGTTCTACGT CGAAAAACCT TATGAAAACG TCAATGTTGA AATTGCAATG
CAATACACTG GCGATTTCAA CGAAAATCTT TTAGCCTTCA CCAATAACAT TGCCAACCCC
GATGGTGGTA CGCACGTCAC TGGCTTCCGC GCGGCCTTGA CCCGCACGAT CAATGCCTAT
GGTCGTAACA AAGGCTTGCT CAAAGAAGGC GATGCACTTT CGGGCGAAGA TGTGCGCGAG
GGCTTGACCG CGATCATCAG CATCAAGCTG TTCCGCCCAC AATTCGAAAG CCAAACCAAA
TCGAAGCTGG CAACGCCTGA AGCTAAAACC GCCGTCGAAA CCGTGCTCAA CGAAGCACTT
TCAGCCTTCT TGGATGAAAA TCCTAACGAG GCCCGCCGGA TTATCGAAAA ATCGCTGTTG
GCTTCACGCG CCCGCGATGC CGCCCGTAAA GCCCGCGATT TGGTGCAACG CAAAGGTGCA
CTCGAAGGCT TTGCGCTGCC AGGCAAGCTC GCCGACTGCT CAGATAAAGA GCCTGCCCAC
TGCGAAATCT TCATCGTCGA AGGCGATAGT GCCGGGGGAA GCGCAAAACA AGGCCGTGAT
CGTCGTTTCC AAGCGATTTT GCCGCTGCGC GGTAAAATTC TGAACGTAGA AAAATCACGT
TTGGATAAAA TGCTGGCAAA TAACGAAGTT CGAGCATTAA TCACCGCACT TGGCACAGGT
ATTGGCGAAA CATTTGATAT TTCGCGTTTG CGCTACCACC GCATTTTGAT TATGAGCGTT
GCTGGCGATG AGCCAACCTT GATTCGCAAT GCGCAAGGTC ATACCGAGTT TGTGCGCATC
GGCGAGTTTA TCGATCAATG TATTGCAGGC CAACGTAGTG CTAGTGAATA CGAAGTGATC
AGCTTCGATC AAAAGCGGCA TGTTGCGCGT TTCCGCCCAC TCAAAGCCGT GATGCGCCAC
GCCAACCATG AGCCAATGTA CAAACTGACC ACACGCTATG GCCGCTCGGT CAAAGTAACT
GCCTCGCATA GCGTCTTTGT GCTCGAAAAT GGTCAGCCTG TGCTGAAAAA GGGCGATCAA
ATTCGACTTG GCGATCAGTT GGTTGCCAGC CGTCGGATTC CACGCCCAGC CAGCCAACCT
CGCGAAATCG ACCTTATGAA GTTGTTTGCT GAAGCAGGCT TGATTGATAA CCTGTATTTG
CGCGGCGAAA GCGTGCGCAC GATCGCAGCC CAACGGGTGC TCAACCACGT TTCTCAGCCT
GAACAATGGA ACGAAGCACG GATCAACTTG CATACTGCTG CATGGGAACG CTTGGTCGAA
TATCGCCAAG CCAATGGCCT CAGCCAAAAA GCAGTTGCCC AACGCCTTGG TGTGAAGCAA
GCGATTACGA TTAGCCAATG GGAACGCGGC ATGCTGCGGC CTATTCAATC ACAATTTGAC
AACTATCTGG CAACGATTGG CTGGGAAGAA CCAATCAGTT ACGAACTTGT GCCATCGAAG
ATCGAGCGCT TGTTGTTACA AGATGATAGC AGCGCCAACG CTCGCTGGCG CGAAGTGAGC
AACTACAAAG CCTTTGATAG TTTCAGCAAC GATGAGCTAG ATTTGCTTGA TCGTGACGTT
GAGCTTGTAC CGCAAGCCCA CGACGATCGA GCTTTCCCAC GCATGTTGAG CATCACCCCA
GAATTGCTGT GGTTCTTGGG CTGGTTTACT GCTGAAGGCT CCTTGAGCAA GCATCAAGTC
AGTTTGTCGT TAGGTCAAAA AGATGCAGCC TTCTTCGACG AGCTAAAAAC CACGATCGAG
CAACTCTTTG GCGAAACGCC ACGCTTCTAT CAATCGCCTG ATAATGGCGG GATCAAATGC
TATTTCCATA GCGTGTTGGC GGCGCGACTG ATTCGAGCCT TAGGCTTGGG CGCGGTTGCT
CATCAAAAAC GTGTGCCAAA CATGCTGTTC AGCCTGAGCA ACGATTTGCA ACGCAGCTAT
CTCGAAGGCT ATTTCCTTGG CGATGGCACA CTCAGCGATA GCACAATCAG CATGACCACC
AACTCGACTG AACTCAAAGA TGGCTTACTC TATTTGCTTG GTCAGCTTGG TGTTTTTGCT
GGCGTGAGCA AGATCAAGCC CAACTTACCA GCCGATGCAC CAATTCAAAC CGTGCACGAC
TACTACAATA TTGCAATTAG CGGCAAGCAA CAGCTAGAGC AATTGAGTGG GGTTTGGCAG
CGCCATCATT TGGCTGCCAA AGTCGAGGCA CATTTGGCCA AACCAGCGAC CAAAGCTCAA
GCATTCACGC CATTAAGCGA CGATATGGTT GGCTTAGAAG TGTTGGCAGT TGAAGAATTG
GCTCCAACTG GCGAGTTCGT CTACGACTTC TCGGTCGAAG AAGACGAAAA CTTCCTGTGT
GGCACTGGCG GTTTATACGC TCACAATACC GACGCAGACG TTGACGGCAG CCACATCCGC
ACCTTGTTGC TGACCTTCTT CTTCCGCCAT ATGCGCGATT TGATCACCAA TGGGCACTTG
TATGTAGCTC AGCCGCCATT GTTCCGCGTA CAACATGGCA AGGCCTACAA ATATGTCTAC
GATGAAGCCA CCCGCGATGA GTACATTCGC TCATTGCCAG CTGGCACCAA AGTCACCGTT
CAGCGCTTCA AAGGGCTAGG CGAAATGAAT CCCGACCAAC TGTGGGACAC CACGCTCAAC
CCGGGCAATC GCATGATTTT ACAAGTGACA GTTGAAGATG CAATGGAAGC CGATGAAACC
TTCTCGATGT TGATGGGTGA AATCGTGTTA CCGCGCAAGC GCTTTATCCA AACTCACGCC
GCCGACGTGA AGAACTTGGA TGTGTAG
 
Protein sequence
MATNSNQSTY DAAQIQMLRG LEAVRENMGM YLGGQDTSAL HHLVYEVVDN SVDEALAGFC 
DTIIVEMRTD GSIAVVDNGR GIPTDIHPVE GRSALEIVLT ELHAGGKFKG SQGYKVSGGL
HGVGVSAVNA VSEFLRAEVK RDGKLWAQDF RLGMPQAPVK AVGDAEGTGT TIIFKPDAQL
FTTVDFNYRT LANRLRDMAY LNKSLRFKLV DYNNDREVTY YFDGGIVSFV RHLTREKGPV
LAQPFYVEKP YENVNVEIAM QYTGDFNENL LAFTNNIANP DGGTHVTGFR AALTRTINAY
GRNKGLLKEG DALSGEDVRE GLTAIISIKL FRPQFESQTK SKLATPEAKT AVETVLNEAL
SAFLDENPNE ARRIIEKSLL ASRARDAARK ARDLVQRKGA LEGFALPGKL ADCSDKEPAH
CEIFIVEGDS AGGSAKQGRD RRFQAILPLR GKILNVEKSR LDKMLANNEV RALITALGTG
IGETFDISRL RYHRILIMSV AGDEPTLIRN AQGHTEFVRI GEFIDQCIAG QRSASEYEVI
SFDQKRHVAR FRPLKAVMRH ANHEPMYKLT TRYGRSVKVT ASHSVFVLEN GQPVLKKGDQ
IRLGDQLVAS RRIPRPASQP REIDLMKLFA EAGLIDNLYL RGESVRTIAA QRVLNHVSQP
EQWNEARINL HTAAWERLVE YRQANGLSQK AVAQRLGVKQ AITISQWERG MLRPIQSQFD
NYLATIGWEE PISYELVPSK IERLLLQDDS SANARWREVS NYKAFDSFSN DELDLLDRDV
ELVPQAHDDR AFPRMLSITP ELLWFLGWFT AEGSLSKHQV SLSLGQKDAA FFDELKTTIE
QLFGETPRFY QSPDNGGIKC YFHSVLAARL IRALGLGAVA HQKRVPNMLF SLSNDLQRSY
LEGYFLGDGT LSDSTISMTT NSTELKDGLL YLLGQLGVFA GVSKIKPNLP ADAPIQTVHD
YYNIAISGKQ QLEQLSGVWQ RHHLAAKVEA HLAKPATKAQ AFTPLSDDMV GLEVLAVEEL
APTGEFVYDF SVEEDENFLC GTGGLYAHNT DADVDGSHIR TLLLTFFFRH MRDLITNGHL
YVAQPPLFRV QHGKAYKYVY DEATRDEYIR SLPAGTKVTV QRFKGLGEMN PDQLWDTTLN
PGNRMILQVT VEDAMEADET FSMLMGEIVL PRKRFIQTHA ADVKNLDV
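
Both sequences are displayed in blocks of ten characters separated by spaces, so they usually need to be joined before any downstream use. The following is a minimal sketch of that clean-up, with a helper for computing the GC fraction of the joined sequence. `join_blocks` and `gc_fraction` are hypothetical names chosen for illustration. Only the first and last lines of the gene sequence are used here, to keep the example short.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence as displayed in this record.
first_line = "ATGGCGACAA ATTCCAATCA GAGTACATAC GATGCCGCCC AGATTCAAAT GTTGCGCGGC"
last_line = "GCCGACGTGA AGAACTTGGA TGTGTAG"

head = join_blocks(first_line)
tail = join_blocks(last_line)
print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 27 TAG
```

Applying `join_blocks` to the whole gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of this record.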