Gene Haur_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2114 
Symbol 
ID5734002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2654695 
End bp2657313 
Gene Length2619 bp 
Protein Length872 aa 
Translation table11 
GC content51% 
IMG OID641279255 
ProductXRE family transcriptional regulator 
Protein accessionYP_001544882 
Protein GI159898635 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.566849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAA TCGCAGTTAA TTCACCAGCG GCTTTTGGCA AGCAATTGCG CTTGTTGCGC 
CGCCGTGCTC GCCTAACCCA AGCCGAATTA GGGATTGCCG TAGGCTATAG CGATGCGCAA
ATTTGTCGGC TTGAAACAGG CCGCCGTCCC CCCGATTTAA CCACCTTAAT TGCGTTATTT
TTGCCAGCGC TCGATATCGA GGCGCAATCA CACGAGGCTC AGCAGCTTTT AGAATTAGCC
GCAAGCGCCC GCGAGGAATT AATCGCTGAG CCAAGCCAGA ACAATCATGG TGCACTCAAT
CAAGCACTGC CAACCTTGGC CAGCCACCTG CCTAGCCCAA CCAGCTACTT TTTAGGCCGT
GGCGCTGAAC AACAACGCAT CTTGCAATGG CTGAATAATC CTACAATTCG CTTAATCAGC
ATTGTGGGCT TGGCGGGCAT TGGCAAAACC CAGCTTGGCT TGCAATGTTT GCATCAATTT
GCGAGCCAAA GCGAACAGCA GTGTGTTTTT GTTGATCTGG TGACCGCCAA TGACCCTGAA
TCGATGGTTC AGGCGATCAA CAAGGCGCTC GAAATCAGCG AAAGCCCCGA TGAGCATCCC
TTGAGTTTGG CAATTAGTCA GCTTGAGCAG CAACCAAGCT GTTTACTGTT GGATAATTGT
GAACAGATTC AGGATGCTAG CCGGGTGATC AGCCTATTGC TCAGCGAAGT ACCAACCCTC
AAATTAATTA TTACCAGCCA AGTAGCCTTG CGTCTCAGTG CCGAACATGT ACTACAACTC
ACGCCCTTGG CCGTGCCCAA TTTGTTGGCA TTGCCGCCCT TAGCTGAATT AGCCCAAATC
GAAGCTATGG CTTTGTTATT GGCCCGTTTG CAAGTGCATA ACCCCAAACT TGAATTGACC
GCGAAAAATG CGTTGGCGCT TGCTGCCTTG TGTGTACGCG TCGATGGTGT ACCGTTGGCG
CTAGAGTTAG TTGCTGCTTC AGGCCGTTTG TTCGACCCCG AAGCCTTGTT GAGCGAATTG
GCCAGCCATT TTTTGAGCAT GCGACGGCGG GGCCGCGATT TACCGTCGCG CCACTACTCA
GTCACAACCG CACTGACATG GAGTTATCAA CAGCTTGATT CAGCTAGCCA ACGTTTGTTT
GAGCGGTTGA GCGTATTCGT TAGCGGTTGG ACGGTCGAGG CGGCCTTGGC GGTTTGTGGC
CCAGAATACC AACGCCATGA GCTGATTGAA CAATTAAATG TGCTGCTTGA TCATAGCCTG
ATTCAACAAC AAACCAACGA TGATTCGACG CGAATGAGTA TGTTGACGAT GGTACGGACG
TTTGCTCAAG AGCAAGCCAA CAAGCATGCT GAACATGATT TGCTCAAGAG CCGCATGCTC
GATTATTTGA TTGAACTAGC CCAGCAAGCC GAGCAACCAC TGCGTTCAGG CAATAACCAA
GCTATGTGGA TTCAGCGGTT AGAGGCTGAG CACGATAATA TTCGGGCTGG CTTAAATTGG
GCTTGGCAAC ACAATGCCCA TCAACGCGGC ATTCAGTTGG TTGGCTATGT ATGGCGTTTT
TGGTATATGC GCGGCTATTT ACGTGAAGGT CGGCGTTGGT TTGAAAACCT ACTGATCAGC
CATGAACCAA CTGCTGACGT TGATTTTGCC CGAGCACTCG ATGGGGTCGG CATTTTGGCT
TGGAGACAAA GCGATTATCA GCAAGCTGAA CAATGGTATC AGCAAGCCCT TGCCATCTAC
CAAACCACCC AACACACCGC TGGCCAGGCG CAAGTTTTGG GTCATCTGGG CTTAGTGGCG
ATGGATACCG GAGCCTATGC CCAAGCAGCG GCCTACTACG AACAAAGTTT ACCGCTCTAT
CAAGCCGTTG AGGATCAATC CGGAGTGGTT GCCACATTGC ACAATCTCGG CAATCTCTAT
TGCCAACAAT CAGAAAATCA ACGGGCAAGC CAACTCTATC AAGAATGCTT ACAGATCTAT
CAGGAGATGG GTGATCAATC GGGAGTGGCA TTAATTGCCT TGGGTTTGGG GGTGATCGCC
CGTGATGAAC AACGTTTAGA TGCAGCGCAA GCCTCGTTTG AACAAAGCCT CAGCTTGGCG
CGTGAGTTGG GCGATGATTG GAATGAAGCG ACGGCCTTAA TCAATCTTGG TAACATCGCG
ATTGATACCA GCCAGCCCAA ACTTGGCTTG GAGCATTATC AAACCGCCAA ACAGATTTTC
GAGCGTTTAG GCGATCAGCA ATCATTGTGT CTGATCGAAA ACCGAATTTC TAATGCTCAC
TGGCTGCTGG GCGACTATGC CCAAGCCCAA GCTGGCTACC GTCAATGCCT CATGTTAGCC
CATGCAATCG GCTTTGATGG CGGAATTATT GAAGGTTTAG AAGGACTAGC CCATTGCTTG
AGCCAAACAT TGCCAACGAC TGCCGCCCAA CTCATGGCCT ACGCCGCCCA GATGCGTAGC
ACCAAAGGCT ACCCAATTAT TCCTGCCGAT GAAGCTGGTT ACAACCAAAT TGGGCAAGAA
ATTCAGGCCC ATTTAAGCAC CACCGCATGG CAACAGGCCT ACCAGCAAGG CCAACAGCTG
AGTTTGCAAC GAGCAGTAAG TTTGGCGCTG GCCAATTGA
 
Protein sequence
MTQIAVNSPA AFGKQLRLLR RRARLTQAEL GIAVGYSDAQ ICRLETGRRP PDLTTLIALF 
LPALDIEAQS HEAQQLLELA ASAREELIAE PSQNNHGALN QALPTLASHL PSPTSYFLGR
GAEQQRILQW LNNPTIRLIS IVGLAGIGKT QLGLQCLHQF ASQSEQQCVF VDLVTANDPE
SMVQAINKAL EISESPDEHP LSLAISQLEQ QPSCLLLDNC EQIQDASRVI SLLLSEVPTL
KLIITSQVAL RLSAEHVLQL TPLAVPNLLA LPPLAELAQI EAMALLLARL QVHNPKLELT
AKNALALAAL CVRVDGVPLA LELVAASGRL FDPEALLSEL ASHFLSMRRR GRDLPSRHYS
VTTALTWSYQ QLDSASQRLF ERLSVFVSGW TVEAALAVCG PEYQRHELIE QLNVLLDHSL
IQQQTNDDST RMSMLTMVRT FAQEQANKHA EHDLLKSRML DYLIELAQQA EQPLRSGNNQ
AMWIQRLEAE HDNIRAGLNW AWQHNAHQRG IQLVGYVWRF WYMRGYLREG RRWFENLLIS
HEPTADVDFA RALDGVGILA WRQSDYQQAE QWYQQALAIY QTTQHTAGQA QVLGHLGLVA
MDTGAYAQAA AYYEQSLPLY QAVEDQSGVV ATLHNLGNLY CQQSENQRAS QLYQECLQIY
QEMGDQSGVA LIALGLGVIA RDEQRLDAAQ ASFEQSLSLA RELGDDWNEA TALINLGNIA
IDTSQPKLGL EHYQTAKQIF ERLGDQQSLC LIENRISNAH WLLGDYAQAQ AGYRQCLMLA
HAIGFDGGII EGLEGLAHCL SQTLPTTAAQ LMAYAAQMRS TKGYPIIPAD EAGYNQIGQE
IQAHLSTTAW QQAYQQGQQL SLQRAVSLAL AN