Gene Haur_2467 details

Gene Information

Locus tag: Haur_2467
Symbol:
ID: 5734347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3154083
End bp: 3155093
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 49%
IMG OID: 641279606
Product: LacI family transcription regulator
Protein accession: YP_001545233
Protein GI: 159898986
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
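
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function name `expected_lengths` is hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon, neither of which the record states explicitly.

```python
from typing import Tuple


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and one stop codon at the end of the CDS.
    """
    gene_len = end_bp - start_bp + 1   # inclusive span on the replicon
    protein_len = gene_len // 3 - 1    # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed in the record
gene_len, protein_len = expected_lengths(3154083, 3155093)
print(gene_len, protein_len)
```

Under those assumptions the result agrees with the listed Gene Length (1011 bp) and Protein Length (336 aa).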


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACTGGAA GTAAACGGAT CACCATCCAT GATATTGCAC GCAAAGCTGG CGTATCACCC 
AGTACTGTCT CGCGGGTCTT GAACAGCACC ACACCTGTAG CCGAAGCCAA ACGCCAAGCT
GTAACAACGG CGATTCAACA GTTGGATTAT CGCCCAAATC TGATTGCCCA AGGCTTGGCT
CGTGGCACAT CGACGATTAT TGGCGTGCTG ACCCAAGATA TTGGTAGCCC GTTTTATGGC
GAGTTACTGC GCGGCATCGA ATATGGATTT CGTGGCAGTC GCTATCACCC GATTTTTGCC
GATGGCAACT GGCAACAAGC TGAGGAATAC AACGCATTAA ACATTCTGCG CTCACGCCAA
CCTGAGGCAT TAATTATTTT AGGTGGTTTA ATGCCTGATG CCGAAATGTT GGCCGCAGCC
CAAGAATTTC CCTTGATCAT TATTGGGCGA AGTGTGCCAA GTTTGGAAGA ATATTGTGTT
TTGGTTGATA ATTTCCAAGG AGCTTATCGC GCAACCCAAT ATTTAATTGA AATGGGCCAT
CAGCGGATTG CCCATATTAC TGGAATTCGC AGCCATCAAG ATACACTTGA TCGCCAGGCA
GGCTACGAAC AAGCCCTGCG CGATGCCAAC TTGCCAATTA ATCCCGACCT GATTGTTGAG
GGAACGTTTC AAGAACAATC GGGCTTACTA GCCGTCGAAA CCTTATTAAT GCGAGCAAAC
CCGTTTACCG CCCTCTTTGC AGCCAATGAT CAAATGGCCT ATGGTGCTCG CTTAGCGCTC
TATCGGCGAG GAATTCGGGT GCCCGAAGAT GTTTCGCTGA TTGGCTTTGA TGATTTGCCA
AGCTCAGCCT ATACCACGCC CCCATTAACT ACTGTTCGCC AACCAACCTT CGAAATGGGC
ATGAGTGCAG CCAAAGCAAC GCTTAATTTA ATCGATCAAC GGCCATGGCC ATTGCCTCAG
CTAACTCCCG ATTTAGTGAT TCGTGAGTCA ACTGGCTTCG CACGGCGTTA A
 
Protein sequence
MTGSKRITIH DIARKAGVSP STVSRVLNST TPVAEAKRQA VTTAIQQLDY RPNLIAQGLA 
RGTSTIIGVL TQDIGSPFYG ELLRGIEYGF RGSRYHPIFA DGNWQQAEEY NALNILRSRQ
PEALIILGGL MPDAEMLAAA QEFPLIIIGR SVPSLEEYCV LVDNFQGAYR ATQYLIEMGH
QRIAHITGIR SHQDTLDRQA GYEQALRDAN LPINPDLIVE GTFQEQSGLL AVETLLMRAN
PFTALFAAND QMAYGARLAL YRRGIRVPED VSLIGFDDLP SSAYTTPPLT TVRQPTFEMG
MSAAKATLNL IDQRPWPLPQ LTPDLVIRES TGFARR
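
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the pipeline that produced this record. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above
print(translate("ATGACTGGAA GTAAACGGAT CACCATCCAT"))  # MTGSKRITIH
```

The first ten residues produced this way match the start of the listed protein sequence. Each 60-base line of the gene sequence is a whole number of codons, so any full line can be translated on its own in the same way.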