Gene Haur_1479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1479 
Symbol 
ID5733364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1728072 
End bp1729160 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content55% 
IMG OID641278617 
ProductLacI family transcription regulator 
Protein accessionYP_001544251 
Protein GI159898004 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCACATG ATCACGAACC CGCAAAGCCC CATCGGATCA CCATTAGCAC CGTTGCCGCT 
GCTTTAGGGG TCGCTGTCTC GACCGTTTCT AACGCCTACA ATCGACCCGA CCAACTTTCA
GCAGAGCTGC GTGAACGAGT GCTGGCAGTC GCTACCGAGC TTGGTTACCC TGGGCCAAAC
CCAGTTGCTC GCAGTTTACG CCAACAACGG GCTGGCGCGG TCGGGGTGCT GTTTGCCGAG
CGCCTGCCCT ATGCCTTTCG CGATCCGGCG GTGTTGATGG TGCTTGAAGG GATTGCTACC
ACGCTCGAAC AGGCTGGTCT CGGCTTATTA CTCGTGCCAG GTCGCGACGA CGACACCACC
ACCGTTCAAC AAGCCTTGGT CGATGGCTTT ATTGTCTATT CGATGATGGA AACTGATCCC
TTGGTTCAGG CTGCTCTGAA ACGTCGGCTA CCGACAGTGC TGCTTGACCA ACCGCCCCGC
CCTGATGTAC CGTCGATTAT CGTTGATGAT GAGGCTGGCG CACGTATGGC CACCGAGCAT
CTATTAAGCC TCGGCCATCG CCAATTTGCA ATTATCACCG ATCGCTTGGT CGAAACCAAT
CTACGACCAT CGAGTGCGCC AATAAACGTT CATGATCAGA GCAAACCAAC CTTTTTCGTC
ACCCAATTAC GCTTGCAAGG CTATCGCCAG CCACTTGAAG CAGCAGGCAT CGATTGGCGC
AGCGTGCCAA TTTACGATTG CAACGATAAC AACGAAGCCG ATGGTGCAGC AGCCATCCAA
ATTTTACTCG CCCACAACCC ACGTCCAACT GCCATTCTCT GTTTAACCGA TCGTTTGGCG
TTGGGGGCAA TCGCTGGAGC GCAACAAGCG GGCTATCAGG TTCCGCAACA GCTTTCAATC
GTGGGCTTTG ATGATATTCC TCAAGCCAGT CAAAGCGTGC CGAGCTTAAC CACCATTCGC
CAAGATCATC GCCAAAAAGG CTTATCAGCA GGCCAAGCCT TGATTGAACT GCTGGCTGGC
CAAAGCCCAA CCAGCTATCA ACGGCTGGCA GTCGAGTTGG TAGTCCGCGA TTCGACCGCA
GCAATTTAA
 
Protein sequence
MAHDHEPAKP HRITISTVAA ALGVAVSTVS NAYNRPDQLS AELRERVLAV ATELGYPGPN 
PVARSLRQQR AGAVGVLFAE RLPYAFRDPA VLMVLEGIAT TLEQAGLGLL LVPGRDDDTT
TVQQALVDGF IVYSMMETDP LVQAALKRRL PTVLLDQPPR PDVPSIIVDD EAGARMATEH
LLSLGHRQFA IITDRLVETN LRPSSAPINV HDQSKPTFFV TQLRLQGYRQ PLEAAGIDWR
SVPIYDCNDN NEADGAAAIQ ILLAHNPRPT AILCLTDRLA LGAIAGAQQA GYQVPQQLSI
VGFDDIPQAS QSVPSLTTIR QDHRQKGLSA GQALIELLAG QSPTSYQRLA VELVVRDSTA
AI