Gene Haur_0459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0459 
Symbol 
ID5732358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp535805 
End bp536827 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content53% 
IMG OID641277585 
ProductLacI family transcription regulator 
Protein accessionYP_001543238 
Protein GI159896991 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCAT TGACGATTGA ACAAATTGCA GAATTGGCGC ATGTCTCGAT TGCAACGGTT 
TCGCGGGTGC TGAACAATCA GCCGCATGTA CGGGCCGAAG TGCGCGAACG AGTGCTCGCC
GTGATGCAGG AGCATCATTA CACCCCGCAT GCAGCAGCGC GAACCTTGGC AGGCCAACGC
CCCAAAGTAA TTGCGGTCTT TATTTCGCGC ACTACTGCCA CAATCTTTCG TAACCCGACC
TTTTCGCACC TGCTTCAGGG CATGACCGAG GCTTGTAATG CCAAGGGCTA TGTATTAATG
GTCTCGTTGG ATAGCGCCAA TCGGGCCGAT GATCCAGCGA CTAAATTACT GAATGGTCGC
AGCGTCGATG GCCGAATCGT GATTCCCAAT ACAATCAATG ATCCATTGTT ACCGCAACTG
ATCGCCGACC GCGTGCCGAT GGTGCTGGTG GGCAAACATC CCTTGCTGCA TCATATTGTC
AGCGTCGATA TTGACCATTT TGCGGGTAGC TACCAAGCAG TGCATCATCT GCTGAAACTT
GGCCATCAAT CGGTTGGCAT GATCGTTGGC TCGTTGCATG CACTGGCCAC CCACGATCAG
ATTGAGGGCT ATCAACAGGC CTATCGCGAG GCAGGTTTGC CAATTCACCC AGCATTGATT
GCCACTGGAA ATGAGACTGA ACAAGGCGGC CAAGCGGCAA TTGAGCAATT ACTAGCCTTG
CAACCACGGC CAAGCGCCCT GTTTGTCAGC AGCGCCGTGA TGGCCACAGG CGTATTACAA
ACCTTGCATA CCCAAGGGCT GCGCGTGCCT GATGATATAA CCGTGGTCTG TTTTGATGAT
GTGCCAACTG CCGCCAGCCT CAACCCCAGC ATGACCTCGT TGCACCAACC AATGTACGAT
CTTGGTGCAA CCGCCGCCAA TGTGTTGATC GAGCTGATCG AAGGCAAAAC TCCTAGCAGC
GAAAATACCA TTTTGCCAAT TAGCATGATC GTGCGCCACG AAAACGAGGT TTGGGGTGGC
TAA
 
Protein sequence
MASLTIEQIA ELAHVSIATV SRVLNNQPHV RAEVRERVLA VMQEHHYTPH AAARTLAGQR 
PKVIAVFISR TTATIFRNPT FSHLLQGMTE ACNAKGYVLM VSLDSANRAD DPATKLLNGR
SVDGRIVIPN TINDPLLPQL IADRVPMVLV GKHPLLHHIV SVDIDHFAGS YQAVHHLLKL
GHQSVGMIVG SLHALATHDQ IEGYQQAYRE AGLPIHPALI ATGNETEQGG QAAIEQLLAL
QPRPSALFVS SAVMATGVLQ TLHTQGLRVP DDITVVCFDD VPTAASLNPS MTSLHQPMYD
LGATAANVLI ELIEGKTPSS ENTILPISMI VRHENEVWGG