Gene Haur_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1946 
Symbol 
ID5733835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2359933 
End bp2360898 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content53% 
IMG OID641279090 
Producthelix-turn-helix type 11 domain-containing protein 
Protein accessionYP_001544717 
Protein GI159898470 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTCGC CAACTAGCCG ACTGTTGGGC GTGCTCGAAC TCTTACAATC ACGCCAGCAA 
ATCAGTGGAA GCGAGCTGGC TCAGCGGCTC GAAATTCACC CGCGCACGGT TCGCCGCTAT
ATTCAACAAC TGCAAGATAT GGGGATTCCG GTCGAGGCTG AACGGGGCAT CTACGGCGCT
TATCGTTTAA AGCCAGGCCA GCGAGTTCCA CCACTGTTAT TCACCGAACA AGAAATCCTG
GCCTTGAGTT TAGGCTTATT AACGATTCGC GAATTACAGT TTCCGGTTGA GCGAGCAACC
GCTGAAACGA CCTTAGCCAA GATCGAGCGT GTGCTGCCTG CTGGTTTGGT GCAGCATGCT
CGCGCCCTGC AAGCAGCAAT CAGTATCCAA CTTGGCATCC GTTCACCGCG AATCGAGCCA
GCTTGGTTGC TGCAATTGAG TTTGGCGGTG CAATACTGCC AACAAGTTCA ACTTGAATAT
TTGTCTGCTC AACAGAATAT GACTGAACGC ACGATCGAAC CCTATGGAAT TGTTTTTAAC
GAAGGCGCTT GGTATCTAGT AGGCTATTGC GATTTACGCA GGGCCATGCG GGTGTTTCGC
TTGGATCGCA TTCAGGCGAT GCGATTGTTG GATTCAAACT TCGAGCGGCC AGCCTCAATT
GATATTGTCG AAATAGTGCA GGCAGCCCTG AACGCCAGCG ATCAGCCTGA CGAGGTTGAG
GTGCTGCTCA AAACAACTAT CGAGCATGCC CGCCACATTA TTCCGCCAGC TATGGGCCGG
CTTGAAGCAA CGCCGCAGGG AGTTCTGTTG CGCCGCGCCG CCGTGCATTT GGAATGGGTC
GCCTTTACGT TGCTGGAGCT TGATATTCCA GTGGTGGTGC TACAACCTGC TGCATTACGC
GAGCTGCTTC AAGCGATCGC AACCAAAGCT TTGGGCATGA GTAATACTGC TGCGCCAACC
GATTAA
 
Protein sequence
MYSPTSRLLG VLELLQSRQQ ISGSELAQRL EIHPRTVRRY IQQLQDMGIP VEAERGIYGA 
YRLKPGQRVP PLLFTEQEIL ALSLGLLTIR ELQFPVERAT AETTLAKIER VLPAGLVQHA
RALQAAISIQ LGIRSPRIEP AWLLQLSLAV QYCQQVQLEY LSAQQNMTER TIEPYGIVFN
EGAWYLVGYC DLRRAMRVFR LDRIQAMRLL DSNFERPASI DIVEIVQAAL NASDQPDEVE
VLLKTTIEHA RHIIPPAMGR LEATPQGVLL RRAAVHLEWV AFTLLELDIP VVVLQPAALR
ELLQAIATKA LGMSNTAAPT D