Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1434 |
Symbol | |
ID | 5733342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1661386 |
End bp | 1662324 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278572 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001544206 |
Protein GI | 159897959 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.458798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTGT TGGAGCTTGC CCCCACCCGT GAAACTGAAC CCTTATTAGG GCCAGAACGC CGCTATTTGG CCCAGTTATT GGCTCAGCAT GCCCCACATG ATGGGCGGTT TGATTTGGCT TTTCCTCGAA CCTATGCCAT CCGTGAATCA AAACCGCATC AACGCTCGCT GCCAGCACTC TATCAGCCCA GCATTTGTCT TGTAGCCCAA GGAGCCAAAC AGTTGGTCTT GGGTGATCAG ACCTACAATT ATGATACTGA ACATGTGTTG ATTGTGGCTG TTGATTTGCC AGTTGCCGCC CAAATTACCC AAGCCAGCGC TGCCGAGCCA TATTTGTGCT TCAAGCTGGA GTTTGATCCA CAGCGCGTGG CCGAATTAGC GCTTAAAGTC TATTCGCAGG GTGTTCCTGC GCCAACTCAG CTTCGGGCAA TTTCGACTGA ACGCACCAAT CGCCAGTTGG TACAAGCTGC GAGCCGCTTG CTTGATTTGC TCAGCCAGCC CGATGAGGTT GAATTGTTGG CTCCGTTGGT GATCGACGAG ATGATTATTC GCTTGTTGCG TAGCCCATTT GGTGGTCGGC TGGCTCAAAT TGGCCAAACC ACTTCGCGAG TTACTCCAGT AGCGCAGGCG ATTGGTTGGT TGCGCCGCCA TTTTGCCCAA CCTTTGCGGG TTGAGGAGCT TGCCACGCTG GCGAATATGA GTGTTTCGGC TTTTCATCTA CACTTCAAGG CGGTTACCGC GCTTAGTCCA ATTCAGTTTC AAAAAACCTT GCGTTTGCAC GAAGCGCGGC GTTTATTAGT TGCGACAGCA TTAGAGATTG GGTCAATTAG TCAGCAAGTT GGCTATGCCA GCCTCTCACA ATTTAGCCGC GAATATCGAC GCTACTTTGG TTGCTCACCA ACTGAGGATT TGGCTCGCTT GCGGCGATCA AGTGCCTAG
|
Protein sequence | MTLLELAPTR ETEPLLGPER RYLAQLLAQH APHDGRFDLA FPRTYAIRES KPHQRSLPAL YQPSICLVAQ GAKQLVLGDQ TYNYDTEHVL IVAVDLPVAA QITQASAAEP YLCFKLEFDP QRVAELALKV YSQGVPAPTQ LRAISTERTN RQLVQAASRL LDLLSQPDEV ELLAPLVIDE MIIRLLRSPF GGRLAQIGQT TSRVTPVAQA IGWLRRHFAQ PLRVEELATL ANMSVSAFHL HFKAVTALSP IQFQKTLRLH EARRLLVATA LEIGSISQQV GYASLSQFSR EYRRYFGCSP TEDLARLRRS SA
|
| |