Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5075 |
Symbol | |
ID | 5737033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 92103 |
End bp | 93098 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641282240 |
Product | regulatory protein, DeoR |
Protein accession | YP_001547831 |
Protein GI | 159901585 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATG ATCGCCTACG AGCAAGAGCT GCGCGGTTAT ATGGGATTGG TCGCTATCTC TACGAGCGTG GTACTCAAGG TGCACGGGTT GTGGATTTAG CCCAGTATTT CGATGTTCAT CGAAGCCGTA TTTATCATGA TTTGCAGTTT ATGCAGCAAG AGGGTGAGCC TATTTATCAA GATGGGACGC GCTGGTATCT TGAACGTGAT CGCTATATTC ATCGCCTACC AATTAATTTA CGTGAAGCTT TAGCATTTTA TTTGGCTGCA CGATTGCTTT CAAAACAGAG CGATAAGCAT AATCCCTATG TTGTTTCGGC ATTAGATAAA TTAGGCAGCG CTTTAGCCAA TAATCATCGT AATATTGGTT TGCATATTCA ACGGGCAGCC AATGTTGTCC GTCAACGCCC ATCGGATCAA GCCTTTACCA AAATTTTCGA GACCTTTGCA CGGGCTTGGG CTGACCAATG TCGCGTTGAA GTAACCTATC TGTCCGCTAA ATCCAAATAT AGTACTTGGG AGCAGCGAAC TATCTCACCT TATTATTTAG AAGTATCAGG GATTGGCTAT TCAACCTATG TGATTGGTCA CGATAACAAA TCCAATGCGA TTCGCACTTT TAAGCTCGAA CGGATTGCCG CCGCCAATCT CCGTCCCTTC GATACGTTTG AAATTCCTGA AACCTTTGAT CCACAGGAGC GGCTGGGCAA CGCATGGGGC ATTATCTGGC CAGCCGAAGG CGAGGAACCA GTTGATGTAC GCTTGCTTTT TGCGCCAGCG GTTGCCCATC GAGTCAAAGA GACCATTTGG CATCCCAGCC AGATTATCGA AGATTTGCCC AATGGTGGTT GCCGCTATTG TGTGCGGGTG GGCAGCACCT TAGAGATGCG ACCATGGGTG CGTGGTTGGG GTCGTGATGT TGCAGTGGAG TGGCCATTAG CGTTTCGTCA AGAGATGATC GACGAGTTGC AGGCAGCTTT AGCGTTGTAT CAATAA
|
Protein sequence | MSDDRLRARA ARLYGIGRYL YERGTQGARV VDLAQYFDVH RSRIYHDLQF MQQEGEPIYQ DGTRWYLERD RYIHRLPINL REALAFYLAA RLLSKQSDKH NPYVVSALDK LGSALANNHR NIGLHIQRAA NVVRQRPSDQ AFTKIFETFA RAWADQCRVE VTYLSAKSKY STWEQRTISP YYLEVSGIGY STYVIGHDNK SNAIRTFKLE RIAAANLRPF DTFEIPETFD PQERLGNAWG IIWPAEGEEP VDVRLLFAPA VAHRVKETIW HPSQIIEDLP NGGCRYCVRV GSTLEMRPWV RGWGRDVAVE WPLAFRQEMI DELQAALALY Q
|
| |