Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4851 |
Symbol | |
ID | 5736697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6181360 |
End bp | 6182658 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282017 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_001547609 |
Protein GI | 159901362 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATGTC AAGACGTGCG CCGCCAACTT GCGGCAGGAA TTCGTCCGGC GGTGCGTCCG CCCGAACGTG CTCAAATGGG TTTCCACATT GCTAATTGTG CCGATTGCTA TGCCTTATTA GAAGTATTGC AGGTTGAAGA ACAGCGCCAA TTATTAACGA CCCTGCTTGA AGAGCCAGTG CTTGCCGCGC CTACGCCTAC GCCAGCACTC ACCAAACCAA CTCGCCGCCG CTGGCCGCTG GCCGTTTTTG CTATGCTGGC TGTGCTGGTG CTTGGCGGTA GTTGGTGGTG GTGGAGCACG CCTGAGCAAG TGGTTGGGGT GCAAATTGAG CCAAGTAACA CGCCAGCGCC AGTTATTGCA GGCGGCATAG CTGAACCAAG CGTGATTCCA GTGATTCAGC CAACGGCTAC CGCCCAAGCT ATTGCCACAA CTGAGCCAAG CGTAACCCCG CGCCCAACCA GCACCCCGCG ACCGAGCGCA ACCCCGCGCC CATTAGCCGC CATGACAGTG GCAATTTTGG GCTTGGATCG ACGACCTGGC GAGACTGAGC CAGCGCGGGC TGATGCATTG TTGGTTTTGC ATCTCAATCC ACGCAATCAA AGCGCGGCCT TGGTTTCCTT GCCCCGCGAT TTATGGGTCG CTTTGCCGCC GGAATATGGC TTCTCGGTCA AATTAAATGC TGCTTATATG TATGGCGAGG GCGATAATGG CGATGCTGAA GCTGGAGCCG AATTGGCACG CCAAACCCTG AGCACCACCA TCGGCCAGCC AATTGATGCG GTGGTGGTTA CAACCTTTGA AGAAATGATC ACCATGGTCG ATTTAATTGG GGGCATTGAT GTTGATGTGC CCAAGGAAAT TTACGATTCG CGCTACCCAA CCTTTGATTA TGGCTATATG GAAGCGCATT TTTTGCCAGG CATGCAACAT ATGGATGGAG CAACGGCGCT GATTTATAGC CGCACCCGCC ATGCCGACAA CGATTTTGAG CGTGGTAAGC GTCAACAGCA GGTGCTTTTG GCAATTTTTG CCCGTTTGCA AAATCTCATG CAAGCCAATA ATAGCGTTGA CAACATCGAA TTGGTTAGCT CGTTGTACAA TACCCTCGAA TATACCAGCA TCGATTTGCC GACGACCTTG CGTTTGGTGC TGGCAATTCA GAATTTCGAG CCGGCTGGGA TTCAGCGCGA ATCGATTGAT TTAAACTATG GTTATGAAAC CAGCACCAGT GATGGTGCTT ACATCATTCA ACCAGATTTA CCAGCAATTC AATCGCGAAT TGCCGAATTA TTCAAGTAG
|
Protein sequence | MECQDVRRQL AAGIRPAVRP PERAQMGFHI ANCADCYALL EVLQVEEQRQ LLTTLLEEPV LAAPTPTPAL TKPTRRRWPL AVFAMLAVLV LGGSWWWWST PEQVVGVQIE PSNTPAPVIA GGIAEPSVIP VIQPTATAQA IATTEPSVTP RPTSTPRPSA TPRPLAAMTV AILGLDRRPG ETEPARADAL LVLHLNPRNQ SAALVSLPRD LWVALPPEYG FSVKLNAAYM YGEGDNGDAE AGAELARQTL STTIGQPIDA VVVTTFEEMI TMVDLIGGID VDVPKEIYDS RYPTFDYGYM EAHFLPGMQH MDGATALIYS RTRHADNDFE RGKRQQQVLL AIFARLQNLM QANNSVDNIE LVSSLYNTLE YTSIDLPTTL RLVLAIQNFE PAGIQRESID LNYGYETSTS DGAYIIQPDL PAIQSRIAEL FK
|
| |