Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4046 |
Symbol | |
ID | 5736916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5164278 |
End bp | 5165237 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281197 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001546806 |
Protein GI | 159900559 |
COG category | [S] Function unknown |
COG ID | [COG1426] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTGTT TCTTACGGAC ATTCGTCCGA TTGGGCCCAG TTATAGTGAA CATGAGATTG CTTGGTGGCA GCGTTGCCAC AACTGCACCT TCATCCACTC CGTTCCCCAT GGACGAATAC CAGCATGCAG ACAACCATCA TCATGGCACA AGGCAAAGCA TGACAAACGC TGGACAAAAC AGATTAGGCC TTCGATTTCG CGAAGCCCGC GAAGCCCGTG GCATTTCACT GGCTCAAGCT TCAAGTGAAA CCCGAATTAT CCAACGCTAT TTGGCGGCCC TCGAAAATGG TGAATATCAT CATCTGCCGG GCGATGTTTA TGCCCGTGGT TTTATTCGCA ATTATGCTCA ATATTTGAAT CTGCCCGCAG ATGAGTTGAT CGATCTGTAT CGGATTGAGC GCGGTGCTTC AACCCCGATC CGGATTGTGC CAGCCGCAGT GCCACCCCGT CGGAACACGA TTTTTCTGCC CAGTCTTTGG ACGGTAATTT TGGTGGTGCT GGCGTTGGTC GTCATTGGCT ATCTGACGCT GAATGCGCTT GGGCTTACGA CAATCAATAG TACGTCAGTG TCTGGTGGTG CAACCACGAC GTTGGCTATT GCCACCCCAA CCTTACTCGC AACCCCAACC GCTCAACCAA CCAATCCCGA TGGTTCTGTC CCATCGCCAA TTAATCCATT GGCAACCCCA ACCGTTACGA TAACTCCGAC CCAAGATGTG CCAGTCCAGG TTGTCTTGCG GATTGATGGC GGTTCGTCGT GGTTGCAGGT CTTGGTTGAT GGCCAAAATA CGATTGAAGG GATTCAAAAT AACGGCTGGA CGCAGACATT TTCAGCGCAG CAAACGATTC AGGTCAAGGC TGGTAACGCT GCCGTGGTCG AGGTGATTCA CAATGGCAAG CCGCCAGTTC GCATGGGTGC ACCCAACCAA GTTGTGACCA CCATCTATAC GCCCAACTAA
|
Protein sequence | MACFLRTFVR LGPVIVNMRL LGGSVATTAP SSTPFPMDEY QHADNHHHGT RQSMTNAGQN RLGLRFREAR EARGISLAQA SSETRIIQRY LAALENGEYH HLPGDVYARG FIRNYAQYLN LPADELIDLY RIERGASTPI RIVPAAVPPR RNTIFLPSLW TVILVVLALV VIGYLTLNAL GLTTINSTSV SGGATTTLAI ATPTLLATPT AQPTNPDGSV PSPINPLATP TVTITPTQDV PVQVVLRIDG GSSWLQVLVD GQNTIEGIQN NGWTQTFSAQ QTIQVKAGNA AVVEVIHNGK PPVRMGAPNQ VVTTIYTPN
|
| |