Gene Information |
Locus tag | Haur_5269 |
Symbol | |
ID | 5737227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 51029 |
End bp | 51946 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641282433 |
Product | hypothetical protein |
Protein accession | YP_001548024 |
Protein GI | 159901779 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
| |
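The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 51029
END_BP = 51946
GENE_LENGTH_BP = 918
PROTEIN_LENGTH_AA = 305


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 918
    print(expected_protein_length(GENE_LENGTH_BP))  # 305
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.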
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.258018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCACC GGAACGAATC GGGGCGACTG ATGGGGCGAA TGCGCGTTGG AGGGTGGATG GGGGTGGGGA TGGTTGGTCT CGTGGCGTGG CTGCTTGCCC TGCCAGCAGT GTATCCAGCG GGGGCGGCGA CCGATCCGCC GCTGCATGGA GGCACGAATA CTGGGGCGGA TCGGCCCAGG AGGTCGGTGC CCACGCCAAT CCTCGATAGG GGTTGGGACT ATGCCTACCA CCAGGCCATG GGGGTCGATC CGCAGGTGGA ATTGGTTCCT GGGCAAACCT ATCCCTTTGT GCATGCGGCG CATGGGGTGA TCGACCAAAC GACGGGGGTG ACGATCTCCG TGATCAGACC CGCAGAGTCC ATGACTGACT TGAGCATTCT CTATGGGACG GGCATCCCGA ATAATGTTGC TCCGTACCAA TCCTGTGACC TAGAATTCGC CTTTGTCTTC GTCGGCAGCT TTCGTGGTCA ACCGGAACAA TCGCACTTTA TCGAACAGTG CATGATTCCG CCATACCCCA GCGGAACGCG GGTCTGGTAT CGGGTGTACG GAGCGCCGAG TGTTGGGGAA GACCCATTCT ATCGATTCTT ATTGCCCGGT CTCGGTCTTG AGCCGTGTAC GCAGATTAAC ATCCTCGGCC AATGTATCCA TCCAAGCCTA CTGTTTCCTG AGCTACACCT GCCGTATAAC TTCTACGATG TCGGCAATGC CAGTCCAACG CCGACCAATA CGCCAACGGA GACCCCGACC ACCACACCGA CCAATACGCC AACGGAGACC TCGACACCAA CGAATACACC AACCAACACG CCGACGGAGA CCCCGACACC GACCAGCACA CCAACGAGCA CGCCAACCGC CACCATGACC CCGATTCCGC CGCGCTCCAC CGTGTTTTTG CCGTGGGCGC AGAAGTGA
|
Protein sequence | MDHRNESGRL MGRMRVGGWM GVGMVGLVAW LLALPAVYPA GAATDPPLHG GTNTGADRPR RSVPTPILDR GWDYAYHQAM GVDPQVELVP GQTYPFVHAA HGVIDQTTGV TISVIRPAES MTDLSILYGT GIPNNVAPYQ SCDLEFAFVF VGSFRGQPEQ SHFIEQCMIP PYPSGTRVWY RVYGAPSVGE DPFYRFLLPG LGLEPCTQIN ILGQCIHPSL LFPELHLPYN FYDVGNASPT PTNTPTETPT TTPTNTPTET STPTNTPTNT PTETPTPTST PTSTPTATMT PIPPRSTVFL PWAQK
|
| |
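The gene and protein sequences above can be compared by translating the nucleotide sequence, and the GC content field can be recomputed the same way. The following is a minimal sketch in plain Python, not the method used by the database. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons (the two tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional TCAG order. The amino-acid
# assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First two and last two blocks of the Gene sequence field.
    print(translate("ATGGATCACC GGAACGAATC"))  # MDHRNE
    print(translate("CCGTGGGCGC AGAAGTGA"))    # PWAQK
    print(gc_fraction("ATGGATCACC"))           # 0.5
```

To check the whole record, pass the complete Gene sequence string to `translate` and compare the result with the Protein sequence field after removing its spaces; `gc_fraction` on the same string can be compared with the GC content field, which is presumably rounded to a whole percent.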