Gene Information |
Locus tag | Haur_4015 |
Symbol | |
ID | 5735876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5122023 |
End bp | 5122943 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281165 |
Product | hypothetical protein |
Protein accession | YP_001546775 |
Protein GI | 159900528 |
COG category | |
COG ID | |
TIGRFAM ID | |
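The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the values in this record, not something the record states) and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(5122023, 5122943)
print(length, protein_length(length))  # prints: 921 306
```

Both results agree with the Gene Length (921 bp) and Protein Length (306 aa) fields of this record.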
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACC GTACTCCCCC AACCGAAGTT CTCGAATTTG AAAACGATCT TGAGCCAGAT CCTGAACCGG AAGAGGTGCC TTCAGCAATT ATTCAAGAGG ATGAAGCGAT CGCGCCACAA GGCATGACTT TTCAAGCGTT AGTATTTGGC GTAGCCATTA TTCTGATTAT CTTTTTGGTA CTGCTGTTGG CGAGTGGCAA TAATATCAAT CTCTTTAATC TTTTTGGGAG CGATAGCAAT GCCCAGCCAA CCTTGCAACC TTTACTCAAT AGCGTTCAAA CCCAATCCAA TCGCTCGCTC GATGTTGAAG CGGCGGTGCT GGAAGGCCAA CGCTTTTATG GTGTTGATGG ATTAAACGCC CTGCCAAGTT ATGCTGCTGA GATCGACCCG CTGTTTTTGC CCTATTGGGT CAAGAATGGC GGCGAACGGA TCTTTGGACG ACCGATCAGT GGCTTGCTTG AAGAAAATGG CCGCCAATTT CAATGGTTTG AGCGGGCACG GCTTGAATGG TGGCCTGAGC ATCAAGCAAC GAAATACGAA GTGCAGCCAG GCCGAGTTGG GGTTGAATTC ACCCAAAACC GCGATTTTCC TAACCAGCAA TTTTTTGCCA ATCGCCCAGG GCTGCGTTAT AGCGAAGCGA CCCAACAAGG GTTGCGCGGG GCATTCCTTG AATTTTGGGA AGCCAACGGC GGCATCGATC TGCTGGGCTG GCCAATTAGC GAGGAGTTGC AAGAGGTGCT GCCTGAAGAT CGCACAGTGC ATACCGTGCA ATATTTCGAG CGTGGTCGGC TTGAGCTACA CCCCAACGAT CCCAATCAAA TGGTCAAAAT TGGCTTATTA GGCCGCGCCC TCTACTTCCA AGATAGTCAG CCAAACTACA TTGAGCCAGT TGCCCCCACC GCAGTTCCGG TCATTCCTTA G
|
Protein sequence | MSDRTPPTEV LEFENDLEPD PEPEEVPSAI IQEDEAIAPQ GMTFQALVFG VAIILIIFLV LLLASGNNIN LFNLFGSDSN AQPTLQPLLN SVQTQSNRSL DVEAAVLEGQ RFYGVDGLNA LPSYAAEIDP LFLPYWVKNG GERIFGRPIS GLLEENGRQF QWFERARLEW WPEHQATKYE VQPGRVGVEF TQNRDFPNQQ FFANRPGLRY SEATQQGLRG AFLEFWEANG GIDLLGWPIS EELQEVLPED RTVHTVQYFE RGRLELHPND PNQMVKIGLL GRALYFQDSQ PNYIEPVAPT AVPVIP
|
| |
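The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code), which assigns codons to amino acids in the same way as the standard code and differs only in its permitted start codons. The sketch below, using only the Python standard library, shows one way to strip the 10-base spacing from a pasted sequence, translate it, and compute GC content. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the inputs shown are short excerpts from the start and end of the gene sequence rather than the full record.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Alternative start codons allowed by table 11 are not special-cased here;
    this gene begins with ATG, so that does not matter for this record.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGCGACC GTACTCCCCC AACCGAAGTT"))  # prints: MSDRTPPTEV
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GTCATTCCTT AG"))  # prints: VIP
```

The two excerpts reproduce the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.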