Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1915 |
Symbol | |
ID | 5733804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2308474 |
End bp | 2310057 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641279059 |
Product | hypothetical protein |
Protein accession | YP_001544686 |
Protein GI | 159898439 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCCAG TTTTAGTGTT GATCGAAACC AGTGGAATTC AACAGTACAT CTTTGGCAGT AACCGCCTAC GTGAACAAAT TGGCGCATCA CATCTCGTTG ATCAAGTGAC TGATACTTGG TTACAAGAAT ATGGTTTCTC GGAAGCTGAG CCTCTATTAC CACCAAAAGA ACTTCCCGCT GCCCAAATTC TTTGGAGTGG TGGCGGTAAT ACGCGGCTAT TATTTGCTAA CAGTGAATTA GCGGTTCAAT TTACCCGCAC ATGGAGCGAA AAACTATTGC GTGAAGCTCC CGATTTGCGC TGTTTGGTGG TACATCAAGC CTGGAATGAG AATTTAAATG AAACTCATCG GCAGGCGCTC GCACAGATGC GCCTTAAGAA GCAGACAACC CCAACCCAAA CACCCAGTCT TGGCCTTGCA GTGAATGCCT TTTGCCAATC AACCGGTATG CCCGCAACAG GTTTTAATCG AGCTACCGAT AAAGAAGCAC CAGAGTTGTA TCCAGTTTCG CATGCGGTTG CTGCAAAATT GGTTTCACTT AAAGATGCAA ACGACAGGTT GGATGAAACA TTACTTAGTG CTGAGCAAGC TAAAAAATAT GTATTTCCCT ACGATTTCGA TAACCTTGGG CGGAGTGAAC ATGAACATAG CTATATCGCC GTGGTGCATA TTGATGGCAA TGGTATGGGC AAGCGCTTTG CAGCAATTCA GCAAAAATAT CGGGATGATG ACCAAGGGTA TTTAGAAAAA GTACGCAAAT TATCTGAGCT AGTGAGATGC GCAAGCAAAA AAGCTTTACA GGAATGTGTA AGGCTTGTTA TAAAACAAAT TATTGATACG AAAGAGCTTA AAGCAAGCAT TCTGTCTGAT AAAAAGCCTC CTAAAAAAGC TTTTCGATTT CCGTCGTTAG AAAATGAGTC TGAACAGTTT TTCTATTTAA ACGATACACG ACAAGGAGAT CAAACATTTC TGCCGTTTCG GCCAATTATT TATGGTGGCG ATGATGTGAC CTTTGTGTGT GATGGACGGC TGGGAATTGC CCTCGCGATT AAGTTTCTCC AAGCGTTTAA CCAAGCAAGT GCTCCCGAAA ACTTGCATGG GTGTGCTGGG ATCGCAATTG TTAAGGCCCA CTACCCTTTT GCGCGAGCCT ACGATTTAGC CGAACAGTTG TGTAGCTCGG CTAAGCAGAA AATTGGCGAC CATAAGGCCT CAGCAATCGA TTGGCAAATT GCCATGACTG GAATTACTGG ATCATTGGGA GAGATTCGTG AGCGTGAATA TCAAACAACT TCTGGTGCAT CACTGAGTCT ACGTCCAGTT TCAATTACTG AGAATGCGCC ATCAGATATG GTTGATTGGA ATTCGGTAGC AGAAATTATT GATAACCTCC GATCGAGCGA ATGGACTGAT AAGAAAAATA AAATAATGCA ACTGCGCGAA GTTCTACGCC AAGGCTCAGA TAAAGTCGCT GAATTTGAAA AATTATACAA GATTGATATT AATCGAGGAT GGATTGATAA CGTATGTATG TACTTTGATC CAATTGAATT ACTTGATTTT TATTATCCGT TGGAGGCCCG ATGA
|
Protein sequence | MTPVLVLIET SGIQQYIFGS NRLREQIGAS HLVDQVTDTW LQEYGFSEAE PLLPPKELPA AQILWSGGGN TRLLFANSEL AVQFTRTWSE KLLREAPDLR CLVVHQAWNE NLNETHRQAL AQMRLKKQTT PTQTPSLGLA VNAFCQSTGM PATGFNRATD KEAPELYPVS HAVAAKLVSL KDANDRLDET LLSAEQAKKY VFPYDFDNLG RSEHEHSYIA VVHIDGNGMG KRFAAIQQKY RDDDQGYLEK VRKLSELVRC ASKKALQECV RLVIKQIIDT KELKASILSD KKPPKKAFRF PSLENESEQF FYLNDTRQGD QTFLPFRPII YGGDDVTFVC DGRLGIALAI KFLQAFNQAS APENLHGCAG IAIVKAHYPF ARAYDLAEQL CSSAKQKIGD HKASAIDWQI AMTGITGSLG EIREREYQTT SGASLSLRPV SITENAPSDM VDWNSVAEII DNLRSSEWTD KKNKIMQLRE VLRQGSDKVA EFEKLYKIDI NRGWIDNVCM YFDPIELLDF YYPLEAR
|
| |