Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3232 |
Symbol | |
ID | 5735100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4092375 |
End bp | 4093595 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280378 |
Product | hypothetical protein |
Protein accession | YP_001545997 |
Protein GI | 159899750 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000018306 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTATC GTCGAGGGTT GGCGTGTTTA TTACTTGGCT TAGTTGGCTG TAGCAGCGCT ACCACGCCCA CGAGCCAACT CCAACCAACC ACCAATGTGA TTGCCGTTGA GCCTACTAGT CTACCAGCAG CAACAATTAT TCCCACTGTG GCGGCAACGA TGGCCCCAAG CCCTGAAGTG ATGAGCGAAC TCCTGCCCCA GCCGTTGTAT GCGATTTCGT CGCAGCCCAA TGGTGTGCAA ATTATGCGCC TAGAAACTGA TGGCAACACC CTCACCTCAA TCACAAACGA ACCTAATGGT GTGTTTGATT ATCATGTCAG CACGCTTGGC CAAATTGCCT ACATTGCCGA TAATGACCTG ATTTTGGTTG ATGGCTTGGG TCAAAATCGC CAAATTTTGG TAGATGGCCC AACGATTAAC CCACAGGCTG ATACCAACAT TATGCATCGC GATCAATTGA ATACGCCACG CTGGTCGCCT GATGGCAGTT CGATCGCCTT TTCATGGGGC GGTTTGCAAA TCTACAATTT GATCACTGGC GAACAAACCA TGCTGCAACC TCACACCAGT AGCAGCGATT CAGCGCTTGC CGACATTTAC GATTACTCGG TATTGTATGG CCCCAACAGT TGGTCGCCCG ATGGGACAAC TATTCTAGCT AATGCTGGTT TTGTGCCCGA GGGTGGCAAT TTGGTCTTGT TTGATTTAGC CACATCTGAG CCATTAACGA TTACGATTGA TGGCAATTAT CCATGTTGCC ATGCTGCTTG GGAGCCAGAT GGCAGTGCAA TCAATGTCAC CAACGAAGCA TTTGGTATGT TTCCAACCGG CTGGTGGCAT ATCAATCCGA GCACTGGCGA ATCGAAGGCC TTGATCGATG GCGATAATGT TGAGCCATTT CAGCCAGTTT CGAGTGTTTT CATGCAGCCC GATGGGAGTC ACCTCGCCTT TGTGGGCACA GTCAATAGCA GCGATCCTAT GCAGATTTCG GCTCCTTTGA CCTTGAGCCA AGTTGGGCTT GATGGCACAA TTAGAGCTTT GCGCCAAGAT AGCCAAACGG TTGGCAGTGT GTTGTGGTCG CCTGATGGCA GCGGGGCAGT GATTGTGCCT TCGATGATGG CCGAAACTCA ACTCCAATGG GTTCCGAGTA ATGCTAGCCC AGTGGCTAAT TTGGCCTTGA GCGGGGCTAA TATGCAGTGG GCAACGACCT TAGATCAGTG A
|
Protein sequence | MRYRRGLACL LLGLVGCSSA TTPTSQLQPT TNVIAVEPTS LPAATIIPTV AATMAPSPEV MSELLPQPLY AISSQPNGVQ IMRLETDGNT LTSITNEPNG VFDYHVSTLG QIAYIADNDL ILVDGLGQNR QILVDGPTIN PQADTNIMHR DQLNTPRWSP DGSSIAFSWG GLQIYNLITG EQTMLQPHTS SSDSALADIY DYSVLYGPNS WSPDGTTILA NAGFVPEGGN LVLFDLATSE PLTITIDGNY PCCHAAWEPD GSAINVTNEA FGMFPTGWWH INPSTGESKA LIDGDNVEPF QPVSSVFMQP DGSHLAFVGT VNSSDPMQIS APLTLSQVGL DGTIRALRQD SQTVGSVLWS PDGSGAVIVP SMMAETQLQW VPSNASPVAN LALSGANMQW ATTLDQ
|
| |