Gene Information |
Locus tag | Haur_0540 |
Symbol | |
ID | 5732398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 625020 |
End bp | 625898 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641277667 |
Product | UBA/THIF-type NAD/FAD binding protein |
Protein accession | YP_001543316 |
Protein GI | 159897069 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
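The coordinate and length fields in the record above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information record.
START_BP = 625020
END_BP = 625898


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")                  # matches the Gene Length field
    print(protein_length(n_bp), "aa")  # matches the Protein Length field
```

Under those assumptions the script reproduces the 879 bp and 292 aa reported in the record.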
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATGC TCAGTTTTGA GCCTATTGCC CCGTTTGTAA TAGGAAACCA GTCACGCTTT CATATCACTA TTGTTGGGGC TGGTGGTACT GGCTCTTATA TCGTGCAAAC TGTGGCTCGG TTAATGGCGC ATGCCAATGA GAAGGGTTCC CCCAAAATCG ATGTAACGCT GGTTGATGGC GATACAATTG AAGCAAAAAA TATTGGTCGT CAACTCTTTG CCCCAGCCGA AATAGGCCGA AATAAAGCAC AATCATTGGC TACTCGGTTC AACCAAGCCC TAGGCCTCAA AATACAAGCT ATACCCACAA TGATCGATCA ATATTCGCTT CCGCATGTTG ATCGCTATCA ATGGCATCAA TCGGCTGCGC TTGATAAGCG GCCCTACTCA ATTGTTGTGG GCGCTGTCGA TAATGCCGAA GCACGCAAGG TTATTCATGA TGGTTTAGCT AAATATGTCT TTAACCTATG GATTGATAGT GGCAACCACG AGCATGGGGG ACAAGTGCTT GTAGGCAACT GCGTAGCACC GCATAAGCTA AAAGGAAGCT TTGCCCTTGA GGGAATGGTG CAATCATTAC CAGCACCATC GTTGATGGCA CCATCGCTGA TAACACCACC AGAGCCACTG CCTGAATCGC AGCAGCTTGA TTGTGCAACC GCGATGGAGA ATAACCGCCA ATCCTTACTT ATTAATCAGC AAATGGCCTC AATTGTTGGC CAATACTTAC ATCAAATCAT CTTTACCCAA CGCTTGACCA CTTTTGAAAC AATTGTCGAT ATGAGCACAC TCACGATGCG AAGCACCCCA ATCACGGTTT CTAATGTGGC TCAATGTCTT GGGGTAACGT CTGCATTTTT AAAAGGAATT AACGCATGA
|
Protein sequence | MQMLSFEPIA PFVIGNQSRF HITIVGAGGT GSYIVQTVAR LMAHANEKGS PKIDVTLVDG DTIEAKNIGR QLFAPAEIGR NKAQSLATRF NQALGLKIQA IPTMIDQYSL PHVDRYQWHQ SAALDKRPYS IVVGAVDNAE ARKVIHDGLA KYVFNLWIDS GNHEHGGQVL VGNCVAPHKL KGSFALEGMV QSLPAPSLMA PSLITPPEPL PESQQLDCAT AMENNRQSLL INQQMASIVG QYLHQIIFTQ RLTTFETIVD MSTLTMRSTP ITVSNVAQCL GVTSAFLKGI NA
|
| |
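The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the sequences are used as shown (10-character blocks separated by spaces) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a + strand CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 21 bases of the gene sequence in the record.
    prefix = "ATGCAAATGC TCAGTTTTGA G"
    print(translate(prefix))  # MQMLSFE, the first seven residues of the protein
```

Pasting the full Gene sequence value into `translate` and `gc_percent` allows a comparison against the Protein sequence and GC content fields. This gene begins with ATG, so the alternative start codons permitted by table 11 do not need special handling here.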