Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2163 |
Symbol | |
ID | 5736874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2728446 |
End bp | 2729549 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641279304 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001544931 |
Protein GI | 159898684 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATTC TGTTGGCGAA GTCGCCACTC TCTGATATTG CTGAAAAAGT TGAAGCAGGC GAACGCCTTT CGTTTGACGA TGGAATGCGT TTGTATCAAA CTAACGATAT TTTAGCCTTG GGTAAATTGG CCGATACGGT GAATCGACGC AAAAATGGCG ATGTGGTGTA TTTTGTGCAA AATCACCGCA TTACACCAAC CAATGTTTGT GCATTTCACT GTAATTTCTG CTCGTTTCGG CGTAATGGCA ACGAACCCGA TGCCTTTGTG CGCACTCCCG AACAAATTAT CGATCACGTT GGGCGTTTGT TTAGCGAACG TACCCGTGAA TTTCATATTG TCGGCGGTTT AGTGCCCGAT CTCGATGTTG AATATTATGC CGATATCATT CGTGAATTGA AGGATCATTA TCCCAATGTT CACGTCAAAG CCTTTACGGC AGTTGAAATT GATTATATGG CTCAAATTTC GCATCTTGAT TGGCGCACAA CCCTTGAGAT TTTGCGCAAG GCTGGGCTGG ATGCCTTGCC TGGTGGCGGT GCTGAAATTT TCCATCCAGC GGTGCGCCGT AAAATCTGCC CCGAAAAGGT TGATGGCGAT GGTTGGTTGG AAATTCATGG CATTGCTCAC GAATTAGGCA TCAAAACCAA TGCCACTATG CTCTATGGCC ATATCGAAAC CCTCGAACAA CGGGTTGATC ACTTGTTACG TTTGCGCGAA CAGCAAGATA AAACTGGCGG TTTTGTAACC TACATTCCGC TGGCTTTCCA CCCCGAAAAC AACAATTTAG GGCGGGTCAA AAAGCTCGAT TGGACGACAG GCTTCGAGGA TTTGAAGAAT TTGGCGATTG GCCGTTTGTT GCTCGACAAC TTTGCCCATG TCAAAGCCTA TTGGATCTCG CTCACGCCAC GGTTGGCCCA AGTCGCTTTG TCGTTTGGGG TTTCCGATGT TGACGGCACG GTGATCGAAG AAGAAATCTA TCACGCTGCT GGGGCTAAAA CCGAACAAGG TATCTCACGG GCAGAATTAG TTCATCTGGT GACGACTGCT GGCAAAACCG CAGTTGAGCG GGATGCACTT TATAATCACA TCGCTGTGAA CTAA
|
Protein sequence | MAILLAKSPL SDIAEKVEAG ERLSFDDGMR LYQTNDILAL GKLADTVNRR KNGDVVYFVQ NHRITPTNVC AFHCNFCSFR RNGNEPDAFV RTPEQIIDHV GRLFSERTRE FHIVGGLVPD LDVEYYADII RELKDHYPNV HVKAFTAVEI DYMAQISHLD WRTTLEILRK AGLDALPGGG AEIFHPAVRR KICPEKVDGD GWLEIHGIAH ELGIKTNATM LYGHIETLEQ RVDHLLRLRE QQDKTGGFVT YIPLAFHPEN NNLGRVKKLD WTTGFEDLKN LAIGRLLLDN FAHVKAYWIS LTPRLAQVAL SFGVSDVDGT VIEEEIYHAA GAKTEQGISR AELVHLVTTA GKTAVERDAL YNHIAVN
|
| |