Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4707 |
Symbol | |
ID | 5736554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6012859 |
End bp | 6014058 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281871 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001547466 |
Protein GI | 159901219 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTAC AATCGCTTCT TGAAAAAGCA GTCAATGGTG AGCGTCTTTC GGCTGAGGAA GGGCTGTTTT TGTATCGTGA AGCTGACCTG CTGGATTTAG GCATGGCTGC TGATGCGATT CGCCAACGGC TTGTGCCCGG AAACTTAGTG ACGTATCTTG TTGATCGCAA CGTCAATTAT AGTAATGTTT GCACGACCAA TTGCCAATTC TGTGGTTTTT ATCGCCCATT GGGTCACCCA GAATCCTATG TGCAAACCTA CGAAGAAATT GGTGCTCGTT TAGCCGAGCT TGTCGAATAT GGTGGAACCC GCGTGCTGAT GCAGGGTGGC CATCATCCTG AATTGCGGCT TTCGTGGTAT ACCGGCTTAC TGAGCTATTT GCGCGAACAT TATCCTAGCA TCGAGCTTGA TTGCTTCTCG CCTTCAGAAA TCGATAATCT GGCTAAGGTC GAGAACATGA GCATTCGCGA GGTGTTGATT GCGCTGAAGG AAGCTGGCTT GGCTGGCGTG CCTGGCGGCG GCGCTGAAAT TCTCGATGAT GAAGTGCGCC AACGGGTTAG CCCACTCAAG CAAGATGCTG CTGGTTGGCT CGAAGTTCAG GGTACAGCGC AATCGCTGGG CATGGCCACC ACCGCAACTA TGGTGATTGG CTTCGGCGAG TCGCTTGAAC AACGTATGAA TCACCTTATG AAATTGCGTG ATTTGCAGGA TGTTTCGTTG CGCGAGTATA ACAATGGCTT TATTGCCTTT ATTTCGTGGA CCGTCCAGAA ATCGGAGTTG ACCTCGTTGG GTCGCTCGAA GTTCTCGGAC GACTACGGGG CAACCGCTGC CGAATATTTG CGCCACACCG CGCTTGCCCG AATTATGCTC GATAACTTTA ATCATATTCA GGCTTCATGG GTAACTCAAG GACCAAAGAT TGGCCAAGTT TCGCTGAAGT TTGGCATCGA TGATTTTGGC TCGACCATGC TTGAGGAAAA TGTGGTTTCC AACGCCGCTC ACGACACCTA CCAATGTATG GGCGAACGCG AAATCCACGG TTTTATTCGC GATGCTGGCT TTATTCCAGC CAAACGCGAT ACCCACTACA ACTTGATTCA AGCTTTTGAA GACCCCAAAG ATAGCGAAAA CGTGCCATCG ATGGGCATCA AACAGCCACG TCAAGCCAAG CAGGTCGAAA TTCCATTAAT GGTTAAATAG
|
Protein sequence | MNVQSLLEKA VNGERLSAEE GLFLYREADL LDLGMAADAI RQRLVPGNLV TYLVDRNVNY SNVCTTNCQF CGFYRPLGHP ESYVQTYEEI GARLAELVEY GGTRVLMQGG HHPELRLSWY TGLLSYLREH YPSIELDCFS PSEIDNLAKV ENMSIREVLI ALKEAGLAGV PGGGAEILDD EVRQRVSPLK QDAAGWLEVQ GTAQSLGMAT TATMVIGFGE SLEQRMNHLM KLRDLQDVSL REYNNGFIAF ISWTVQKSEL TSLGRSKFSD DYGATAAEYL RHTALARIML DNFNHIQASW VTQGPKIGQV SLKFGIDDFG STMLEENVVS NAAHDTYQCM GEREIHGFIR DAGFIPAKRD THYNLIQAFE DPKDSENVPS MGIKQPRQAK QVEIPLMVK
|
| |