Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3166 |
Symbol | |
ID | 5735038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3998559 |
End bp | 3999815 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280309 |
Product | dihydropteroate synthase |
Protein accession | YP_001545931 |
Protein GI | 159899684 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00152005 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAAT TTGCCTATAA CATCCGTCAA TATTATCCTC AGCAACGCCA AGAACTGTTG CAGATGATCG CCCAAATTGA AACCTATCCC AATGCTGCTG AGCGCGTATT GACCAAAGCC AACTTGATTA TGCTGCATTG CGATCAGGTT GATCCACATA CGGCCATGAT TGTGAAGCAA GAATTATTGG CGCTCGATGG CGATGCCTTG GTTAGCCCAC ACGTCTATCT GGGCCAAAGC ACGCAGCCAA CCAATCTGCT GGCGTGGGCC AATGAACGCT CGTGGCGAGC CTTATGTGGC AAGTTTCAAG CGATTCCTTT ACCTGCCTTG CAAGCCTTAG CCCAACAAAT TGGCGCGTTG CTGTTGCATA ATCAAGCACG AGGCAGCCTC AAACTTGGCG CAACTCAATG GGATTGGGGT CGAAAAACCT TGGTGATGGG CATTGTCAAT GTTACACCCG ATTCTTTCTC TAACGATGGA TTGCTTGAGG TCGGAACCAG CCAGATTCAG CAGCAAGCGC TTGAGTTTGC CGCAGCTGGA GCCGATATTT TGGATGTTGG CGGTGAATCG ACGCGGCCTG GAGCCAGCAG TGTCAGTATC GCACAGGAGA TTGCGCGGGT TGTGCCAGCG ATTCAAGCGA TTCGTCACGT TTGCCAATTG CCAATTTCGA TTGATAGCTA CAAAGCTGAG GTTGTGGCGG CTGCGCTTGA AGCTGGTGCA AATGTGGTTA ATGATATTTG GGGTTTGCGC CAAGCCGATG GTAGTTGGAA TACGGCACTG GCGCAGTTGG TGGCACAAGC AAACGTGCCA ATTATTTTGA TGCACAATCG AGTCAGCACG GTTGAGCAAT TTGCCCATGG CACAAATTAC GCTGCTAGCG ACTATGGCGA TATTATCGGC GAAGTTTGTG CCGAATTACG CCAAAGCATC GATTTCGCCC TGCAAGCGGG CATTGCCAAC GATTTAATTG TGCTTGATCC AGGCATTGGT TTCGGCAAAA GCCCTGAACA AAATCTACAA GTATTACGTC AACTACGGAC AATTGCAAGC TTAGGCTACC CGTTGTTGGT TGGCACTAGC CGAAAATCCA TGATTGGGAT AACATTAAAC CGACCTGTTG ATCAACGCCT GTGGGGCACA GCCGCCACCG TGGCCTATGC AATTCAGGCA GGAGCCGATA TTGTGCGGGT GCACGATGTT GCGGCAATGG TCGATGTTTG TCGAATGACC GACGCTTTAG TTCGTCACGA AGGATAG
|
Protein sequence | MPEFAYNIRQ YYPQQRQELL QMIAQIETYP NAAERVLTKA NLIMLHCDQV DPHTAMIVKQ ELLALDGDAL VSPHVYLGQS TQPTNLLAWA NERSWRALCG KFQAIPLPAL QALAQQIGAL LLHNQARGSL KLGATQWDWG RKTLVMGIVN VTPDSFSNDG LLEVGTSQIQ QQALEFAAAG ADILDVGGES TRPGASSVSI AQEIARVVPA IQAIRHVCQL PISIDSYKAE VVAAALEAGA NVVNDIWGLR QADGSWNTAL AQLVAQANVP IILMHNRVST VEQFAHGTNY AASDYGDIIG EVCAELRQSI DFALQAGIAN DLIVLDPGIG FGKSPEQNLQ VLRQLRTIAS LGYPLLVGTS RKSMIGITLN RPVDQRLWGT AATVAYAIQA GADIVRVHDV AAMVDVCRMT DALVRHEG
|
| |