Gene Information |
Locus tag | Haur_0222 |
Symbol | |
ID | 5732117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 258893 |
End bp | 260092 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277346 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_001543002 |
Protein GI | 159896755 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0133] Tryptophan synthase beta chain |
TIGRFAM ID | [TIGR00263] tryptophan synthase, beta subunit |
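
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, since the record does not state it. A minimal sketch of the arithmetic follows; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(258893, 260092)
print(length, protein_length(length))  # 1200 399
```

Both values match the Gene Length (1200 bp) and Protein Length (399 aa) fields listed above.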
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.580627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACCGATC ACGCTGTTCT TGATGAATTA AATGGCCGCT ATGGGGATTT CGGTGGACGC TATGTGCCAG AAACCTTGAT GGCCGCGATC GAAGAATTAA CCGAAGCCTT TTTTCGGATT CGCACCGACC CTGAGTTTCA GGCTGAACTC CAACATTTGC ACCAGACCTA TACGGGCCGA CCAACTGCCC TCACCTATGC CCGCCGCTTG ACCGAGGAAT TGGGTGGTGC TCAAATTTGG CTCAAACGCG AAGACCTGAC CCACACTGGC GCACATAAAA TCAATAATGC CTTAGGGCAA GGCTTGTTGG CCAAACGCAT GGGCAAACAG CGGATCATCG CTGAAACTGG CGCTGGCCAG CATGGCGTTG CTACCGCTGC CGTTTGTGCC CTGCTTGGGC TGCAATGTGT GGTCTATATG GGCACCGAAG ATATGGAGCG CCAAAAGCCC AATGTCTTTC GTATGCGCTT GCTGGGAGCC GATGTGCGTG GAGTCAGCAC TGGCTCGAAA ACCCTCAAAG ATGCAGTTAA CGAAGCCATG CGCGATTGGG TCAGCAACCC CGATTCGTAC TATTTGCTTG GCTCGGCGCT TGGCCCACAC CCTTATCCAT TGATGGTACG CGAATTTCAA AGCATCATCG GAATTGAAGC CCGCGAGCAA ATTTTAGCAG CAACTGGCAA ATTGCCCAAC ACGATTATTG CTTGCGTTGG GGGTGGCTCG AACGCAATCG GGATGTTCCA CGCCTTTATC AACGATGAAC ATGTTGATTT GCGAGGAGTT GAAGCTGGTG GTCATGGAAT TGAGCTTGGT CGCCATGCAG CGCGGTTTGC AGGCGGGCGC TTGGGCGTTT TCCAAGGCAC CCGTTCGTAT GTGCTGCAAA ATAGCGATGG CCAAATTGCC AATACCCATA GCATTTCTGC TGGTCTCGAT TATGCTGCTG TAGGCCCAGA GCACGCTTGG CTCCACGACG AGGAACGGGC TTTCTATACC TATGCCACCG ACGAAGAGGC CTTGAATGGT TTTCAAATGC TCTGTCGAAC TGAAGGCATT ATCCCAGCCT TAGAATCGTC GCATGCGATT GCCGAAGCTG TACGTTTAGC CCCAACCATG AGCAAAGAAA GCATTATTTT GGTCAATCTG TCGGGGCGTG GCGATAAAGA TATTTTCACC GTTGCAGATG TATTGGGAGT GCAAATGTAG
Protein sequence | MTDHAVLDEL NGRYGDFGGR YVPETLMAAI EELTEAFFRI RTDPEFQAEL QHLHQTYTGR PTALTYARRL TEELGGAQIW LKREDLTHTG AHKINNALGQ GLLAKRMGKQ RIIAETGAGQ HGVATAAVCA LLGLQCVVYM GTEDMERQKP NVFRMRLLGA DVRGVSTGSK TLKDAVNEAM RDWVSNPDSY YLLGSALGPH PYPLMVREFQ SIIGIEAREQ ILAATGKLPN TIIACVGGGS NAIGMFHAFI NDEHVDLRGV EAGGHGIELG RHAARFAGGR LGVFQGTRSY VLQNSDGQIA NTHSISAGLD YAAVGPEHAW LHDEERAFYT YATDEEALNG FQMLCRTEGI IPALESSHAI AEAVRLAPTM SKESIILVNL SGRGDKDIFT VADVLGVQM
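
The two sequence fields can be cross-checked by stripping the 10-character grouping and translating the gene sequence codon by codon. The sketch below is one possible way to do that, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two tables differ only in which codons may act as start codons). Since this gene begins with ATG, no special start-codon handling is attempted. The function names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above:
print(translate("ATGACCGATC ACGCTGTTCT TGATGAATTA"))  # MTDHAVLDEL
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field gives a quick consistency check; `gc_fraction` on the same input can be compared against the GC content field.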