Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4467 |
Symbol | |
ID | 5736318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5713299 |
End bp | 5714372 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281630 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001547227 |
Protein GI | 159900980 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000156063 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGACGTT TAATTCGAGC TTTGTTGATT GTTGCCACCA TCGGCGCACT AGTTGTCGCG TGTGTTGCAA CCCTGTTTCT GCGTGAGTTA ACCCAGCCTG CTGGCGAGAG CAATATTGCC CAAGATTTTA CCATCGCTCC CAGCGAAAGT TTGGCGGTGA TCAGCAGCAA TCTGGAATCT GAAGGTTTGG TGCGGCGGGC AATTGTTTTC CGCGTATTCG CCGATTTACG TAATGCCGAG ACCGATTTGT ATCCTGGCAC CTACAAAATT AGCCCAAATA TGACGATCAA TCAGATTTTA GAGATGTTTC GGGTTGCCCC AGAAGTTCAA ACTGCGGTGC GCTTTACCGT GCCTGAAGGC TTGCGGATTG AAGAAATCGC GGCGGTGATT GAATCGACTG GCGTAGTTAG TGCCGATGAT TTCTTGGCTG TGGCCCGCGA TGGCTCGCAA TTTAAGGCCG ATTATAGCTT TTTATCCAGC TTGCCAGATA GCGCAACCTT GGAAGGCTAT CTCTTCCCTG ATACCTATGA AATCTTTTCT GATGCAACCA GCGAAGAGAT TATTCGCAAA ATGCTCGATA CCTTTGCAAT TCGCTGGGCT GATTCGCCGC TGAGCAGCGC CACGACCGGG CGTTCTGTCC ATGAAGTGGT GACTTTAGCC TCGATTGTGC AGCGTGAAGC CAGCAATAAC GAAGAAATGC CACGGATTGC TGCCGCCTTC TGGAATCGCC TGAAACCAGA ATTTGCTGGC AATCAGCTGG GAGCCGATCC GACAATTCAA TATATTTTAG GCGAATCAGG CAATTGGTGG CCAAAGCTTG ATCAGCTAAC GGTTGAACAA ATTAATAGTG CTGCTGGCCC TTATAACACA CGGGTCAACC CCGGCTTGCC ACCTGGGCCA ATTAGTGCGC CTGGTTTGTT TGCCTTGCAA GCCGCTGCCT CGCCTGCCGC CGAAGATGTG ACCTATTTTG TGACCAAGTG TGTGGCTGCT GGCGAACGCC CAACCCACAA CTTTACCAAC GACTATAGCG AATTTTTGCA ATTTCAAGAA GAGTTTTTGG CGTGTCCCAA ATAG
|
Protein sequence | MRRLIRALLI VATIGALVVA CVATLFLREL TQPAGESNIA QDFTIAPSES LAVISSNLES EGLVRRAIVF RVFADLRNAE TDLYPGTYKI SPNMTINQIL EMFRVAPEVQ TAVRFTVPEG LRIEEIAAVI ESTGVVSADD FLAVARDGSQ FKADYSFLSS LPDSATLEGY LFPDTYEIFS DATSEEIIRK MLDTFAIRWA DSPLSSATTG RSVHEVVTLA SIVQREASNN EEMPRIAAAF WNRLKPEFAG NQLGADPTIQ YILGESGNWW PKLDQLTVEQ INSAAGPYNT RVNPGLPPGP ISAPGLFALQ AAASPAAEDV TYFVTKCVAA GERPTHNFTN DYSEFLQFQE EFLACPK
|
| |