Gene Information |
Locus tag | Haur_1114 |
Symbol | |
ID | 5733006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1275907 |
End bp | 1276977 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278253 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_001543890 |
Protein GI | 159897643 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
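The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1275907
end_bp = 1276977

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1071
print(protein_length)  # 356
```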
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.222874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCGACGA CGATTCAGGC CATCAGTGCC GAAGCGATTA ATTTGCCTTT GACCGAGCCG TTTGCAATTG CCAGCGGGGC GCAAGCGGTC GCTGCCAATG TTTTAGTCAA AGTTCAGTTG GCTGATGGCA CGCTTGGCTT AGGCGAGGCG GCTCCTTTTC CCGCTGTCAG CGGCGAAACC CAGACTGGCA CAAGCGCTGC GATTGAGCGC TTGCAAAGCC ATTTGCTGGG AGCCGATGTG CGTGGATGGC GCAAACTAGC GGCGATGCTG GATCATGCTG AACATGAGGC GGCTGCCGCT CGTTGTGGCC TTGAAATGGC CATGCTTGAT GCGCTAACTC GCCATTATCA CATGCCATTA CACGTATTTT TTGGCGGCGT GAGTAAGCAA CTTGAAACTG ACATGACGAT TACCGCAGGT GATGAGGTGC ATGCGGCGGC CTCAGCCAAG GCCATTCTTG CCCGTGGCAT CAAATCGATC AAAGTTAAAA CGGCTGGGGT TGATGTGGCC TATGATTTGG CGCGACTGCG GGCCATTCAT CAAGCAGCGC CCACCGCACC ATTAATTGTT GATGGCAATT GCGGCTATGA TGTCGAACGG GCACTAGCTT TTTGCGCTGC TTGTAAGGCC GAGTCAATTC CGATGGTGTT GTTTGAGCAA CCGTTGCCGC GCGAGGATTG GGCAGGCATG GCCCAGGTTA CGGCCCAATC AGGCTTTGCA GTTGCTGCTG ATGAATCTGC GCGTTCAGCT CACGATGTGC TACGGATCGC CCGCGAAGGT ACAGCCTCGG TGATTAACAT TAAATTGATG AAGGCTGGCG TAGCTGAAGG CTTGAAGATG ATTGCAATTG CCCAGGCTGC TGGCTTGGGA TTGATGATTG GCGGCATGGT TGAAAGTATT TTGGCCATGA GCTTTTCGGC CAATCTGGCG GCTGGTAATG GCGGCTTCGA TTTTATCGAT CTTGATACGC CATTATTTAT TGCCGAGCAT CCCTTTATTG GTGGCTTTGC TCAAACTGGC GGAACCCTGC AATTAGCCGA TGTTGCTGGC CATGGCGTGA ATCTGGCCTA A
Protein sequence | MPTTIQAISA EAINLPLTEP FAIASGAQAV AANVLVKVQL ADGTLGLGEA APFPAVSGET QTGTSAAIER LQSHLLGADV RGWRKLAAML DHAEHEAAAA RCGLEMAMLD ALTRHYHMPL HVFFGGVSKQ LETDMTITAG DEVHAAASAK AILARGIKSI KVKTAGVDVA YDLARLRAIH QAAPTAPLIV DGNCGYDVER ALAFCAACKA ESIPMVLFEQ PLPREDWAGM AQVTAQSGFA VAADESARSA HDVLRIAREG TASVINIKLM KAGVAEGLKM IAIAQAAGLG LMIGGMVESI LAMSFSANLA AGNGGFDFID LDTPLFIAEH PFIGGFAQTG GTLQLADVAG HGVNLA
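The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for removing that grouping and computing basic statistics follows. The helper names `ungroup` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def ungroup(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(grouped.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence in the record (not the full gene).
prefix = ungroup("ATGCCGACGA CGATTCAGGC")
print(len(prefix))                # 20
print(prefix.startswith("ATG"))   # True
print(gc_fraction(prefix))        # 0.6
```

Applied to the full gene sequence, the same helpers would be expected to report a length of 1071 and a GC fraction close to the 54% listed in the record; that result is not verified here.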