Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4196 |
Symbol | |
ID | 5736058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5352901 |
End bp | 5353956 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281351 |
Product | aspartate-semialdehyde dehydrogenase |
Protein accession | YP_001546956 |
Protein GI | 159900709 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0136] Aspartate-semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR00978] aspartate-semialdehyde dehydrogenase (non-peptidoglycan organisms) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0158843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGA TTCCTGTTGG TGTGCTAGGC GCAACAGGGA TGGTTGGACA ACGCTTTTTG AGTTTGTTGA TCGATCACCC CTATTTTGAA GTAACGGTAG TTGCCGCTTC GGAGCGTTCG GCTGGTAAGA CCTTGGCCGA AGCTGGCCGC TGGATGATCG GCGGCGAAAT GCCTAGCCAA TTTGCCAAGA TGACCGTCCA GCCCGTTGAT CAGGTTGGCG ATGTTGGCTT AGTGTTCAGT GCCTTGCCCA GTGATGCTGC TGGCCCGACC GAAATTGCTT GGGCCAAATC AGGCGCGTTG GTTTTCTCAA ATGCTGGTGC ACATCGCCGC GATCCACTTG TGCCGCTGTT AGTGCCCGAA GTCAACCCTG ACCATGTTAA TTTGTTGGAA TTACAACGCC AACAGTATGG CTGGAGCGGC GGCATTTTGA CCAACCCCAA CTGCACAACC ACCCATGCTG TGTTGCCAAT GCGAGCTTTG CACGATGCCT TTGGCCTCAC CAAGGTTTTG TTGGTGAGCA TGCAAGCAAT TTCGGGCGCA GGCTATCCAG GTGTGCCAAG CCTCGACATT ATTGACAATG TTGTGCCGTT GATCAAAGGC GAAGAAGAAA AAGTTGAGTG GGAGCCACGC AAGCTTTTGG GTACGCTTGG CGCTCAGGGC GTGGAAGAAG CCCAAATCAC AATTAGCGCT CATTGTAATC GGGTGGCGGT TATTGATGGC CATACCGAGT GTTTATCATT GGCGTTCGAG CGTCCGCCCG CCGATGAAGC TGAATTGATC GCGGTCTTGC GTGAATTTAA GGCTGAACCA CAAGCGCTCA ATTTGCCAAG TGCGCCAGCC CACCCAATTT TGGTAACTGA ATTGGCCGAT GGCCCGCAGC CACGCCGTGA TCGCGATGCT GAGCGTGGTA TGGCGACGAC CGTTGGGCGC GTTCGCCGTT GCCCAATTCT TGATTACAAA TTAGTTTTGT TGGGGCACAA TACCTTGCGC GGAGCCGCTG GGGGTTCGTT GCTCAACGCT GAATTAATGG TTGCCAAAGG CTTGATCAAA GCCTAA
|
Protein sequence | MKKIPVGVLG ATGMVGQRFL SLLIDHPYFE VTVVAASERS AGKTLAEAGR WMIGGEMPSQ FAKMTVQPVD QVGDVGLVFS ALPSDAAGPT EIAWAKSGAL VFSNAGAHRR DPLVPLLVPE VNPDHVNLLE LQRQQYGWSG GILTNPNCTT THAVLPMRAL HDAFGLTKVL LVSMQAISGA GYPGVPSLDI IDNVVPLIKG EEEKVEWEPR KLLGTLGAQG VEEAQITISA HCNRVAVIDG HTECLSLAFE RPPADEAELI AVLREFKAEP QALNLPSAPA HPILVTELAD GPQPRRDRDA ERGMATTVGR VRRCPILDYK LVLLGHNTLR GAAGGSLLNA ELMVAKGLIK A
|
| |