Gene Information |
Locus tag | Haur_2503 |
Symbol | |
ID | 5734384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3197273 |
End bp | 3198187 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279643 |
Product | dihydrodipicolinate synthetase |
Protein accession | YP_001545269 |
Protein GI | 159899022 |
COG category | [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | |
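The coordinate and length fields in this record are consistent with one another, which a little arithmetic confirms. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 3197273
end_bp = 3198187

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 915
print(gene_length % 3)  # 0
print(protein_length)   # 304
```

Both results agree with the Gene Length (915 bp) and Protein Length (304 aa) fields.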
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.143344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCATGT GGCATGGAGT TATTCCAGCA ATCACCACCC GTTTTAATCA CGATGGTAGC GTTGATCATG GATTTTTGGC CGAGCACTGT GCTTGGATGT TGGATGCTGG CTGTGTTGGG ATCGTGCCGC TTGGCTCTTT GGGCGAGGGT GCAACGCTTA GCGCCGCCGA AAAACAGGCA ATTTTACAAA CCTGCGTCAA AGTGGCTGGC TCGAACCCAG TAATTCCAGG CATCGCGGCA CTTTCAACTG CCGAAGCCTG CCAGCTAGCC CAAATCGCCG CTGCCGAAGG CTGCTCAGGC CTGATGGTGC TGCCACCTTA TCTCTACTCG ACCGATTGGC GCGAAATGCA AGCTCATATG AGCGCTGTGA TCAGCGCCAC CGAGTTGCCA GTGATGATCT ACAACAATCC GGTGGCCTAT CGCACCGATT TTGTGCCCAG CCAAATTGCC GAATTAGCAC AACGCCATGC CAATGTGCAA GCGGTCAAAG AATCGAGCAC CGACGTGCGA CGGGTGACGG CGATTCGGGC CGAACTTGGC CAACGCCTCG AAATTCTGGT TGGGGTTGAT GATGCAATTG TTGAGGGCAT CGCGGCTGGG GCGGTTGGCT GGATCGCAGG CTTGGTCAAT GCTTTTCCGC ACGAATCGGT TGAGCTATTT CAACTAGCCC AAGCCGTTGC CGCAGGACAT GGTGATCGCG CCCGACTCGA CGCGATTTAC ACTTGGTTCT TGCCATTGTT GCGGCTTGAT ACTGTGCCAA AATTTGTTCA ATTGATTAAA CTCACCCAAG CGATGGTCGG CATGGGCAGC GAAACTGTGC GAGCACCACG GCTTGAGTTA GTTGGAGCCG AGCGCGAAGC TGCCGTCAGC GTGATTGAAC AAGCCTTGGC GCGGCGGCAG GAGTTATGGC CATGA
Protein sequence | MSMWHGVIPA ITTRFNHDGS VDHGFLAEHC AWMLDAGCVG IVPLGSLGEG ATLSAAEKQA ILQTCVKVAG SNPVIPGIAA LSTAEACQLA QIAAAEGCSG LMVLPPYLYS TDWREMQAHM SAVISATELP VMIYNNPVAY RTDFVPSQIA ELAQRHANVQ AVKESSTDVR RVTAIRAELG QRLEILVGVD DAIVEGIAAG AVGWIAGLVN AFPHESVELF QLAQAVAAGH GDRARLDAIY TWFLPLLRLD TVPKFVQLIK LTQAMVGMGS ETVRAPRLEL VGAEREAAVS VIEQALARRQ ELWP
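The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, not the database's own pipeline. It assumes the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle table 11's alternative start codons (this gene starts with ATG, so that does not matter here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. To keep the example short it is run only on the first 30 and last 15 bases of the gene sequence.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a frame-aligned DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 15 bases of the gene sequence in this record.
gene_start = "ATGAGCATGT GGCATGGAGT TATTCCAGCA"
gene_end = "GAGTTATGGC CATGA"

print(translate(gene_start))  # MSMWHGVIPA
print(translate(gene_end))    # ELWP
```

These fragments match the first ten and last four residues of the protein sequence above. Passing the full 915 bp gene sequence to `translate` and `gc_percent` would be the natural way to check the whole protein and the GC content field.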