Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4206 |
Symbol | |
ID | 5736068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5359688 |
End bp | 5360632 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281361 |
Product | malate dehydrogenase |
Protein accession | YP_001546966 |
Protein GI | 159900719 |
COG category | [C] Energy production and conversion |
COG ID | [COG0039] Malate/lactate dehydrogenases |
TIGRFAM ID | [TIGR01763] malate dehydrogenase, NAD-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTAATC GTAAGAAAGT AACCATCATC GGGGCTGGAT TTGTTGGCTC AACCTGTGCA CACTGGTTGG CAAGCAAAGA ACTCGCCGAT GTTGTTTTAG TCGATATTGT TGAGGGCATT CCTCAAGGCA AAGGCTTGGA TTTACTACAA TCAGGGCCAG TTGAAGGCTT TGATGTCAGC GTGATCGGCA CCAACAGCTA TGAAGAAACC ACCGATTCAG ATGTGGTAAT TTTGACCTCA GGTGCGCCAC GCAAACCAGG CATGACCCGC GAAGATTTGC TCAAAATCAA CGCTGAAATT ACCAAATCAA ACATCGAAAA GGTTGCCAAA ACCTCGCCTA ACGCTTGTAT CATTGTGGTC AACAACCCAA TGGACACGAT GACCTACCTC GCTCGCGTGG CTTCAGGCTT CCCCAAAGAG CGCGTGATGG GTCAAGGTGG CGTGTTGGAT GCAGCTCGCT ATCGCACTTT CTTGGCGCAA GAATTGAATG TTTCAGTTGA AGATATTCAA GCCATGTTGA TGGGCGGCCA CGGCGATGAA ATGGTTCCAT TGCCACGCTA CACCACGGTT TCAGGGATTC CTGTAACCGA ATTTATCAGC GCTGAACGCT TGAACCAAAT CGTCGAACGC ACCAAAAAGG GTGGCGGCGA AATCGTTTCA TTGCTCAAAA CTGGCTCAGC CTACTATGCT CCTGCCGCTG CTACCATCCA AATGGTTGAA GCGATCTTGA AGGATAAGAA GCGTGTTCTG CCAGCCGCTG CTTATCTCGA AGGCGAATAT GGCATTAACG ATCTCTACTT CGGCGTGCCA GTTGTTTTGG GCGCTGGTGG TGTTGAAAGA ATTCTCGAAT TGCCATTGAG CGACGACGAA AAGGCCTTGA TGGCCAAATC AGCCGACTTG GTTCGCAGCT CAGTCGATAC CTTACGCACC TTGATCGATT TCTAA
|
Protein sequence | MANRKKVTII GAGFVGSTCA HWLASKELAD VVLVDIVEGI PQGKGLDLLQ SGPVEGFDVS VIGTNSYEET TDSDVVILTS GAPRKPGMTR EDLLKINAEI TKSNIEKVAK TSPNACIIVV NNPMDTMTYL ARVASGFPKE RVMGQGGVLD AARYRTFLAQ ELNVSVEDIQ AMLMGGHGDE MVPLPRYTTV SGIPVTEFIS AERLNQIVER TKKGGGEIVS LLKTGSAYYA PAAATIQMVE AILKDKKRVL PAAAYLEGEY GINDLYFGVP VVLGAGGVER ILELPLSDDE KALMAKSADL VRSSVDTLRT LIDF
|
| |