Gene Information |
Locus tag | Haur_1016 |
Symbol | |
ID | 5732920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1161510 |
End bp | 1163102 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278151 |
Product | malate synthase |
Protein accession | YP_001543792 |
Protein GI | 159897545 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01344] malate synthase A |
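The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the protein length excludes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Haur_1016 record.
length_bp = gene_length(1161510, 1163102)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (1593 bp) and Protein Length (530 aa) fields.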
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACCGATC GGCAACATGG CGTGAAAATC AACGCCCCTA TCACCCCCGC TGCGGCGGAA TTGTTGACGG AACCAGCCCT ACACTTCTTA GCTGCCTTGC ATCGCACTTT TGACCAAACT CGCCGCGACT TATTGCTCGG ACGAGTGGAA CGCCAAAGCC GCCTTGATGC AGGCGAAAAC CCTGATTTTC TCGCTGAAAC TGCCCATATT CGTGCTAGCG ACTGGCAAAT TGCGCCCATC CCCGATGAGA TTCGTAATCG TCGCGTGGAA ATTACTGGGC CAATTGATCG CAAAATGATC ATCAATGCGC TCAACTCTGG AGCCAATGTC TTCATGGCCG ACTGTGAAGA TGCAACCACT CCAAGCTGGG ATAATTTGGT CAGCGGCCAA CTTAACTTGC GCGATGCGGT CAATCGGACG ATAAGCTTCA CCAATGAAGC TGGCAAAGCC TATCAATTAA ACGATCAGGT TGCGGTGCTG TTTGTGCGGC CTCGTGGCTG GCACTTGCTC GAAAAGCATG TCACCGTCGA TGGCGAACCC TTGGCTGGTG GTCTGTTCGA CTTTGGTTTG TATTTGTTCC ACAATGCCAA AACCTTGCTC GAACGTGGCT CGGCTCCTTA CTTCTATCTG CCAAAACTCG AAAGCCATCG CGAAGCCCGT TTGTGGAATG ATGTGTTCGT GTTTGCCCAA AAGCAACTCG GCCTGCCCCA TGGCTCAATC AAGGCAACGG TTTTGATTGA AACAATTTTG GCCGCCTTCG AGATGGACGA AATTCTGTAT GAATTGCGCG ACCACTCGGC TGGCCTCAAC TGTGGCCGCT GGGATTACAT CTTCAGCTGC ATCAAGAAAT TTGCTAAATT ACAACATTTT GTGCTGGCTG ATCGTGCTTT AGTGACGATG ACTTCACGCT TTATGCGCTC ATATTCGTTG CTGGCGATCA AAACCTGCCA TCGCCGTGGT GCTCACGCAA TGGGCGGGAT GGCTGCTCAG ATTCCGATCA AGCACGATGC CCAAGCCAAT GCCGAAGCCC TCGCCAAAGT GCAAGCCGAT AAAGAGCGCG AAGCTCGCGA CGGCCACGAC GGCACATGGG TCGCTCATCC AGGTTTGGTT CCGTTAGCTA AGGCCGCCTT TGATGCTTTG ATGCCTGAAG CTAACCAAAT TGGCAAGCAG CTTGATGTTG AAATTACTGC CGATGATTTA CTGCGCTTCG AGCCATCAGC GCCGATTACC GAGCAAGGCC TGCGCAAAAA TATCAGCGTT GGCATCCAAT ATATCGAAGC TTGGTTGGGT GGCTTAGGCT GCGTGCCGCT GTACAACTTA ATGGAAGATG CCGCAACCGC CGAAATCTCC CGTGCTCAAG TTTGGCAATG GGTACATCAA CCTAATGGCA TTACCGAAGA TTTTCGCAAA ATCACCCTCG ATTGGGTGCG CGAGTTGATC GTCGAAGAAC TGGCCAAGAT CGAACAAGAA GTTGGCGCAG AACGCTATCG CAACGGTCAT TATGATCGGG CTAGCCAATT GTTTGATCAA TTGGTTGCCA ACCCAACCTT TACCGAATTT CTCACGCTTC CTGCTTACGA ACAAATCGAT TAA
Protein sequence | MTDRQHGVKI NAPITPAAAE LLTEPALHFL AALHRTFDQT RRDLLLGRVE RQSRLDAGEN PDFLAETAHI RASDWQIAPI PDEIRNRRVE ITGPIDRKMI INALNSGANV FMADCEDATT PSWDNLVSGQ LNLRDAVNRT ISFTNEAGKA YQLNDQVAVL FVRPRGWHLL EKHVTVDGEP LAGGLFDFGL YLFHNAKTLL ERGSAPYFYL PKLESHREAR LWNDVFVFAQ KQLGLPHGSI KATVLIETIL AAFEMDEILY ELRDHSAGLN CGRWDYIFSC IKKFAKLQHF VLADRALVTM TSRFMRSYSL LAIKTCHRRG AHAMGGMAAQ IPIKHDAQAN AEALAKVQAD KEREARDGHD GTWVAHPGLV PLAKAAFDAL MPEANQIGKQ LDVEITADDL LRFEPSAPIT EQGLRKNISV GIQYIEAWLG GLGCVPLYNL MEDAATAEIS RAQVWQWVHQ PNGITEDFRK ITLDWVRELI VEELAKIEQE VGAERYRNGH YDRASQLFDQ LVANPTFTEF LTLPAYEQID
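The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the method used by the database: it builds the codon table from the compact 64-letter amino acid string used in NCBI genetic code listings (bases ordered T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so this table is assumed adequate here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGACCGATC GGCAACATGG CGTGAAAATC"))
```

The first 30 bases translate to MTDRQHGVKI, matching the first block of the protein sequence. The final 15 bases (GAACAAATCGATTAA) translate to EQID followed by a stop, matching the end of the protein.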