Gene Haur_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1016 
Symbol 
ID5732920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1161510 
End bp1163102 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content51% 
IMG OID641278151 
Productmalate synthase 
Protein accessionYP_001543792 
Protein GI159897545 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATC GGCAACATGG CGTGAAAATC AACGCCCCTA TCACCCCCGC TGCGGCGGAA 
TTGTTGACGG AACCAGCCCT ACACTTCTTA GCTGCCTTGC ATCGCACTTT TGACCAAACT
CGCCGCGACT TATTGCTCGG ACGAGTGGAA CGCCAAAGCC GCCTTGATGC AGGCGAAAAC
CCTGATTTTC TCGCTGAAAC TGCCCATATT CGTGCTAGCG ACTGGCAAAT TGCGCCCATC
CCCGATGAGA TTCGTAATCG TCGCGTGGAA ATTACTGGGC CAATTGATCG CAAAATGATC
ATCAATGCGC TCAACTCTGG AGCCAATGTC TTCATGGCCG ACTGTGAAGA TGCAACCACT
CCAAGCTGGG ATAATTTGGT CAGCGGCCAA CTTAACTTGC GCGATGCGGT CAATCGGACG
ATAAGCTTCA CCAATGAAGC TGGCAAAGCC TATCAATTAA ACGATCAGGT TGCGGTGCTG
TTTGTGCGGC CTCGTGGCTG GCACTTGCTC GAAAAGCATG TCACCGTCGA TGGCGAACCC
TTGGCTGGTG GTCTGTTCGA CTTTGGTTTG TATTTGTTCC ACAATGCCAA AACCTTGCTC
GAACGTGGCT CGGCTCCTTA CTTCTATCTG CCAAAACTCG AAAGCCATCG CGAAGCCCGT
TTGTGGAATG ATGTGTTCGT GTTTGCCCAA AAGCAACTCG GCCTGCCCCA TGGCTCAATC
AAGGCAACGG TTTTGATTGA AACAATTTTG GCCGCCTTCG AGATGGACGA AATTCTGTAT
GAATTGCGCG ACCACTCGGC TGGCCTCAAC TGTGGCCGCT GGGATTACAT CTTCAGCTGC
ATCAAGAAAT TTGCTAAATT ACAACATTTT GTGCTGGCTG ATCGTGCTTT AGTGACGATG
ACTTCACGCT TTATGCGCTC ATATTCGTTG CTGGCGATCA AAACCTGCCA TCGCCGTGGT
GCTCACGCAA TGGGCGGGAT GGCTGCTCAG ATTCCGATCA AGCACGATGC CCAAGCCAAT
GCCGAAGCCC TCGCCAAAGT GCAAGCCGAT AAAGAGCGCG AAGCTCGCGA CGGCCACGAC
GGCACATGGG TCGCTCATCC AGGTTTGGTT CCGTTAGCTA AGGCCGCCTT TGATGCTTTG
ATGCCTGAAG CTAACCAAAT TGGCAAGCAG CTTGATGTTG AAATTACTGC CGATGATTTA
CTGCGCTTCG AGCCATCAGC GCCGATTACC GAGCAAGGCC TGCGCAAAAA TATCAGCGTT
GGCATCCAAT ATATCGAAGC TTGGTTGGGT GGCTTAGGCT GCGTGCCGCT GTACAACTTA
ATGGAAGATG CCGCAACCGC CGAAATCTCC CGTGCTCAAG TTTGGCAATG GGTACATCAA
CCTAATGGCA TTACCGAAGA TTTTCGCAAA ATCACCCTCG ATTGGGTGCG CGAGTTGATC
GTCGAAGAAC TGGCCAAGAT CGAACAAGAA GTTGGCGCAG AACGCTATCG CAACGGTCAT
TATGATCGGG CTAGCCAATT GTTTGATCAA TTGGTTGCCA ACCCAACCTT TACCGAATTT
CTCACGCTTC CTGCTTACGA ACAAATCGAT TAA
 
Protein sequence
MTDRQHGVKI NAPITPAAAE LLTEPALHFL AALHRTFDQT RRDLLLGRVE RQSRLDAGEN 
PDFLAETAHI RASDWQIAPI PDEIRNRRVE ITGPIDRKMI INALNSGANV FMADCEDATT
PSWDNLVSGQ LNLRDAVNRT ISFTNEAGKA YQLNDQVAVL FVRPRGWHLL EKHVTVDGEP
LAGGLFDFGL YLFHNAKTLL ERGSAPYFYL PKLESHREAR LWNDVFVFAQ KQLGLPHGSI
KATVLIETIL AAFEMDEILY ELRDHSAGLN CGRWDYIFSC IKKFAKLQHF VLADRALVTM
TSRFMRSYSL LAIKTCHRRG AHAMGGMAAQ IPIKHDAQAN AEALAKVQAD KEREARDGHD
GTWVAHPGLV PLAKAAFDAL MPEANQIGKQ LDVEITADDL LRFEPSAPIT EQGLRKNISV
GIQYIEAWLG GLGCVPLYNL MEDAATAEIS RAQVWQWVHQ PNGITEDFRK ITLDWVRELI
VEELAKIEQE VGAERYRNGH YDRASQLFDQ LVANPTFTEF LTLPAYEQID