Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1557 |
Symbol | |
ID | 6974967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1735722 |
End bp | 1736861 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643391088 |
Product | homocitrate synthase |
Protein accession | YP_002275951 |
Protein GI | 209543722 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.319892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGAAA ACGCGCGTCT GCTGATCAGC GACACCACCC TGCGCGATGG AGAGCAGGCG CCGGGGGTGG CCTTTACCGC GTGGGAAAAG CTGGCCATCG CCGGCGCCCT CGACGCGGCC GGCGTGGATG AGATCGAGGC CGGCGTTCCG GCCATGGGGG ATGCCGAGAT CGCCATGATC GCGGCGATCG GCGACGAGGT GGAACGCGCC CGCGTCATCC CCTGGTGTCG CATGCGCGAC GAAGACGTGC ACGCGGCGCG GCGGACGGGG CTGGGCAGCG TGCACCTGTC CGTTTCGACC TCGGTCCGGC AGATCCAGGC GAAATACCGC ATGTCGCCGC GTCGCACGCT GGACATGGCG CGCGACGTCG TCTCCCGCGC CAGGGATTAC GGCCTGGCGG TGTCGGTGGG GGGCGAGGAC GCCAGCCGGG CCGATCCGGC CTATCTGGTC GATCTGCTGG GCGTGATCGC CGAGGCAGGG GCGTTCCGCT TCCGCTTCGC CGATACGCTG GGCGTGATGG ACCCGTTCGG CGTGCACGAG GTCATGCGGT ATCTCTGCCA GTCCAGCCCG CTTCAGCTTG AATTTCACGG CCATGACGAT CTGGGCCTGG CAACGGCCAA TACGCTGGCC GCCGTGCGGG CCGGCGCGGC GTGCGCCAGT GTCACCGTCC TGGGACTGGG CGAACGCGCG GGCAACGCCC CGCTGGAGGA AGTGGTGGCG GGGGTATATC GCCTGCTGGG CCGTCCGGCG GGCGTGAAGC TGGACAGCCT GCCCGGGCTT GCCACCCTGG TCTCGCGCGC CGCCGTGCGT GATATCCCGC CCGACAAGGC GATCGTCGGC GACGCGGTCT TTCGTCATGA ATCGGGCATC CATGTCTCGG GCCTGCTGCG TGACGCCGCG ACGTATGAGG CACTGGACCC CGTGCAGTTC GGCCGCCAGC GCGAAATCGT GCTGGGCAAG CATTCCGGCC GGGCCGCGGT ACGGCACGCG CTGGCGGCGC TGGGCCTGGA TGCCGACGAA ACCGTCATCG CGGCCACGCT TGCCGCCGTG CGCGCGCGTG CCTCCGCCGC CAAGCGCACG GTGGCGCTGG CCGAACTGGC CGAAATGCAT GCGGGCCTGA TGGCAGGCGT GTCGAAATAA
|
Protein sequence | MSENARLLIS DTTLRDGEQA PGVAFTAWEK LAIAGALDAA GVDEIEAGVP AMGDAEIAMI AAIGDEVERA RVIPWCRMRD EDVHAARRTG LGSVHLSVST SVRQIQAKYR MSPRRTLDMA RDVVSRARDY GLAVSVGGED ASRADPAYLV DLLGVIAEAG AFRFRFADTL GVMDPFGVHE VMRYLCQSSP LQLEFHGHDD LGLATANTLA AVRAGAACAS VTVLGLGERA GNAPLEEVVA GVYRLLGRPA GVKLDSLPGL ATLVSRAAVR DIPPDKAIVG DAVFRHESGI HVSGLLRDAA TYEALDPVQF GRQREIVLGK HSGRAAVRHA LAALGLDADE TVIAATLAAV RARASAAKRT VALAELAEMH AGLMAGVSK
|
| |