Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mflv_3756 |
Symbol | aroB |
ID | 4975072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium gilvum PYR-GCK |
Kingdom | Bacteria |
Replicon accession | NC_009338 |
Strand | - |
Start bp | 4006404 |
End bp | 4007483 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640457980 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001135016 |
Protein GI | 145224338 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.209272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGAAC CGGTGACCGT CGATGTGCGG ACCGACCCGC CCTACCCGGT GATCATCGGC CGGGGCCTGC TGGGCGACCT GGGCCGCGTC CTCGACGGCA GGCACAAAGT CGCGATCCTG CATCAGCCGA CGCTGACCCA GACTGCCGAA GCGATCCGAA CTCACCTGTC GGAGAAGGGA ATCGACGCGC ACCGCATCGA GATCCCGGAC GCCGAGGGCG GCAAGGAGCT GCCGGTCGTC GGGTTCATCT GGCAGGTGCT GGGCCGCATC GGGGTCGGCC GCAAGGATGC GATCGTCAGT CTCGGCGGGG GAGCGGCCAC CGATGTCGCC GGATTCGCCG CGGCGACCTG GCTGCGCGGT ATCGACATCG TCCACGTTCC CACCACGCTG CTGGGCATGG TCGACGCCGC GGTGGGCGGC AAGACCGGCA TCAACACCGA CGCGGGCAAG AACCTGGTCG GTGCGTTCCA TCAGCCGGCC GCGGTGCTGA TCGACCTCGC GACGCTGGAG TCGTTGCCGC GCAACGAGAT CGTCGCGGGA ATGGCCGAGA TCGTGAAGGC CGGTTTCATC GCCGACCCGG TGATCCTCGA CATGATCGAG GCCGACCCGG AGGCGGCGCT CGATCCCTCC GGTGCGGTCC TCCCGGAGCT GATCCGGCGG GCTGTCGTCG TCAAGGCCGA GGTGGTCGCC GCCGACGAGA AGGAATCGCA GCTGCGCGAG ATCCTCAACT ACGGGCACAC TCTGGCGCAC GCCATCGAAC GCCGCGAGCG CTATCAGTGG CGCCACGGCG CCGCCGTGTC GGTCGGGCTG GTGTTCGCCG CGGAGCTGGG CCGGCTGGCC GGCCGGCTCG ACGACGACAC CACCGAACGC CATCGCGCAA TCCTGACCTC GCTGGGACTG CCGGTCACCT ACGACGCCGA CGCCCTGCCC CAGCTGATGG AGTCGATGCT GGGCGACAAG AAGACCCGCG CCGGAGTGCT GCGATTCGTG GTCCTCGACG GGTTGGCCAA ACCGGGCCGT CTGGAGGGGC CGGATCCGTC GCTGCTGGCC GCGGCCTACG CGGAGGTCGC GCGGGACTGA
|
Protein sequence | MTEPVTVDVR TDPPYPVIIG RGLLGDLGRV LDGRHKVAIL HQPTLTQTAE AIRTHLSEKG IDAHRIEIPD AEGGKELPVV GFIWQVLGRI GVGRKDAIVS LGGGAATDVA GFAAATWLRG IDIVHVPTTL LGMVDAAVGG KTGINTDAGK NLVGAFHQPA AVLIDLATLE SLPRNEIVAG MAEIVKAGFI ADPVILDMIE ADPEAALDPS GAVLPELIRR AVVVKAEVVA ADEKESQLRE ILNYGHTLAH AIERRERYQW RHGAAVSVGL VFAAELGRLA GRLDDDTTER HRAILTSLGL PVTYDADALP QLMESMLGDK KTRAGVLRFV VLDGLAKPGR LEGPDPSLLA AAYAEVARD
|
| |