Gene Information |
Locus tag | Francci3_3207 |
Symbol | aroB |
ID | 3906173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3799777 |
End bp | 3800865 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880531 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_482293 |
Protein GI | 86741893 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
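The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed, but neither is stated on the page. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3799777
end_bp = 3800865

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints "1089 0 362"
```

The result agrees with the Gene Length (1089 bp) and Protein Length (362 aa) rows.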
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0774582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTGA AGGCCTCCGA CATCGTGCGG ATCCCGGTAC GGCCGGGCGG CGGGCGTCCC TACGACGTCG TGCTCGGGGT GGGCCTGCTC GGCGAGCTCG CCGAGACCGT CGTCGGGCGG ACCCGCGCCG CGGTGATCCA TCCCGGGGCG TTGCGGGCCA CCGCGGACGC CGTCGTCGCC GACCTGCGGG AGAACGCGGG CGTGGAGGCG CACGCCATCG AGGTGCCCGA CGGCGAGGAG GCCAAGCAGC TGCGGATCGC CGGCTTCTGC TGGGACGTGC TCGGCCGGAT CGGCTTCACC CGGGACGACA TGGTGATCGG CCTCGGCGGC GGCACCGTGA CAGACCTGGC CGGGTTCGTC GCCGCGAGCT GGCTGCGCGG GGTGGACGTC GTCCAGGTGC CGACCACCGT GCTCGGCATG GTCGACGCGG CGGTCGGCGG GAAGACCGGC ATCGACATCG ACGCGGGCAA GAACCTGGTC GGGGCCTTCC ACCAGCCGCT CGGCGTGCTG TGCGATCTGG CGGCGCTGGA GTCTCTGCCG GCCGTCGAGG TACGCGCCGG GCTCGCCGAG GTCGTCAAGA CCGGTTTCAT CGCCGACGCG GCGATCCTCG ATCTGCTCGA CGCCGATCCG ACCGGCGCCG CGCATCTGCC CGAGCTGATC GAGCGGTCCA TCCGGGTCAA GGCCGAGGTG GTCTCCGGCG ATCCGCGGGA GGACGGCCGG CGGGAGATCC TCAACTATGG TCACACCCTC GGTCATGCCA TCGAGAAGGT CGAGCACTTC AGCTGGCGGC ACGGCGCGGC GATCTCGGTG GGCATGGTGT TCGCGGCGGA GCTGTCCCGG CTCGTCGTCG GCCTCGACGA CGCCACCGCC GACCGGCACC GTGAGCTGCT GACCCGCATC GGCCTGCCGG TGACCTACCG GGACGACCGG TGGGCGGCGC TGCTCGACGC GATGCGGGTC GACAAGAAGG CGCGGGGACG GCGGATGCGT TTCGTCGGCC TCGAGGCGCA GGGCCGCACC GTCATCCTGG ACAATCCGGA CGCTGGCCTG CTGATCGCGG CGTTCACCAC TGTCGCCGAG GGTCGGTAG
|
Protein sequence | MQVKASDIVR IPVRPGGGRP YDVVLGVGLL GELAETVVGR TRAAVIHPGA LRATADAVVA DLRENAGVEA HAIEVPDGEE AKQLRIAGFC WDVLGRIGFT RDDMVIGLGG GTVTDLAGFV AASWLRGVDV VQVPTTVLGM VDAAVGGKTG IDIDAGKNLV GAFHQPLGVL CDLAALESLP AVEVRAGLAE VVKTGFIADA AILDLLDADP TGAAHLPELI ERSIRVKAEV VSGDPREDGR REILNYGHTL GHAIEKVEHF SWRHGAAISV GMVFAAELSR LVVGLDDATA DRHRELLTRI GLPVTYRDDR WAALLDAMRV DKKARGRRMR FVGLEAQGRT VILDNPDAGL LIAAFTTVAE GR
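The gene and protein sequences above can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it assumes the sequence is pasted in the space-separated 10-character blocks shown above. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Table 11 uses the
# same assignments as the standard code; only the set of permitted
# start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon.

    Note: this does not convert an alternative start codon (for example
    GTG) to M. The gene above starts with ATG, so that case does not arise.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Usage with the first three blocks and the last nine bases of the gene.
first_blocks = "ATGCAGGTGA AGGCCTCCGA CATCGTGCGG"
print(translate(first_blocks))   # prints "MQVKASDIVR"
print(translate("GGTCGGTAG"))    # prints "GR*"
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should reproduce the protein sequence, and `gc_fraction` on the full sequence can be compared with the 72% listed in the GC content row.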