Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3573 |
Symbol | ispG |
ID | 3904512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4270849 |
End bp | 4272003 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637880894 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_482654 |
Protein GI | 86742254 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTAA CTCTGGGCAT GCCAACCGCT CCGGCCCGTC CGCTCGGCAC GCGGCGGCAC AGCCGGCAGA TCCACGTGGG CAACGTCCTG GTCGGGGGTG ACGCTCCGGT CTCCGTTCAG TCGATGTGCA CCACGTTGAC GTCGGACGTC AACGCCACGC TGCAGCAGAT CGCACAGCTG ACAGCGTCGG GATGCCAGAT CGTCCGGGTC GCGGTGCCAA GCCAGGACGA CGCCGACGCC CTCGCCGCGA TCGCCCGCAA GTCCCCGATC CCGGTGATTG CCGATATCCA CTTCCAGCCC AAGTACGTCT TCGCCGCGAT CGACGCGGGC TGCGCCGCGG TCCGGGTCAA CCCCGGCAAC ATCAAGGCTT TTGACGACAA GGTCGGGGAG ATTGCTCGCG CGGCGAAGGC CGCCGGCGTT CCGATCCGGA TCGGGGTCAA CGCGGGTTCA CTCGACAAGC GGCTGTTGGC GAAGTACGGC AAGGCCACGC CGGAGGCGCT GACGGAGTCG GCCTTGTGGG AATGCTCGCT GTTCGAGGAG CACGACTTCC GTGACATCAA GATCTCGGTG AAGCACCACG ACCCGGTCGT CATGATCCAG GCGTACCGGC TGCTCGCCCA GGCCTGCGAC TACCCGCTGC ACCTCGGTGT CACCGAGGCC GGACCGTCCT TCCAGGGCAC GGTCAAGTCC GCGGTCGCCT TCGGGGCCCT GCTCGCCGAG GGAATCGGTG ACACGATCAG GGTGTCGCTG TCGGCACCGC CGGTCGAGGA GGTGAAGGTC GGCACCGCGA TCCTGGAGTC CCTGGGACTT CGGCAGCGTA AGCTCGAAAT CGTATCCTGC CCTTCCTGCG GTCGGGCTCA GGTCGATGTC TACACCCTTG CCAATCAGGT CAGCGCCGGT CTCGAGGGCA TGGAGGTCCC GTTGCGCGTC GCCGTCATGG GCTGCGTCGT GAACGGGCCG GGCGAGGCCA GGGAGGCCGA TCTCGGCGTC GCATCCGGGA ACGGCAAGGG TCAGATCTTC GTCCGAGGTG AGGTCGTGAA GACCGTTCCG GAGGCGCAGA TCGTGGAAAC CCTCATTGAG GAAGCCATGC GGCTGGCCGA GGAGATGGCG GCGGACGGCA CCCCGTCCGG CGAACCCTCG GTTTCCGTGG GTTAA
|
Protein sequence | MTVTLGMPTA PARPLGTRRH SRQIHVGNVL VGGDAPVSVQ SMCTTLTSDV NATLQQIAQL TASGCQIVRV AVPSQDDADA LAAIARKSPI PVIADIHFQP KYVFAAIDAG CAAVRVNPGN IKAFDDKVGE IARAAKAAGV PIRIGVNAGS LDKRLLAKYG KATPEALTES ALWECSLFEE HDFRDIKISV KHHDPVVMIQ AYRLLAQACD YPLHLGVTEA GPSFQGTVKS AVAFGALLAE GIGDTIRVSL SAPPVEEVKV GTAILESLGL RQRKLEIVSC PSCGRAQVDV YTLANQVSAG LEGMEVPLRV AVMGCVVNGP GEAREADLGV ASGNGKGQIF VRGEVVKTVP EAQIVETLIE EAMRLAEEMA ADGTPSGEPS VSVG
|
| |