Gene Francci3_3573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3573 
SymbolispG 
ID3904512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4270849 
End bp4272003 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content67% 
IMG OID637880894 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_482654 
Protein GI86742254 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGTAA CTCTGGGCAT GCCAACCGCT CCGGCCCGTC CGCTCGGCAC GCGGCGGCAC 
AGCCGGCAGA TCCACGTGGG CAACGTCCTG GTCGGGGGTG ACGCTCCGGT CTCCGTTCAG
TCGATGTGCA CCACGTTGAC GTCGGACGTC AACGCCACGC TGCAGCAGAT CGCACAGCTG
ACAGCGTCGG GATGCCAGAT CGTCCGGGTC GCGGTGCCAA GCCAGGACGA CGCCGACGCC
CTCGCCGCGA TCGCCCGCAA GTCCCCGATC CCGGTGATTG CCGATATCCA CTTCCAGCCC
AAGTACGTCT TCGCCGCGAT CGACGCGGGC TGCGCCGCGG TCCGGGTCAA CCCCGGCAAC
ATCAAGGCTT TTGACGACAA GGTCGGGGAG ATTGCTCGCG CGGCGAAGGC CGCCGGCGTT
CCGATCCGGA TCGGGGTCAA CGCGGGTTCA CTCGACAAGC GGCTGTTGGC GAAGTACGGC
AAGGCCACGC CGGAGGCGCT GACGGAGTCG GCCTTGTGGG AATGCTCGCT GTTCGAGGAG
CACGACTTCC GTGACATCAA GATCTCGGTG AAGCACCACG ACCCGGTCGT CATGATCCAG
GCGTACCGGC TGCTCGCCCA GGCCTGCGAC TACCCGCTGC ACCTCGGTGT CACCGAGGCC
GGACCGTCCT TCCAGGGCAC GGTCAAGTCC GCGGTCGCCT TCGGGGCCCT GCTCGCCGAG
GGAATCGGTG ACACGATCAG GGTGTCGCTG TCGGCACCGC CGGTCGAGGA GGTGAAGGTC
GGCACCGCGA TCCTGGAGTC CCTGGGACTT CGGCAGCGTA AGCTCGAAAT CGTATCCTGC
CCTTCCTGCG GTCGGGCTCA GGTCGATGTC TACACCCTTG CCAATCAGGT CAGCGCCGGT
CTCGAGGGCA TGGAGGTCCC GTTGCGCGTC GCCGTCATGG GCTGCGTCGT GAACGGGCCG
GGCGAGGCCA GGGAGGCCGA TCTCGGCGTC GCATCCGGGA ACGGCAAGGG TCAGATCTTC
GTCCGAGGTG AGGTCGTGAA GACCGTTCCG GAGGCGCAGA TCGTGGAAAC CCTCATTGAG
GAAGCCATGC GGCTGGCCGA GGAGATGGCG GCGGACGGCA CCCCGTCCGG CGAACCCTCG
GTTTCCGTGG GTTAA
 
Protein sequence
MTVTLGMPTA PARPLGTRRH SRQIHVGNVL VGGDAPVSVQ SMCTTLTSDV NATLQQIAQL 
TASGCQIVRV AVPSQDDADA LAAIARKSPI PVIADIHFQP KYVFAAIDAG CAAVRVNPGN
IKAFDDKVGE IARAAKAAGV PIRIGVNAGS LDKRLLAKYG KATPEALTES ALWECSLFEE
HDFRDIKISV KHHDPVVMIQ AYRLLAQACD YPLHLGVTEA GPSFQGTVKS AVAFGALLAE
GIGDTIRVSL SAPPVEEVKV GTAILESLGL RQRKLEIVSC PSCGRAQVDV YTLANQVSAG
LEGMEVPLRV AVMGCVVNGP GEAREADLGV ASGNGKGQIF VRGEVVKTVP EAQIVETLIE
EAMRLAEEMA ADGTPSGEPS VSVG