Gene Francci3_3881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3881 
SymbolispH 
ID3906649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4643460 
End bp4644488 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content67% 
IMG OID637881207 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_482960 
Protein GI86742560 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCAT CGGGTGATGC ATCGCCTAGC CGACTTCCCG GCCGCCCGCC GCCGCGTAGA 
CTCCCACCCA TGGGCCGGGT CCTTCTTGCC AAACCGCGCG GCTACTGCGC TGGTGTCGAT
CGAGCCGTCG TGACCGTGGA GAAAGCACTA GAGCTGTACG GTTCACCGGT CTACGTACGC
AAACAAATCG TTCACAACCT TCATGTCGTG AAGACGCTGG AGGCCAAGGG CGCGATCTTC
GTGGACGAAA CCGACGAGGT GCCGCACGGC GCCACCGTCG TGTTTTCAGC CCACGGAGTG
GCACCAACGG TGCACGAAGA GGCGGCGGTG CGCGAGCTCC GCACAATCGA CGCCACGTGC
CCTCTGGTCA CCAAGGTGCA TTCTGAAGCC AGGCGGTTCG CCCGTGAGGA CTACGACATC
CTCCTCATCG GCCACGAGGG CCACGAGGAG GTCGTCGGCA CCACCGGGCA GGCTCCGGAC
CGCATCCACC TCGTGGACGG GCCCGAGGAC GCCGCCGGGG TGAAGGTCCG CGATCCGGAG
CGGGTGGCTT TCCTCTCCCA GACCACACTG TCCGTCGACG AGACGATGAC GACGGTTGAC
GCGCTGCGCG AGCGCTTCCC GCATCTGCAG GGTCCACCGA GCGACGACAT CTGCTACGCC
ACGCAGAACC GCCAGGTCGC CGTCAAGGAG ATCGCCGGCG CGGTCGACCT GGTCATCGTC
GTCGGCTCGC GGAACTCCTC GAACTCGGTC CGGTTGGTCG AGGTCGCGCT CGACGCCGGC
GCCCCGGCCG CCTACCTCGT GGACGACTCC ACCGAGGTGG ACCTGAGCTG GTTCGATGGT
GTCGAGACCG TCGGGGTCAC CAGTGGCGCA TCGGTGCCGG AGGAACTCGT CACCGGCGTG
ATGGCCTGGC TCGCCGAGCG GGGCTTCACC GATGTGGAGG AGGTCACGTC CGCGGACGAG
CACCTTCTCT TCGCGTTGCC GCCGGAGCTT CGCCGGGAGA TGCGTACCCG CGAGCGCGCC
GCCGGCTGA
 
Protein sequence
MSPSGDASPS RLPGRPPPRR LPPMGRVLLA KPRGYCAGVD RAVVTVEKAL ELYGSPVYVR 
KQIVHNLHVV KTLEAKGAIF VDETDEVPHG ATVVFSAHGV APTVHEEAAV RELRTIDATC
PLVTKVHSEA RRFAREDYDI LLIGHEGHEE VVGTTGQAPD RIHLVDGPED AAGVKVRDPE
RVAFLSQTTL SVDETMTTVD ALRERFPHLQ GPPSDDICYA TQNRQVAVKE IAGAVDLVIV
VGSRNSSNSV RLVEVALDAG APAAYLVDDS TEVDLSWFDG VETVGVTSGA SVPEELVTGV
MAWLAERGFT DVEEVTSADE HLLFALPPEL RREMRTRERA AG