Gene Information |
Locus tag | Francci3_3881 |
Symbol | ispH |
ID | 3906649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4643460 |
End bp | 4644488 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637881207 |
Product | 4-hydroxy-3-methylbut-2-enyl diphosphate reductase |
Protein accession | YP_482960 |
Protein GI | 86742560 |
COG category | [I] Lipid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0761] Penicillin tolerance protein |
TIGRFAM ID | [TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) |
|
|
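The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention, but both are consistent with the reported values. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 4643460
END_BP = 4644488
GENE_LENGTH_BP = 1029
PROTEIN_LENGTH_AA = 342


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the computed values agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.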
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCAT CGGGTGATGC ATCGCCTAGC CGACTTCCCG GCCGCCCGCC GCCGCGTAGA CTCCCACCCA TGGGCCGGGT CCTTCTTGCC AAACCGCGCG GCTACTGCGC TGGTGTCGAT CGAGCCGTCG TGACCGTGGA GAAAGCACTA GAGCTGTACG GTTCACCGGT CTACGTACGC AAACAAATCG TTCACAACCT TCATGTCGTG AAGACGCTGG AGGCCAAGGG CGCGATCTTC GTGGACGAAA CCGACGAGGT GCCGCACGGC GCCACCGTCG TGTTTTCAGC CCACGGAGTG GCACCAACGG TGCACGAAGA GGCGGCGGTG CGCGAGCTCC GCACAATCGA CGCCACGTGC CCTCTGGTCA CCAAGGTGCA TTCTGAAGCC AGGCGGTTCG CCCGTGAGGA CTACGACATC CTCCTCATCG GCCACGAGGG CCACGAGGAG GTCGTCGGCA CCACCGGGCA GGCTCCGGAC CGCATCCACC TCGTGGACGG GCCCGAGGAC GCCGCCGGGG TGAAGGTCCG CGATCCGGAG CGGGTGGCTT TCCTCTCCCA GACCACACTG TCCGTCGACG AGACGATGAC GACGGTTGAC GCGCTGCGCG AGCGCTTCCC GCATCTGCAG GGTCCACCGA GCGACGACAT CTGCTACGCC ACGCAGAACC GCCAGGTCGC CGTCAAGGAG ATCGCCGGCG CGGTCGACCT GGTCATCGTC GTCGGCTCGC GGAACTCCTC GAACTCGGTC CGGTTGGTCG AGGTCGCGCT CGACGCCGGC GCCCCGGCCG CCTACCTCGT GGACGACTCC ACCGAGGTGG ACCTGAGCTG GTTCGATGGT GTCGAGACCG TCGGGGTCAC CAGTGGCGCA TCGGTGCCGG AGGAACTCGT CACCGGCGTG ATGGCCTGGC TCGCCGAGCG GGGCTTCACC GATGTGGAGG AGGTCACGTC CGCGGACGAG CACCTTCTCT TCGCGTTGCC GCCGGAGCTT CGCCGGGAGA TGCGTACCCG CGAGCGCGCC GCCGGCTGA
|
Protein sequence | MSPSGDASPS RLPGRPPPRR LPPMGRVLLA KPRGYCAGVD RAVVTVEKAL ELYGSPVYVR KQIVHNLHVV KTLEAKGAIF VDETDEVPHG ATVVFSAHGV APTVHEEAAV RELRTIDATC PLVTKVHSEA RRFAREDYDI LLIGHEGHEE VVGTTGQAPD RIHLVDGPED AAGVKVRDPE RVAFLSQTTL SVDETMTTVD ALRERFPHLQ GPPSDDICYA TQNRQVAVKE IAGAVDLVIV VGSRNSSNSV RLVEVALDAG APAAYLVDDS TEVDLSWFDG VETVGVTSGA SVPEELVTGV MAWLAERGFT DVEEVTSADE HLLFALPPEL RREMRTRERA AG
|
| |
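The relationship between the Gene sequence and Protein sequence fields can be sketched as a codon-by-codon translation. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so the minimal example below builds the standard codon map. The helper names (clean, translate, gc_fraction) are hypothetical, and the code strips the 10-character grouping spaces used in this record before processing.

```python
# Codon map in TCAG order; amino acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove whitespace (including the 10-letter grouping) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon.

    Caveat: an alternative start codon (e.g. GTG or TTG) would need to be
    read as Met; this sketch does not handle that case.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate("ATGTCGCCAT CGGGTGATGC ATCGCCTAGC"))  # MSPSGDASPS
```

Pasting the full Gene sequence value into translate should allow a comparison with the Protein sequence field, and gc_fraction gives a value to compare against the reported GC content.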