Gene Information |
Locus tag | Franean1_5997 |
Symbol | |
ID | 5674318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7314567 |
End bp | 7315697 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244845 |
Product | inosine 5'-monophosphate dehydrogenase |
Protein accession | YP_001510247 |
Protein GI | 158317739 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase |
TIGRFAM ID | [TIGR01304] IMP dehydrogenase family protein |
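
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the Gene Length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(7314567, 7315697)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Both results match the Gene Length (1131 bp) and Protein Length (376 aa) fields of this record.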
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.833679 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0905948 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGACG TCGAAATCGG TATCGGGAAG AACGCGCGAG TGGGCTACGG CCTTGACGCC GTGGGCATCG TTCCCTCGCG GCGCACCCGG GACCCAGCCG ACGTGTCACT CGCCTGGGAG ATCGACGCCT ATCACTTCGA TCTGCCGATC GTCGCGGCGC CGGCCGACGC GGTGACCTCG CCGGAGTCCG CGATCGCCGT CGGGCGCCAG GGTGGGCTCG GCGTGCTGCA CCTCGAGGGG CTGTGGACCA GGCACGAGGA CCCCGAGCCG CTGCTCGAGG AGGTCGCGGA GCTGGGCGCG CGTTCCGGCG CGGTGGCGGC GACCCGGCGC CTGCGTGAGT TCTACGCGGC GCCGGTGCAG CCCGAGCTCA TCGGGGCGCG GCTGGCGCGG ATGCGGGAGG CCGGCGTGGT CACCGCCGCC GCGCTGCGCC CGCAGAAGGT CCGGGCACTG TGCCCGCACG TGCTGGCCGC GGGCGTGGAC CTCCTCGTCA TCCATGGCAC GGCGGTGTCG GCCGAGCACC AGTCCAGGCG CACTGAGCCG CTGAATCTCA AGCAGTTCAT AGGACAGCTC GACATCCCGG TGATCGTCGG TGGCTGCGCA TCCTTCTCCA CGGCACTGCA TCTCATGCGG ACGGGCGCGG CCGGCGTGAT CGTCGGAGTC GGTGCCGGCC TCGGCGACGA CACGGCCGAG ACCCTCGGAA TTGGTGTTCC GCTGGCGACT GCGATCGCCG ACGCGGCCGG CGCCCGGATG CGGTACCTCG ACGAGTCCGG CGGCCGGTAC GTCCACGTCA TCGCGCATGG CGACCTGCGC ACCGGCGGGG ACGCGGCGAA GGCCGTGGCC TGCGGAGCCG ACGCCGTGAT GGTGGATTCG CCGCTCGCGC AGGCGGTGGA CGCCCCTGGG CGGGGCTCGG TCTGGTCGAT GGAGATCCTG CACTCGGACC TGCCGCGCGG GCGGTGGGCG CCGGTCGAGA CGTCACGTAC CGTCGCCGAG ATCCTCACCG GGGGCGAGGT CGCGGCCGAG GACGGTGTGG CGAACATCGC CGGGGCCCTG CGGGCGGCGA TGGCCACAAC GGGCTACGCG ACGTTGAAGG AGTTCCAGAA GGCGGAGATC ATGATCGCCG CCGGGCGCTA G
|
Protein sequence | MADVEIGIGK NARVGYGLDA VGIVPSRRTR DPADVSLAWE IDAYHFDLPI VAAPADAVTS PESAIAVGRQ GGLGVLHLEG LWTRHEDPEP LLEEVAELGA RSGAVAATRR LREFYAAPVQ PELIGARLAR MREAGVVTAA ALRPQKVRAL CPHVLAAGVD LLVIHGTAVS AEHQSRRTEP LNLKQFIGQL DIPVIVGGCA SFSTALHLMR TGAAGVIVGV GAGLGDDTAE TLGIGVPLAT AIADAAGARM RYLDESGGRY VHVIAHGDLR TGGDAAKAVA CGADAVMVDS PLAQAVDAPG RGSVWSMEIL HSDLPRGRWA PVETSRTVAE ILTGGEVAAE DGVANIAGAL RAAMATTGYA TLKEFQKAEI MIAAGR