Gene Information |
Locus tag | Franean1_5931 |
Symbol | |
ID | 5674252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7203844 |
End bp | 7205205 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641244779 |
Product | hypothetical protein |
Protein accession | YP_001510181 |
Protein GI | 158317673 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
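
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
start_bp = 7203844
end_bp = 7205205
gene_length_bp = 1362
protein_length_aa = 453


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length)     # 1362
print(computed_protein_length)  # 453
```

Both computed values agree with the Gene Length and Protein Length fields listed in the record.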
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.090162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00187432 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAAGGCG TCTTCGTCGG CCTGACGACG CTTGATTCGG TCTACCTCGT GGATCGGCTT CCAGGTGCCG ACGAGAAGTG TGTGGCGCGG GATTTCGCCA TGAATGCGGG CGGGCCCGCG ACGAACGCCG CGGTGACGTT CGCCTACCTC GGCGGGCGGG CCTGCCTCGT GAGCGCCATC GGCACGTCCC CGGCCGCGGC GCTCGTGCAT GCGGATCTCG CGCGCTACGG GGTCCGGCAC ATCGAGCTGG TGCCGGAGCA GTCCGGCGAC GGTGCCGCCG GTCCCTACGG AGCCCTCGCC GCGCAGGGAG TGGCCGGTCC CGCCGGCCAG CACGGCCCGC CCGGCAAGCC CACCGGGCTG CGCTCCGGCG GCACCCGCTC CGGCTACGGC CCGGCGGGAC CGCTCACCGG ATACCCGCAC GTGCGCGGCC GGGGCGGGCC CGGCCTGCCC GGCGGCGCGA CCCACAGCCC CGCCGGGGCG GTCAAGCCAG CCGGCCACGT GCACGGCCCG GTCGGCGGCG CCCACGGCGG CGGGCACGGT GGGCACGGCA GCGGACCCGG CGGTGGCCAC GGTGGGCCTG GCGGTCACGG TGCGATCGGC GGTCACGGCG GCCACGGCGG GATCGGCGGC CACGGGGTGG CCGGCGGGGC CGGGCTCGGT GGAGCCGGCG TCCTCGGCGG GATGGCCGGC CTGGCCGGTG CCGTCCAGAC CGGGCCGAAC ACGGCGGCCG CCGGCCACCT GGCCGGCCAG TCGGCGATGT CCTACGCCCT CCCGATGTCA GCGGTCATGG TGACCTCGCA GACCGGTGAG CGTGCGGTGA CCTCGACGCA CGGCATGGTC CCCCGGTGCA CCGCGAACAC CTCCGCCGCG GCCGCGGTCG CCGACGCCGA CGTGGTCGTG CTCGACGGCC ACCAGGTCGA CGCCGCCATC GGCCTGCTCC GCACGCTGCG CGGCTCAGGC CCGCCGGTCC TCCTCGACGG CGGAAGCTGG AAGCCCGGCA CCGAGCAGAT CCTGCCGTTC GTCGACGTCG TGATCTGCTC GACCGCGTTC CGCCCCCCGG GCTTCGACCC CGGCGCGGAC ATCCTCGGCC TGCTGCTGCG CTACGGGCCG TTCTTCGTCG CCGTCACGGA CGGCCCCGGG CCCATCCGCT GGGCCACGGC GGAACGGCGC GGCCACGTGC TGCCCCCGGT GGTGGCCGCC CGCGACACCC TGGGCGCCGG CGACGTCTTC CACGGTGCCT TCGCCTGGAT GATGGCCCAC GGCGCGCTCG CGACCGACGA GCTGGTCGGA GCCCTCGGCG AGGCCTCACG GGTCGCCGCC CGCTCCGTCC AGACCTTCGG CCCCCGCAGC TGGATGACCT GA
|
Protein sequence | MKGVFVGLTT LDSVYLVDRL PGADEKCVAR DFAMNAGGPA TNAAVTFAYL GGRACLVSAI GTSPAAALVH ADLARYGVRH IELVPEQSGD GAAGPYGALA AQGVAGPAGQ HGPPGKPTGL RSGGTRSGYG PAGPLTGYPH VRGRGGPGLP GGATHSPAGA VKPAGHVHGP VGGAHGGGHG GHGSGPGGGH GGPGGHGAIG GHGGHGGIGG HGVAGGAGLG GAGVLGGMAG LAGAVQTGPN TAAAGHLAGQ SAMSYALPMS AVMVTSQTGE RAVTSTHGMV PRCTANTSAA AAVADADVVV LDGHQVDAAI GLLRTLRGSG PPVLLDGGSW KPGTEQILPF VDVVICSTAF RPPGFDPGAD ILGLLLRYGP FFVAVTDGPG PIRWATAERR GHVLPPVVAA RDTLGAGDVF HGAFAWMMAH GALATDELVG ALGEASRVAA RSVQTFGPRS WMT
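
The sequences above are displayed in space-separated blocks of 10 letters, so the spaces have to be removed before any computation. The following is a minimal sketch of that cleanup, together with one plausible way to compute a GC percentage such as the GC content field. The record does not state how the database computes or rounds that value, so this is an assumption. The helper names are hypothetical, and the example input is a toy string, not the gene above.

```python
def clean_sequence(raw):
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input for illustration only.
toy = "ATGC GGCC"
print(clean_sequence(toy))  # ATGCGGCC
print(gc_percent(toy))      # 75.0
```

To apply this to the record, pass the full Gene sequence value as the argument and compare the rounded result with the GC content field.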
|
| |