Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0938 |
Symbol | |
ID | 5669352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1097189 |
End bp | 1098820 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239865 |
Product | hypothetical protein |
Protein accession | YP_001505300 |
Protein GI | 158312792 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0169298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.687676 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGACA TCCCACGGCG AGCGGTCGTC CGCACCGCCA AGCTCGCGAC CCTCCCGATC GGCATCGCCG GCCGGGCGAC ACTCGGCGTC GGAAAGCGCC TCGGCGGCAG GCCCGCCGAG GCCGTCGCGA CCGAGCTGCA GCAGCGCACC GCGGCGCAGA TCTTCCGGGT GCTCGGCGAG CTCAAGGGCG GGGCCATGAA GCTCGGGCAG GCGCTGTCCG TCTTCGAGGC CGCGCTGCCC GACGAGGTCG CCGGCCCCTA CCGCGCCGCG CTGACGAAAC TGCAGGAGGC GGCGCCACCG CTGCCCGCCG CCACGGTGCA CAAGGTCCTC GCCGAGGAGC TCGGCCCCGA ATGGCGGGCC CTGTTCGCCA GCTTCGACGA CACACCCGCC GCCGCCGCGA GCATCGGCCA GGTGCACCGC GCCGTGTGGG CGGACGGCCG GCCCGTCGCG GTGAAGATCC AGTACCCGGG GGCGGGCTCG GCCCTGCTGG CTGATCTGAA CCAGCTGGGC CGCGCCGCGC GGCTGTTCGG CGCGCTGACG CCCGGTCTCG ACATCAAGCC GCTGGTCGCC GAGCTCAAGG CACGCATCAC CGAGGAGCTC GACTACCGGC TCGAGGCCGC CTGGCAGCGG GCGTTCGCGC AGGCCTACGC CGACGACCCG GACATCGTCG TCCCCCGGCC GATCGCCGGG GCGGACCGGG TCCTGGTCAG CGAGTGGATC GACGGGGTGC CGCTGTCGAC CATCATCGAC CGGGGGACGC AGGAGGAGCG GGACCGCGCC GGCCTGCTCT TGGTCCGCTT CCTCTACTCC TGCCCCGGCC GGGCCGGGCT GCTCCACGCC GACCCGCACC CGGGCAACTT CCGGCTCCTC GCGGACGGCC GGCTCGGTGT TCTCGACTTC GGGGCGGTGA ACCGTCTGCC CGGCGGGCTG CCCGAGCCGA TCGGCCGGCT CGCCCGGCTG ACCCTCGCCG GTGACGCCGA GGCCGTCGCC GAAGGGCTGC GGGCGGAGGG GTTCATCCCG GAGGGCGCCG CCATCCCCGC GGAGGACCTG CTCGACTACC TGGCACCGAT GCTCGCGCCG ATCACCGACG AGGAGTTCAC CTTCTCCCGC GACTGGCTGC GCGGGGAGGC GCTCCGCCTC GGTGACTGGC GCTCCGCCGC GGCGCAGCTC GGCCGCCAGC TCAACCTGCC ACCGTCCTAC CTGCTGATCC ACCGGGTGAC GCTCGGCGCG ATCGGGATCC TCTGCCAGCT CGGCAGCTCC GGCCCGTTCC GGGCCGAGAT GGAGCGCTGG CAGCCCGGGT TCGCCCCGCC GCGCAGCGCC GCCGCACGCC ACGCGGCAGC CGCGAACCGG CCGAACCGCC GCCTTCCCAG ACTCGACATC GAGGACGGCA CCGGCGTCAT CCGTCCGCTC CCCGGACCGG TGGTCCTCGC CACCGCCCCC GCGCAGCGCT CGGGGCGCTC GGGCCGCGCA CGCAGCCGCG TCCGCCCGCC GGAGCAGGCC AGCACGCCGG AGCAGGCTGG TCCACCGGAG ACCAGCCGAC CGTCGCGGCA GGCCCGGCCA GCTCCAGGAA ACCGACGCAA GCTCGAGAAA GAGGCCCAGC CCGAACCAGG GACCCAGCCC GAACCCCGCT GA
|
Protein sequence | MSDIPRRAVV RTAKLATLPI GIAGRATLGV GKRLGGRPAE AVATELQQRT AAQIFRVLGE LKGGAMKLGQ ALSVFEAALP DEVAGPYRAA LTKLQEAAPP LPAATVHKVL AEELGPEWRA LFASFDDTPA AAASIGQVHR AVWADGRPVA VKIQYPGAGS ALLADLNQLG RAARLFGALT PGLDIKPLVA ELKARITEEL DYRLEAAWQR AFAQAYADDP DIVVPRPIAG ADRVLVSEWI DGVPLSTIID RGTQEERDRA GLLLVRFLYS CPGRAGLLHA DPHPGNFRLL ADGRLGVLDF GAVNRLPGGL PEPIGRLARL TLAGDAEAVA EGLRAEGFIP EGAAIPAEDL LDYLAPMLAP ITDEEFTFSR DWLRGEALRL GDWRSAAAQL GRQLNLPPSY LLIHRVTLGA IGILCQLGSS GPFRAEMERW QPGFAPPRSA AARHAAAANR PNRRLPRLDI EDGTGVIRPL PGPVVLATAP AQRSGRSGRA RSRVRPPEQA STPEQAGPPE TSRPSRQARP APGNRRKLEK EAQPEPGTQP EPR
|
| |