Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4164 |
Symbol | |
ID | 5672519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4948777 |
End bp | 4949736 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243037 |
Product | hypothetical protein |
Protein accession | YP_001508454 |
Protein GI | 158315946 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00796352 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.264326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAG TCGAGCTACC CGACTCCACC CGTCCCGACC GGCTCGGCCT GCCCGCCGCC GCCCGCGCCG CCGCCATGCT CTCCCGGGAG ATCGCCTACG CCGACCCCGA CCCGCTGACG ATGGCCGAGC TCGATGACCG GCTGCACCGG TTCGCCACCA CCCTGCTCAC CACCCCGCAC ACCGCGCTGC TGCGCCCGGT GATCGCCGAC TGGCGCCGTG TGCAGGCCGC GCTGGGACGC CGGCTGTCCC CCGCGACGCA CCGTGAGCTC ACCGGGGTCG CCGGGTTCCT GTCGTTCTAC CTGGGTGTGC TCGCCGTCGA GGCCGGGGAT GATCCGTCGG CGCGGCGGTT CGCCACCCTC ACCGACCAGT TCGCCACCCA GCTCGACGAC CCGCTGCTGA CCGGTACCGC CGCCACCCTG GACTCCCTCG TCGCGTTCGT CGGTGGCCGC TATGACGCGG CCCGGGGTGC CGCCCGCCGT GCCGCGGCCG CCGGCCATCC GTATCTGCAC GGGTGGGCGG CCGCGTTGGA GGCGGGTGCG GCCGCGACGC TCGGTGACAT CGACGGCGCC CTGGCCGCGC TCGCCCGGCT GTCACAGACG CCGTGTGTGC AGGGGCTGCG TCATCCGGGG TGGCCGGCGT TCGACGAGGT CCGGGAGGCC TGCGTCGTCG CGGACGTCGT CAGCCGCATG GGCGGTGAGG GCGCGGCCAG CCTGAGCCGG GTCGCGGTCG ACCTGACCAG TTCGGGCACC CCCGAACGAG GGTGGGCGCT GGCCGCGCTC GCCGGTGCGC TGGCACCCGA CGACCCGCCG GAGGCCAGCC GGCTGCTCGG TGAGATCGTC GAGATCCTGG ACGTGCACCC GTCGCGGTTG CTGTCGAGCC GGGTCAACGA CCTGGTGCGC ACCGCCGGCT ACCCGCGGCC GCTGGCCCCC CATCCCGGCC AGGCCGGCAC CTCCGCCTGA
|
Protein sequence | MSRVELPDST RPDRLGLPAA ARAAAMLSRE IAYADPDPLT MAELDDRLHR FATTLLTTPH TALLRPVIAD WRRVQAALGR RLSPATHREL TGVAGFLSFY LGVLAVEAGD DPSARRFATL TDQFATQLDD PLLTGTAATL DSLVAFVGGR YDAARGAARR AAAAGHPYLH GWAAALEAGA AATLGDIDGA LAALARLSQT PCVQGLRHPG WPAFDEVREA CVVADVVSRM GGEGAASLSR VAVDLTSSGT PERGWALAAL AGALAPDDPP EASRLLGEIV EILDVHPSRL LSSRVNDLVR TAGYPRPLAP HPGQAGTSA
|
| |