Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4061 |
Symbol | |
ID | 5672419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4840722 |
End bp | 4841825 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641242937 |
Product | hypothetical protein |
Protein accession | YP_001508354 |
Protein GI | 158315846 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.302749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.187686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCGG TGACGGCGGG AGTCGGTCGC GAGGTGGCAG TCGAGGTCGT GACGACGGCT TCGCTCGCAC CGTCCATTCA CAACACCCAG CCGTGGCGGT GGCGTCTCGC CGACGGCCGG CTGACGCTGC GGCCGGACGT CGAGCGGCGC CTCGCCGTCC TCGACCCGGA CGGGCGGCAG CTCCTGGTGA GCTGCGGCGC GGCGCTGCTG CACGCCCGGC TGGCGTTCCG GGTGCTCGGC TTCGAGCCGG TGGTGGAGCT GCTGCCGGAG GCGACCGGTT CCGGCGGGCT GGAGCCGGTG CTGTCCGGTG GCCTGGTCCT CGCGTCGATG GGGATCGGCG GGCGGGCGGC GCCGACGGAT TCCGACCGGC TGCTGTTCTC GGCCGCCGGG GCCCGGCACA CCGACCGTCG GCCGTTCGAG GCACGCCCGC TCGCCGGCGA CCTGGTCGAC CGGATGCGCC GCGCCGCCGA GTCCGAGGGC GCCTGGCTGA TGCCGCTGTC GCACTCCGAC CAGCGGCTGG ACGCGGCGGT GGTCGCGGCC CGGGCGGACT GGGCGGAGCA GGCCGACCCC GCCTACCGGG CCGAGCTGGC GTCGTGGCGG CGTGGGCAGG GCCCGACCGC CGACGGGGTG CCGACCGGCG TGGCCTCGAC CCATCACCCC GAGCGGGCCA GCGACTTCCC GCTGCGCGAC TTCGACACCC GCGGCGCGCC GGGCATCGCG GCGCCGCGTT CCGAGGAGGC CGACGGCGGC GCCGACGCGG ACGCGGGTCT CGCCCCGACC GTCGAACAGC GCCCGGAGGA GCTGCCGTCG GTGGAGCGCC CCGCTGTCGT CGTGGTCGGC ACGGACGCCG ACCAGGCGGA GGACAGGCTG CGCGCCGGGC AGGCGCTGGC GCGGGTTCTG CTCACCGTGA CCGCGGCCGG CGCGGCCGCG TCGCCGATGG GCCAGCCCAT CGACAGCGAC GGGTACCGTT CGGTGATCGA GCGGATGCTG AGCATCGCCG GGCACGTGCA GATGATCCTT CGGGTCGGCT ATCCCCGCCG GGACGCGGTG GCCTCCGCGC TGACCCCGCG GCGCCCGATC GGCGAGATCC TGACCTTCGG TTAG
|
Protein sequence | MESVTAGVGR EVAVEVVTTA SLAPSIHNTQ PWRWRLADGR LTLRPDVERR LAVLDPDGRQ LLVSCGAALL HARLAFRVLG FEPVVELLPE ATGSGGLEPV LSGGLVLASM GIGGRAAPTD SDRLLFSAAG ARHTDRRPFE ARPLAGDLVD RMRRAAESEG AWLMPLSHSD QRLDAAVVAA RADWAEQADP AYRAELASWR RGQGPTADGV PTGVASTHHP ERASDFPLRD FDTRGAPGIA APRSEEADGG ADADAGLAPT VEQRPEELPS VERPAVVVVG TDADQAEDRL RAGQALARVL LTVTAAGAAA SPMGQPIDSD GYRSVIERML SIAGHVQMIL RVGYPRRDAV ASALTPRRPI GEILTFG
|
| |