Gene Information |
Locus tag | Franean1_2550 |
Symbol | |
ID | 5670944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3033413 |
End bp | 3034426 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241466 |
Product | hypothetical protein |
Protein accession | YP_001506886 |
Protein GI | 158314378 |
COG category | |
COG ID | |
TIGRFAM ID | |
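The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 3033413, 3034426
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1014 337"
```

Both results match the Gene Length (1014 bp) and Protein Length (337 aa) fields listed in the record.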
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.753306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAC CGCTGCCGCA GCGCGCCCTG CGCGTCCCGT TGCGTCCCGC CGCCCGCGTC GTCGCCGACG CCGAACACCT GGCGAACCTC ACCCCCCACC TCAGCCCGCG CGACCGCTGG ATCGCCCGGC TCCTCTACGA GCACCGGGTC CTGACCACCC ACCAGCTCGT CCAGGTCGGC TGGCCCACCC GGCGCACCGC CAACGAACGG CTCCTCCAGC TCTACCGCTG GCGCGTCGTC GACCGTTTCC AGCCCCTCAG CCCTCTCGGG GAGGGCATGC CGCCCGCGCA CTACGTGTGT GACGTCGCCG GCGCCGCGAT CCTCGCCGCC GAAGACGGCA TCGACTTGGC CGCGACCGGC TACCGGCACG ACCGCGCGCT CGGCGTCGCC TACTGGCCCC AGCTCGCCCA CCGCGTCGCC GTCAACGGCT TCTTCACCCA CCTCATCGCC CACGCCCGCA AGCCCAACCC GCCCGGCACG CTCACCGCCT GGTGGTCCGA GGCCCGGACC CGCCGCGCGT TCGGCGACAT CGTCCGCCCC GACGCCTACG GCCGCTGGAC CAGTCGCGGC GGTGACCTTG AATGGTTCCT CGAACTCGAC TGGGCCACCG AGCCCTACGC ACGCCTCGCC GCGAAGATCG ACAAGTATGG GCGGCTCGCG ACCGCGACCG GCATCACCAC GCCGGTCCTG TTCTGGTTCC CCACCACCAA CCGGGAGAGC CGCGCCCGCC GCGCGTTGGC CGACGCCGTC GCCGGGCTCG ACCAGCCGAA CACCGTCCCG GTCGCGACCA CCGCCGCCAC CCTCGCCCCG CCCGACGACC AGCTCGACCC GGCCCCCGCC CGCTGGCTGC CCCTCGCCTC CAGCCGCGCC GGCCGGCTCC CCCTCGACCA GCTGCACCGG GCCTGGCCCC GGCTGTCCGC ACCCGCGCCG GTGTCCGACC GGCCCGATGC CATCCCGACC GGGTCGGGCC TGCGTCCGCC CGCGCCGATG CCGCCCGAGC GGTATTGGGG GTGA
|
Protein sequence | MAKPLPQRAL RVPLRPAARV VADAEHLANL TPHLSPRDRW IARLLYEHRV LTTHQLVQVG WPTRRTANER LLQLYRWRVV DRFQPLSPLG EGMPPAHYVC DVAGAAILAA EDGIDLAATG YRHDRALGVA YWPQLAHRVA VNGFFTHLIA HARKPNPPGT LTAWWSEART RRAFGDIVRP DAYGRWTSRG GDLEWFLELD WATEPYARLA AKIDKYGRLA TATGITTPVL FWFPTTNRES RARRALADAV AGLDQPNTVP VATTAATLAP PDDQLDPAPA RWLPLASSRA GRLPLDQLHR AWPRLSAPAP VSDRPDAIPT GSGLRPPAPM PPERYWG
|