Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1224 |
Symbol | |
ID | 5669637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1464725 |
End bp | 1465696 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641240156 |
Product | extracellular solute-binding protein |
Protein accession | YP_001505584 |
Protein GI | 158313076 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.228597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATGT CGCTCATCCG TAGCAAACGG CGAACGGCCC CCACGGACGC CATCGACTCC ACCACCCGCG CGGCCTCGGG CTCCCGCCCG CGGCGGCCGC TGCGCAGCGC GGCCCTGCTG CTCGCCGCGG CACTGGCCGG CTCGCTCGCG CTGTCCGCCT GCGGCGACGA CGACAGCTCC GACGAGCCGG CCGCCGCCAC CGCCACCACG TTCCCGGCGG GGAGCACGAT GGCCAAGCTC CAGTCGGCGG GCACGATCAC CGTTGGCACG AAGTTCGACC AGCCGCTGTT CGGCCTGAAG AACCTGCGCG GCGAGCCCGA GGGCTTCGAC GTCGAGATCG CCAGGATCAT CACCGACGCG CTGGGGATCC CCGCGGACAA GGTCAAGTTC GTCGAGACGG TCTCGGCCAA CCGCGAGCCG TTCATCGAGC AGCACCGCGT GGACCTGGTG GTGGCCACCT ACACCATCAA CGACAAGCGC AAGCAGGTCG TCGACTTCGC CGGCCCGTAC TACGTGGCCG GTCAGACGCT GATGGTGCGG GCCGGCGAGA CCGCCATCAC CGGGAAGGAC ACGCTCGCGG GCAAGAAGGT CTGCTCGGTG AGCGGCTCCA CCCCGGCTGA GCGCATCCGC ACACAGGCAC CGGACGCCGA GCTGACCCTG TTCGACGTCT ACAGCAAGTG CGCCGAGGCG CTCAAGGCCG GCCAGGTGGA CGCGGTCACG ACCGACAACG CGATTCTGCT CGGCCTGATG GACTCCGACC CGGGCGCCTT CAAGCTGGTC GGCGAGCCGT TCAGCACGGA GCCCTACGGC ATCGGCATCG CCAAGGGCGA TGACGAGTTC CGCACGTTCA TCAACGACAC GCTCGAGGCG GCCTACACCG ACGGCCGGTA CGAGACGGCC TACAAGGACA CTATCGGCAA GGTCGAGCCG GACATGCCCA CGCCTCCCGC GGTGGACCGC TACACCTCCT GA
|
Protein sequence | MRMSLIRSKR RTAPTDAIDS TTRAASGSRP RRPLRSAALL LAAALAGSLA LSACGDDDSS DEPAAATATT FPAGSTMAKL QSAGTITVGT KFDQPLFGLK NLRGEPEGFD VEIARIITDA LGIPADKVKF VETVSANREP FIEQHRVDLV VATYTINDKR KQVVDFAGPY YVAGQTLMVR AGETAITGKD TLAGKKVCSV SGSTPAERIR TQAPDAELTL FDVYSKCAEA LKAGQVDAVT TDNAILLGLM DSDPGAFKLV GEPFSTEPYG IGIAKGDDEF RTFINDTLEA AYTDGRYETA YKDTIGKVEP DMPTPPAVDR YTS
|
| |