Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0140 |
Symbol | |
ID | 5668565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 168435 |
End bp | 170036 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641239068 |
Product | FHA domain-containing protein |
Protein accession | YP_001504513 |
Protein GI | 158312005 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1716] FOG: FHA domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.425814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00649643 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCGTGC TGCAGCGCTT CGAACGGCGC CTTGGCGGCC TCGTCGAGGG TGCGTTCGCG AAGGTCTTCA AAGGCGGGGT CGAACCCGTC GAGATCGCCA GCGCCCTGGC TCGCGAGACC GACGACCGGC GTGCGAGCAG CTCGAACCGT GTCCTCGTCC CGAACGAGTT CGCCGTCGAG CTGGCCGGTG GCGACTTCGC CCGGCTGGCC CCCTACACCC GGGCGCTCTG TGACGAGCTG GCGGAGATGG TCCGTGAGCA TGCCGCGGAG CAGCGCTACA CCTTCGTCGG CCCGGTGACT GTCCGACTGG CGGAGGCCGC CGATCTCGAC ATCGGCGTCT TCCGAATCCG CAGCAGTGTG GCCTCCGCGG ATCCCGCGGT GGTCGGTGGC CGCCGGCCGC GGCCGCGGCC GGCGGCGCCG GGTACCCCGC ATCTACTGAT CACGACACGC GCGCCGGGCG GGTCGGGCAG CGGCGAGCGG GAGTACCCGC TGGACGCGGA GACCACGGTG ATCGGGCGCA GCGTCGAGTG CGACATCCGG CTGAACGACA CGGGCGTCTC GAGGCGGCAT GGCGAGATCC GCCGCCTGCC TGACGGGCAG TTCCTGTACG TGGATGCCGG CTCCACGAAC GGCAGCCTGG TCAACGGGCG CGCGGCGACG CAGGTGAAGC TTGTCAACGG CGACCGGATC GAGCTGGGGA CCGCGGTCGT GGAGTTCCGG CGTGAGGAGG CCCGCGGCGC AACCGGCCCG CGTGGGCCGC GCACGCCGGT CCCGGCACGC TCGTCACCGC CGCCGGGCCG CCCGACCCCG CCTCCGGCGG ATCACCGCAC GCCGCCGCCG TTCGACCCGC GCCAGCCAAC TCCGTCGACC CGCCGGGGTA CTCCGTCGCC GGACGACCGC TATGCCCGCC CGGAGCCGGG CGTCCCGGGT GGTCCGGGCG GCCGCGAGCC CGGTTACCGC GACCGCCCCG GCCGGGATCC GCGCGATGGC GACTACCGCG AGGGGCCGTC TCCGCGTTTC CGGGACGACC CGCCCGGCCG GCCCGCCGGC GACCACCGGC CGGACGCGTA CCGCGAGGGG CCGTCGCCCC GCTTCCGCGA CGAGCCGGGC GCGCCGCCCG ACGGTTACCG CGAGGGGCCG TCCGCGCGTT TCCGGGACGA GCCGCCCGCC CGGCCGCCGG CCGGGCCCCG GTCGGACGGC TACCGCGAGG GGCCCTCGCC CCGCTTCCGC GACGACGGGG CTCCCGCCCG GGGCCGCCCC GGGCCGACGC CGCCGCCGAA CGCGCGCCAG GGCGGGGAGC GGGGTGACGG CGGCCGGCGG GCCCCCGGGC CGGGCCGGGG GCCCGAGTAC GGGCGGGCTG GCAGGCCCGA TCCGGGCCGC GGTCACCGCG GGGAGCCACC GCGTGCCGGC GCGCGCGGCG CGGGCGGTCC CGCCGACGAG ATGTACCGCT CGTCCACCGA TCCACGCCAG CGCCCGCCCG CGCGCACCGA CGACGGCCTC GAGGCGCTGG AGCCTCTCGA CGGCTTCGAC GACGCCGAGA CCCGCCTGCC CAGCCGGGCG GCGAACCATC CCGGCGACGA CCGCCGGCGC AGGGGCTGGT AG
|
Protein sequence | MGVLQRFERR LGGLVEGAFA KVFKGGVEPV EIASALARET DDRRASSSNR VLVPNEFAVE LAGGDFARLA PYTRALCDEL AEMVREHAAE QRYTFVGPVT VRLAEAADLD IGVFRIRSSV ASADPAVVGG RRPRPRPAAP GTPHLLITTR APGGSGSGER EYPLDAETTV IGRSVECDIR LNDTGVSRRH GEIRRLPDGQ FLYVDAGSTN GSLVNGRAAT QVKLVNGDRI ELGTAVVEFR REEARGATGP RGPRTPVPAR SSPPPGRPTP PPADHRTPPP FDPRQPTPST RRGTPSPDDR YARPEPGVPG GPGGREPGYR DRPGRDPRDG DYREGPSPRF RDDPPGRPAG DHRPDAYREG PSPRFRDEPG APPDGYREGP SARFRDEPPA RPPAGPRSDG YREGPSPRFR DDGAPARGRP GPTPPPNARQ GGERGDGGRR APGPGRGPEY GRAGRPDPGR GHRGEPPRAG ARGAGGPADE MYRSSTDPRQ RPPARTDDGL EALEPLDGFD DAETRLPSRA ANHPGDDRRR RGW
|
| |