Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7049 |
Symbol | |
ID | 5675360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8603053 |
End bp | 8604372 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641245895 |
Product | extracellular solute-binding protein |
Protein accession | YP_001511286 |
Protein GI | 158318778 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0982583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGCA CTCCGAGCGA AGCCAGATCC GGGCTCACCC GTCGCGGGGT CCTGCAGATG AGCGGGGCAG TGGGGCTCAC CAGTATGATT GCATCTGCCT GCGGTGCCGG TGGGCAGAGT GGCGACACCA GCGGTTCCAA CGCCCCCCTC ACCCTGTTGT GCGAGGCCGG TGGCAAGGCG GAGCTGACGA AGATCGCCGA GTTGTTCCAT CAGGAGACCG GTCATGCGGT CTCCTTCGTG GAGTTACCGT ACAACGGGCT CTTCAACAGA CTGAGCAGCG AACTTTCTTC GGGCACGGTC TCGTTCGACG TCGCGGCGGT CGACGCGATC TGGTTGTCGA CCTTCGCCGG CGCCCTGCAC CCGCTGGACG AGCTGTTCAC CGCGGACGTC AAGTCCGACC TATTCCCGGC GCTGGTCTCC GAGGCACAGG TCGACGGCAG GTTCGTGGCC ATGCCCACCT GGACCAACGC GGAGATCCTC TTCTACCGGA AGGACCTGTT CGAGGCTCCG GGGGAACGGA CGGCGTTCGA GAGCCAGTTC GGGTATCCGC TCGAAGTGCC GAAGACCTGG CAGCAGTTCG AGGACACGGC CCGCTTCTTC ACGCGGGGCA CCGAGCTCTA CGGAACCGAC GTGAAGGGGG CGGTGGAGAC CGAGTGGCTC GCCCACGTCC TGCAGGCGGG GTCCCCCGGT GTGGTTCTGG ACCCGGACGA CAACATCATC ATTGACAACG AGCAGCATCT GGCCGCGCTC CGCTTCTACA GCGACCTCAA CAACCGTCAT CGGGTGGCTC CGCCGGGAGC CGCGCAGCTC GACTGGGCCG GGGCACAGAA CCTGTTCAAC CAGGGAAAAA CGGCGATGCT GCGCTTCTGG GCCCACGCGT TCCCGCTGAT CCCCTCGGAC TCGCCCGTCC ACGGCAAGGT GGGGGCAGCA CCCATGATCG CGGGAAGTGC CGGGATCGCG GCCATTCCGG GGCCATGGCA CCTGTCCGTT CCCGCGGCCG GCCGCAACAC CGAGCTGGCC ACGGAGTTCA TCCAGTTCAG CTATGAGAAC AACGCGCTGG GCATCCAGTC CTCACTCGGC CTGGCGGCCC GCAGATCGGC CTTCGATAAG TACTCCGACA AACCCGGCTA CGAGCACTTC ACTCCGCTGC TGGACACCCT GTCCGCCCCG GCGACGAAGG TCCGCCCGGC GACCCCCAAA TGGCAGCAGA TCGTCGACAC CGTCCTCGTG CCCATGCTGC AGAAGTCGCT GACCGACAAC GCCGACTACG CAGCCCTGCT GAAGGACGCC CGCGAGGATG TGCAGCGTCT TGTCAGCTAG
|
Protein sequence | MASTPSEARS GLTRRGVLQM SGAVGLTSMI ASACGAGGQS GDTSGSNAPL TLLCEAGGKA ELTKIAELFH QETGHAVSFV ELPYNGLFNR LSSELSSGTV SFDVAAVDAI WLSTFAGALH PLDELFTADV KSDLFPALVS EAQVDGRFVA MPTWTNAEIL FYRKDLFEAP GERTAFESQF GYPLEVPKTW QQFEDTARFF TRGTELYGTD VKGAVETEWL AHVLQAGSPG VVLDPDDNII IDNEQHLAAL RFYSDLNNRH RVAPPGAAQL DWAGAQNLFN QGKTAMLRFW AHAFPLIPSD SPVHGKVGAA PMIAGSAGIA AIPGPWHLSV PAAGRNTELA TEFIQFSYEN NALGIQSSLG LAARRSAFDK YSDKPGYEHF TPLLDTLSAP ATKVRPATPK WQQIVDTVLV PMLQKSLTDN ADYAALLKDA REDVQRLVS
|
| |