Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0630 |
Symbol | |
ID | 5669047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 729666 |
End bp | 731117 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239557 |
Product | extracellular solute-binding protein |
Protein accession | YP_001504995 |
Protein GI | 158312487 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.679852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGGA ACGTGTCAGA ACCCCGCCCG CGGCACCTGG ACCCGCGAGG CCCCGGCACC CGCCCGGGCC CGGGCACCCG CCGGGCCGGC TCGTCCCCAC GCCGCGCCGG CCCGCTGGCC GCGCTGCTCG CCGGGACGGT CACGGTCGGC CTGCTCGCGG GCTGCGGCGG CGGCTCCGGC GCGGCCACCG ATCCGGCCGA GAGCCTGCGT CCGACGGCGC GGCCGGCGAC CGCGAACGTG GACGACGTCG CCGGCGCTAA GGCGTCGCCG GAGTGCGCCG CCGCGGTGAA GACGCTGCGG ATGTTCGCCC TGGGGTCGCT CAACGACGCG GCGAAGTCGG GCAAGGCGTA CATGGAGAAG GCCCATCCCG GCCTGACCGT CGAGCTCACG GCCGACGCGA CCGGCTACCC CCAGCTCGTC CAGCAGATCA GCGCGGACCG GGCCGCCGGG CGCCCGGCGG ACGTCGCGGT CGCCGGCTTC GACCTGCTGC CGACCTTCGC CGACAAGCTC GGCGCGCAGC CGCTGTCGCC CCGGTTGCTG CGGGCGTCCT ACGACCAGCG GTTCCTGCCG CTCGGCGAGT ACGGCGGCCG GCTCGTCGCG GTGCCGCAGC AGGTGTCCAC GCTCGCACTC GTCTACAACG CGGACGTCCT GGCGAAGGCC GGGGTCGACC CGAAGACGCT GGGCACGACC ACGGGGGTGC TCGCCGCGGC CGAGCGGATC AGGAACTCCG GTCAGCAGAT CCAGCCGATC GACCTGCCGA CGGGCGGGTT CGCCCAGTGG TACCTGACCA CGCTGGCCAG CTCGAAGAAC ACCCCCGCGA TGAAGGCGGA CGGCCAGCCC GACCTGACCA GCCCGGCCGT CCGCGAGGCC GCCGCGTTCC TGGCCAAGGT GGGCACCTAC GGAACACAGT CGAGCGACCC GACCACGCAG GGCCTGCTGC GCTTCGGCAT CCGGCACGAG ACGGCGATCA GCGCGGTCAC CCTGCCGTCG CTGGCCGCGG GGCTGCGCTA CGTCCATGAC CAGGGCGCGC AGGGCTTCAA GGTGGGTGTC GCCCCGTTCC CGACCCTGCC CGGGGGCACT CAGCACCCCG TCGCGGGCGG CAACGGGCTG TCCGTACTGT CGACGGACCG CTGCCAGCGG GAGATGGCCA CCGAGCTGGT CGTCGCGCTG CTCGCCCCGG ACGTGATCGC CGCCGGCACC GAGGCGTTCA GCTTCCTGCC GGTCGACACC GAGGCCCGCA GGCAGCTCGC GCCGTTCTAC CGGGAGTTCC CGGAGCTGAC CCAGTTCGAC GCGCTGATTC CGGATCTCGT CCAGGCGCCG ACCTGGGGCG GTGAGCGCGG CGGCGAGGTC CACGACGCGC TGAACGACGA GGTGGTCGCA ATCATGTCAG GAGCCGACCC CGGCACCACC CTCACCGAGG CCCAGCGGAA GATCGCCACC CTGGTGAAGT GA
|
Protein sequence | MLGNVSEPRP RHLDPRGPGT RPGPGTRRAG SSPRRAGPLA ALLAGTVTVG LLAGCGGGSG AATDPAESLR PTARPATANV DDVAGAKASP ECAAAVKTLR MFALGSLNDA AKSGKAYMEK AHPGLTVELT ADATGYPQLV QQISADRAAG RPADVAVAGF DLLPTFADKL GAQPLSPRLL RASYDQRFLP LGEYGGRLVA VPQQVSTLAL VYNADVLAKA GVDPKTLGTT TGVLAAAERI RNSGQQIQPI DLPTGGFAQW YLTTLASSKN TPAMKADGQP DLTSPAVREA AAFLAKVGTY GTQSSDPTTQ GLLRFGIRHE TAISAVTLPS LAAGLRYVHD QGAQGFKVGV APFPTLPGGT QHPVAGGNGL SVLSTDRCQR EMATELVVAL LAPDVIAAGT EAFSFLPVDT EARRQLAPFY REFPELTQFD ALIPDLVQAP TWGGERGGEV HDALNDEVVA IMSGADPGTT LTEAQRKIAT LVK
|
| |