Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0464 |
Symbol | |
ID | 3903195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 542550 |
End bp | 543908 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637877795 |
Product | extracellular solute-binding protein |
Protein accession | YP_479579 |
Protein GI | 86739179 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.364922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.192606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCGTA TTCGCCGAAT ACTGCCGCTG CCCGTCGCCG CCACGATGAT CATCCTGACC GCGTGCGGCG GCGGTGGTGA CGCGAGCCCG GCAGACCCCG CGGAGAGCCT GCGCCCCACG GCCCGTCCAG CCGACGCCGG CGTCGACAAC GTGGCGGGCG CCAAGGCGTC GCCCGCCTGC GTGGCCCAGG TCAAGACGCT GCGGATGTCC GCCGTCGGCA CGCTCAACGA CGTCGCGAAA TCCGGCAAGG CCTATCTGGA GAAGGCGCAT CCCGGCCTCA CCGTGGACCT CAACACGAGT GCGCCGGACT ACACCTCCCT CGTCCAGCAG ATCAGCGCCG ACCGCTCGGC CGGCCGTTCC GTCGACGTGG CGGTCGCCGG CTTCGACCTG CTCCCGACCT TCGCCCGGGA CCTCGGCGCC CAGGAGCTCT CCCCACGCCT GCTGCGCGCG TCCTACGACC AACGGCTGGT CGGCCTCGGG CAGGTCGCCG GGAAGCAGAT CGGCATTCCC CAGCAGGTGT CGTCTCTGGC CCTGGTCTAC AACCTCGACG TGCTGCAGAA GGCCGGCGTC GATCCGGCGA CGCTGGGCAC GACCGACGGG GTGATCGCCG CCGCCGACAA GATCAAGGCT TCGGGTCAGA ATATCCAGCC CCTCGACCTG CCGACCGGCC AGCAGTTCGG GCAGTGGATC CTCAACACCC TGGCCAGCTC CAAGGGGACG CCGATCCAGG ACGCGAACGG TCGGCCCGCC CTGAACACCC CGGCGGCCCG CGAGGCCGCC GCGTTCCTCG CGAAGGCCGC GAGCTACGGC ACGCACTCCG CCGATCCGAC CCAGCAGGGC CTGCTGCGGT TCGGCATCCG CCGGCAGACG GCGATGACCG CCGTGACGGT GGCCTCCGTG GCCGGCGGGC TGAAGTTCAT CGCGGGGCAG GGGACGAAGG GTTTCCGGGC CGGCGCGGTC CCGTTCCCGA CTCTGCCCGG CGGGACCCAG CACCCGGTCG CGGGCGGCAA CGCGCTGACG GTCCTGTCCA CCGACCGCTG CCAGCGGGAG ATGGCGACCG AACTGGTCGT GTCGCTGCTT TCGCCGGACG TCGTGGCCGC GAGCACGGAG GCGTTGAGCT ACATCCCGGT GGATACTCAG GCCGTCAGCC AGCTCGGGTC GTTCTACGAG ACCTATCCGC AGCTCAAGCC GTTCAACGCG CTCATCCCCT CGCTGGTGAA GGCGCCGGCT TGGAGCGGGG CCCGCGGCGG GGAGGTCCCG AGCGCGATCT CGGACCAGGT GCAGCGCATC CTTAAGGGCG AGGACCCGGT CAGGGCCCTC GCCGCGGCCC AGAGCCAGGC TGTGGAACTC ACCCGTTGA
|
Protein sequence | MIRIRRILPL PVAATMIILT ACGGGGDASP ADPAESLRPT ARPADAGVDN VAGAKASPAC VAQVKTLRMS AVGTLNDVAK SGKAYLEKAH PGLTVDLNTS APDYTSLVQQ ISADRSAGRS VDVAVAGFDL LPTFARDLGA QELSPRLLRA SYDQRLVGLG QVAGKQIGIP QQVSSLALVY NLDVLQKAGV DPATLGTTDG VIAAADKIKA SGQNIQPLDL PTGQQFGQWI LNTLASSKGT PIQDANGRPA LNTPAAREAA AFLAKAASYG THSADPTQQG LLRFGIRRQT AMTAVTVASV AGGLKFIAGQ GTKGFRAGAV PFPTLPGGTQ HPVAGGNALT VLSTDRCQRE MATELVVSLL SPDVVAASTE ALSYIPVDTQ AVSQLGSFYE TYPQLKPFNA LIPSLVKAPA WSGARGGEVP SAISDQVQRI LKGEDPVRAL AAAQSQAVEL TR
|
| |