Gene Information |
Locus tag | Franean1_7253 |
Symbol | |
ID | 5675554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8855234 |
End bp | 8856274 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641246090 |
Product | periplasmic solute binding protein |
Protein accession | YP_001511478 |
Protein GI | 158318970 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
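The coordinate and length fields in the record above can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates and that the CDS length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length = gene_length_bp(8855234, 8856274)
print(length)                     # 1041, matching "Gene Length"
print(protein_length_aa(length))  # 346, matching "Protein Length"
```

Under these assumptions the start, end, gene length, and protein length fields agree with one another.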
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0518447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0520295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCCC TGCCCCTCCG GTCCACCGCC GGACACCCGT CGGACCAGGC CTCGCGCCCC CGGTCGCTGC CGGTGGCGCG TCCCGCCGCC GCTCTGCTCA CCGCGGCTGC GCTCGCCGCC GCGCTGGCGG CCTGCGGCGG CTCGGGTGGG GGCTCCGGCG GCGGCGCGGG CGGCTCGGCG CCACGGGTGG TCGCGACGAC CACCTACGCG GCGGATTTCG CCCGCGTCAT CGGCGGTGAC GGCATCGAGG TCAGGCAGAT CCTGCGGCCG GGCGTCGATC CGCACGACTA CGAGCCTTCT CCGGCCGATC TCGAGGCGAT CGGCTCCGCG GACGTCCTGA TCGAGAACGG TGTCGGCCTG GAGAGCTGGC TCGACGACGC GATCTCGGCC AGCGGCTTCG ACGGCGTCGA GGTCGTGATG GCCGACGGGG TCACCGTCCG GAGTGAGCCC GGAGGGGGCA CGGACCACGC CGACGAGGGT GCGCATGAGG ACGAGCACGC CGGGGGCGAC CCGCACGTCT GGCACGACCC GCGCAACGCG AAGATCATGG TCACGAACGT CGAGAAGGGG CTCGCGGCGG CCGATCCGGA GAGCTCCGCC ACGTACCGCG CCAACCTCGC CGGCTACACG ACCAAGTTGG ACGAGCTCGA CCGGGAGAAC CAGCAGAAGA TCGACACCAT CCCGGCGGAC CGGCGCAAGC TGGTCACCAA CCACGACTCG TTCGGCTACT ACGTGGACCG GTACGGCCTG GAGTTCGTCG GGTCGATCAT CCCGAGCTTC GACACGTCCG CCGAGCTCTC CGGCCGGGCG ATCGACGACA TCGTCAGCCG GATCAGGGCG TCCCGCGTCG TGGCGGTGTT CTCCGAGTCA TCCCTGCCGC CGCGGACGGC GCAGACCATC GGCCGCGAGG CGGGCGTCCG CGTCGTCGCC GGCGAGGGTG CCCTCTACGC CGACACGCTG GGCCCGGCGG GGTCTGATGG TGCCACCTAC CTCGAAGCCG AACGGCACAA CACGGATACC ATCGTCAACG CCCTCCGGTG A
|
Protein sequence | MSPLPLRSTA GHPSDQASRP RSLPVARPAA ALLTAAALAA ALAACGGSGG GSGGGAGGSA PRVVATTTYA ADFARVIGGD GIEVRQILRP GVDPHDYEPS PADLEAIGSA DVLIENGVGL ESWLDDAISA SGFDGVEVVM ADGVTVRSEP GGGTDHADEG AHEDEHAGGD PHVWHDPRNA KIMVTNVEKG LAAADPESSA TYRANLAGYT TKLDELDREN QQKIDTIPAD RRKLVTNHDS FGYYVDRYGL EFVGSIIPSF DTSAELSGRA IDDIVSRIRA SRVVAVFSES SLPPRTAQTI GREAGVRVVA GEGALYADTL GPAGSDGATY LEAERHNTDT IVNALR
|
| |
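The GC content and protein sequence reported in this record can, in principle, be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, the codon table is built from the standard amino acid assignments (which translation table 11 shares with the standard code; the two differ in permitted start codons), and only the first 30 bases of the gene sequence are used as sample input.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the "Gene sequence" field.
prefix = "ATGTCGCCCC TGCCCCTCCG GTCCACCGCC"
print(translate(prefix))              # MSPLPLRSTA
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first ten residues of the "Protein sequence" field. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported 72%.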