Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5187 |
Symbol | |
ID | 5673521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6227480 |
End bp | 6228637 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641244041 |
Product | solute-binding protein |
Protein accession | YP_001509451 |
Protein GI | 158316943 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0173275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGACA ACTCGCTCGA CGACGGGGTA CCTCGGCGCG GTCGGATGCG CGGCCATGCG GGGAGACGGC GAGGTCGCCT TTTCGTGGCC GCGTCGGTGG CGGTCATGGT CGTGCTGGCC GCCGTGGGAT GCACATCGGA CTCGTCCGAT GACGAGGTGC CCCAGGGAGG ATCCGAGAGC GGAACCATCG CGCTCCTGTT ACCCGAGACC CAGACGACCC GCTACGAATC GGCCGACCGC CCCTACTTCG AGGCGCGGAT GGCGAAGATC TGCCCCGACT GCAAGGTGCT GTACTCGAAC GCCGACCAGG ACTCGGCCGC CCAGCAGAAC CAGGCCGAGC AGGCCATGAC CAACGGCGCC AAGGTCCTCG TCCTGGACCC GGTGGACGGC GAGGCCGCGG CGGTGATCGC CCGCAATGCG CGGGACCGTG GCGTGCGCGT GGTCTCCTAC GACCGGCTCA TCCAGAAGGC GCCCGTCGAC GCCTACATCT CCTTCGACAA TGAGAAGGTC GGCCAGTTGC AGGGCCAGGC GCTCCTCGAC GCGATCGGCG ACCGGGCCGG CGCCGGCAAG GTCATCATGA TCAACGGCTC GCAGGACGAC CCGAACGCCC AGCAGTTCAA GGACGGCGCG CTGTCGGTCC TGGAGGGCAA GGTGACGATC GGCTTCGACA CGTTCACCCC CGACTGGTCT CCCGACACCG CCGGTCGGGA GATGGACCAG GCGATCACCA CCGTCGGCCG GGAGAACATC GTCGGGGTCT ACGCCGCGAA CGACGGCATG GCCGGCGCCG TGGTCGCCGC GCTGCGCCGG GCGAACGTGA ACCCGCTGCC GCCCGTCACC GGCCAGGACG CCGAACTCGC CGGGGTACAG CGCGTACTCG CCGGAGATCA GCACATGACC GTCTACAAGG CCATCCGCCC CGAGGCGGAG CAGGCGGCCG ACCTGGCGCT CGCGCTGCTG CGCGGTGAGC CCGTCGACAC GATCGCGACC GGGCACGTCG ACAACGGCAA CGGCCAGGTT CCCGCCGTCC TGCTGGAACC GGTCGCGGTC ACCCGGGACA CCGTCGCCGC GACGGTGGTG AAGGACGGCT TCATCGCCAA GGCCGACCTG TGTGCCGGCA CGTACGCGAC AGCCTGCGCG TCCGCCGGCA TCTCCTGA
|
Protein sequence | MADNSLDDGV PRRGRMRGHA GRRRGRLFVA ASVAVMVVLA AVGCTSDSSD DEVPQGGSES GTIALLLPET QTTRYESADR PYFEARMAKI CPDCKVLYSN ADQDSAAQQN QAEQAMTNGA KVLVLDPVDG EAAAVIARNA RDRGVRVVSY DRLIQKAPVD AYISFDNEKV GQLQGQALLD AIGDRAGAGK VIMINGSQDD PNAQQFKDGA LSVLEGKVTI GFDTFTPDWS PDTAGREMDQ AITTVGRENI VGVYAANDGM AGAVVAALRR ANVNPLPPVT GQDAELAGVQ RVLAGDQHMT VYKAIRPEAE QAADLALALL RGEPVDTIAT GHVDNGNGQV PAVLLEPVAV TRDTVAATVV KDGFIAKADL CAGTYATACA SAGIS
|
| |