Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2413 |
Symbol | |
ID | 3906396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2801029 |
End bp | 2802267 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637879743 |
Product | major facilitator transporter |
Protein accession | YP_481509 |
Protein GI | 86741109 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00423271 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.219014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCGTCT ACAATTTCCG GCTCTTCGCG GGTGGCCAGG TGTTGTCCGT GACGGGAACC TGGATGATGG TCGTGGCCCA GGACTGGCTC GTCCTTTCGC TGAGCGGCGA CTCGGGAACA GCCCTGGGCC TCGTGACCGC CCTGCAGTTC ACCCCGATGC TGTTGCTGAC CCTCTACGGC GGAAGGCTCG CGGACCGCTA CGACAAACGC CGGCTACTGA CCGTCGCGAA CCTGGCCTCC GGACTGCTCG CCCTGGCGCT GTCGCTGCTG GTGTTAACCG GCACGGTCCG GCTGTGGCAC ATCTGCCTGT TCGCACTGGG CCTCGGTCTC GTGAACGCCG TCGAGATGCC GACCCGGATG GCATTCGTGA GCGAGCTGGT CGGCGCCGAA CTCCTCCCCA ACGCGTCGGC GCTGAGCGCG GCGTACTTCA ACGTCGCCCG GGTCGCCGGC CCGGCCGTCG CCGGCCCGTT GATCGGCGGC TTCGGTACCG GCCCGGTCAT GATGTTCAAC GCCGTGAGCT ACCTCGCGAC GGTGGTGGGT CTGCGGATGA TGCGCCCGGC CGAGATGCAC CGCGACGCGC GCCGGGCGAC CTCCACCCGC GTCGTCGACG GACTGCGGTA CGTGCTCGGC CGGGAGGACC TGGTGCTCGT GCTTGGACTC GTCGCGACGA TCGGGCTGTT CGGCCTGAAC TTCCAGCTCA CCGTGCCCCT GCTCGCCAGG ACGGTATTTC ATGCCGACGC GGCGGCCTTC GGGCTGCTCA CCAGCGGGCT CGCCGCCGGA TCGCTGCTCG CGGCCCTCGT GACGACCGCC CGCCGCACCC GCCCGTCGGC CGGCATGGTG ATCGGATCCG CGCTCGCGTT CGGCCTGCTG GAGACGGTGA CCGGCTGGGC GCCCAGCTAT GCCACGGCGG CCGTGCTCCT GATCCCTACC GGCTTCGCGA CGATCTACTT CGCTCAGGCG GCGAACCACC GCATCCAGCT CGGCAGCGAT CCGCAGTACC GAGGCCGGGT GATGGCGATC TATACCCTGA TCCTGCAGGG ATCGACGCCG CTAGGTGCCC TGTTCGTCGG CTGGCTCACC GAGCACCGTG GCGCCCGCGC CGGGTTCTAC ATCGGAGGTC TCGTTTCGGT CGCCGCCGCG ATCGCCACAC TGGTGGTGGA CCGCATGCGG GCCGCGGCCG TCGACGACCC GCCACGCCCC GGCGCCCCGG CGCCCGGCGC CCGAACTGTA GAGGGTTAG
|
Protein sequence | MSVYNFRLFA GGQVLSVTGT WMMVVAQDWL VLSLSGDSGT ALGLVTALQF TPMLLLTLYG GRLADRYDKR RLLTVANLAS GLLALALSLL VLTGTVRLWH ICLFALGLGL VNAVEMPTRM AFVSELVGAE LLPNASALSA AYFNVARVAG PAVAGPLIGG FGTGPVMMFN AVSYLATVVG LRMMRPAEMH RDARRATSTR VVDGLRYVLG REDLVLVLGL VATIGLFGLN FQLTVPLLAR TVFHADAAAF GLLTSGLAAG SLLAALVTTA RRTRPSAGMV IGSALAFGLL ETVTGWAPSY ATAAVLLIPT GFATIYFAQA ANHRIQLGSD PQYRGRVMAI YTLILQGSTP LGALFVGWLT EHRGARAGFY IGGLVSVAAA IATLVVDRMR AAAVDDPPRP GAPAPGARTV EG
|
| |