Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1895 |
Symbol | |
ID | 5670297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2273962 |
End bp | 2275299 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240817 |
Product | extracellular solute-binding protein |
Protein accession | YP_001506239 |
Protein GI | 158313731 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.854658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGGCC GGACAGGACG CGTGCGTGGT CGGCGGCTGA TTCCCGCATT GCTCGCAGCC GTTTCGGTGT TCGCCGCGGC GGCATGCGGC GGCGGCGACT CGAGCGCCGA GGGCACCGAT GGGGCGCTGG CTCCACTGGA CAGCGCGACC AAGGTCACCA TCAAGTTCCT CTCGTACAAC TACGGCACCC CCGGGCTCGG CGGCACGGGG ACGCAGGCCC TCATCGACGC GTTCGAGAAG GCACACCCCA ACATCACCGT CGAGCCGCAG GGCGTGGCGA CCAAGGACGT CCTCACCCGG CTGCGGACCG ACACCGCCGC GGGCAGCCCG CCGGACGTCG CGCAGATCGG CTGGAGCAAG ATGGCCGAGG CCGTGCAGGC CCTGCCGATC GTGCCCGTCC AGGAGATCCC GCCGACGTCG GAGTGGGACG AGCACGTCGC CGGGATGTCG AAGAGCATCC TCTCGGCCGT CTCGGTCGAC GACCAGGTGA AGGCGATGCC CTACACCATG TCGATCCCGG TGCTCTACTA CAACGCCGAC CTCTTCCGGG CCGCCGGGCT CGACCCGGCG AAGCCGCCGA CGACGATCGC GGACGTCAAG GCAGCCGGGC TGGCCATCAA GGCCACCGGG TCCGAGGGCG TCTACTACGG CGTCGTGGAC TCCGCGAAGT CCGACTTCCT CACCCAGTCG GTGGTGAACG GCAACGGCGG CAGCCTGGTC GACGCCAACG GCGAGGTCAC CCTCGACAAG CCGCCGGCGG TCGGGGCGCT CACGGCGGTG CAGGACCTCG TCAAATCCGG CGCGATGCCG TCGGTGAACA CCGAGACGGC GCTGGCCGCC TTCACTACCG GCAAACTCGG CATGCTCGTC ACCAGCACCT CCGTCCTGGC CAGCGCGATG AAGGCCGCCG ACGGCAAGTT CGAGCTGCGG ACCGCGGGCT TCCCGACCTT CGGCTCGAAG CCCGCCCGGC CCACCTACTC GGGCGCGGGG CTCGCGGTGC TGGCCAAGGA CGACGACCAC CGCCGGGCGG CCTGGGAGTT CATCAAGTTC GTCACGAGCG ACGCCGGCTT CGAGATCATC ACCTCCAAGA TCGGCTACCT GCCGCTGCGC GAGTCCGTCG CCACCAAGCT GGCGGACTCG GCGATCGTGA AGCTCCTCGC CCCGTCGCTG GAGCAGCTCG ACACCGTCGC GCCGTACACC GCCTTCCGCG GCACGAAGGC GAACCAGGCG GTCGTCGCGC TGCAGGACGA GGCGGTCGAA CCGATCGTGC TGCGGGGCGC GGACCCGGGG CCGACGCTCA CCTCGGTCGC CGAGAAGATC CGGAAGCTCA GCGCCTGA
|
Protein sequence | MTGRTGRVRG RRLIPALLAA VSVFAAAACG GGDSSAEGTD GALAPLDSAT KVTIKFLSYN YGTPGLGGTG TQALIDAFEK AHPNITVEPQ GVATKDVLTR LRTDTAAGSP PDVAQIGWSK MAEAVQALPI VPVQEIPPTS EWDEHVAGMS KSILSAVSVD DQVKAMPYTM SIPVLYYNAD LFRAAGLDPA KPPTTIADVK AAGLAIKATG SEGVYYGVVD SAKSDFLTQS VVNGNGGSLV DANGEVTLDK PPAVGALTAV QDLVKSGAMP SVNTETALAA FTTGKLGMLV TSTSVLASAM KAADGKFELR TAGFPTFGSK PARPTYSGAG LAVLAKDDDH RRAAWEFIKF VTSDAGFEII TSKIGYLPLR ESVATKLADS AIVKLLAPSL EQLDTVAPYT AFRGTKANQA VVALQDEAVE PIVLRGADPG PTLTSVAEKI RKLSA
|
| |