Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3944 |
Symbol | |
ID | 5672305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4713651 |
End bp | 4715045 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242823 |
Product | extracellular solute-binding protein |
Protein accession | YP_001508240 |
Protein GI | 158315732 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.920276 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0416082 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGAC GGCGCTTTGC CCGGGGCGTG GCGGCCGCCT GCGCTGCCGC GCTCGCCCTC ACCCTCGCCG CCTGCGGATC CGACGACGGC GCCGACGCGG GCGGGACGTC CACCGCCGGG GTCGTGCCCG AGCTCGGGCC GGACCAGAAG GTCTCGATTG TCTTCGAGAG CTACAACCTC GCCCAGCCCG GCCCGTGGAC GGACACCTTC AACGGCCTGA TCGCCGACTT CGAGAAGGCC CACCCGAACA TCTCGGTCAC CGCGCAGAAG CCGCAGACCT CCACTCTGAA GGGCTACGGC TCGGCGGCCA CCGCGAGCAT CCAGGCCCAG ATCGCCACCG GCAACGCCCC CGACGTCGCC CAGCTCACCT TCGGCGACCT CGGCTACACC GCGACCGCGC TCGGCGCGAA GCCGCTCGAC GACATCGTCG GGCGCGACGC CGTCCAGGAG AACTTCGCCG GCACCCACCC GTTCGCGCCG ACGGCGCGCA CCCTCGGAGA TGTCGACGGC AAGACCTACG GCATGCCGTT CGTCTTCTCC ACCCCGGTGC TGTACTACAA CGCCGACCTG TTCACCCAGG CCGGCCTCGA CCCGGAGAAG CCGCCCACGA CCTGGGACGA GTTCAAGACC GCCGCGCTGG CCATCAAGGC GAAGACCGGC AAGAATGGCG GTTACATCGA CTGCCTCACC AAGGTCTCCG GCGACTGGTG CTACCAGGCG CTGGTCGCCT CCAACGGCGG CTCGGTGATC TCCGAGGACC GCACCAAGCT CACCTTCGCC GAGGCGCCCG CGGTGCAGGC GGTCGAGATG GCGCAGGACC TGGTCAACTC CGGCGCCAGC CCGAAGCTGT CACAGGACCA GGCCTACCCG GCGTTCGCCC GCGGTGAGAT CGGCATGATC GTCGAGACCA GTGCGGCGCA GGGCACCTTC ATCAAGGGCG CCGGCGGCGC CAAGCCGCCG TGGACGCTGC GCGCCACCGT CATGCCGAGC TTCACCGGCA AGCCGGTCGT GCCGACGAAC TCCGGGGCGG CGCTGTTCAT GTTCGCCAAG GACGCGGCGA AGCAGCGGGC CGCCTGGGAG CTGATCACCT ACCTGACCAG CGACGCGGCC TACACGCAGA TCACCAGCAA GATCGGCTAC CTGCCGCTGC GCACCGGGCT GCTCGACGAC CCGAACGGCC TGAAGACCTG GGCCGAGCAG AACCCGCTGG TCAAGCCGAA CGTCGACCAG CTCGCGAAGC TGAAGCCGTG GGTGTCCTTC CCGGGCAACA ACTACGTCCA GATCCGCACC GGGATGCTCG AGGCGGTCGA GAGCGTCGTC TACAGCGGCG CCGACCCGCA GAAGACGCTC ACCGACGCCC AGAACCAGGC CGCGAAGCTG CTGCCCCGGT CCTGA
|
Protein sequence | MKRRRFARGV AAACAAALAL TLAACGSDDG ADAGGTSTAG VVPELGPDQK VSIVFESYNL AQPGPWTDTF NGLIADFEKA HPNISVTAQK PQTSTLKGYG SAATASIQAQ IATGNAPDVA QLTFGDLGYT ATALGAKPLD DIVGRDAVQE NFAGTHPFAP TARTLGDVDG KTYGMPFVFS TPVLYYNADL FTQAGLDPEK PPTTWDEFKT AALAIKAKTG KNGGYIDCLT KVSGDWCYQA LVASNGGSVI SEDRTKLTFA EAPAVQAVEM AQDLVNSGAS PKLSQDQAYP AFARGEIGMI VETSAAQGTF IKGAGGAKPP WTLRATVMPS FTGKPVVPTN SGAALFMFAK DAAKQRAAWE LITYLTSDAA YTQITSKIGY LPLRTGLLDD PNGLKTWAEQ NPLVKPNVDQ LAKLKPWVSF PGNNYVQIRT GMLEAVESVV YSGADPQKTL TDAQNQAAKL LPRS
|
| |