Gene Information |
Locus tag | Franean1_7039 |
Symbol | |
ID | 5675350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8589549 |
End bp | 8590835 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641245885 |
Product | extracellular solute-binding protein |
Protein accession | YP_001511276 |
Protein GI | 158318768 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
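The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes a single terminal stop codon (the gene sequence listed in this record ends in TGA). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 8589549
end_bp = 8590835
gene_length_bp = 1287
protein_length_aa = 428


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions both comparisons agree with the Gene Length and Protein Length fields. The strand field does not affect the check, because the interval length is the same on either strand.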
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGCGTA AATCCTTTAC CGGAATATTA GCAGTGACCG CCGCCGTGGC GCTTCTGCTC GCCGGGTGCG GCCAGGCGGG GAACAGCACG ACGACCGCCG ACGGCAGGAT CCAGATACCC ATGTGGACCC ACTCGGCCGG CAACCCCGCC GAGCTCGCGG TGTACAAGCA GATCATCTCC GACTTCAACG AGTCGCAGGA CAGGTACGAG GTAGTACAAC AAGACTTTCC CCAGGTCACC TATAACGACG CGATTGTCGC GGCCGCCGCA GCGGGGGACC TGCCGTGCCT CATCGACATG GACGGACCGG TGATGCCGAA CTGGGCCTGG TCCGGCTACC TGCAGGAGCT CAATCTTCCC AAGCAGCTCA CGGACAGCCT GCTGCCGACG GCGGTCGGCA CGTACAAGGG AAAGATCTAT TCGGCCGGAT ACTGGGACGC CGCACTGGCA ATTTTCGCGC GCAAGTCGGT GCTTGACAAG AACGATATCC GCATCCCCAC CGTGGACAGG CCGTGGACGA AGGACGAGTT CGACTCCGCG CTCGCGACGC TGCAACAGGC CGGATACGAT ACTCCGCTCG ACATCGGCGC GGAGGACACC GGCGAGTGGT GGTCGTACGC GTACTCCCCC ATGCTGCAGA GCTTCGGTGG CGACGAAATC AATCGAGACA CCTACCGCAC GGCAGAGGGC GCTCTTAACG GGCCGGCTGC CGTGAACTTC TTCACGTGGT TCCAGGATGC CTTCAAGAAA GGCTGGGCAA GCAACTCCGG CACGATCGGG AACCAGGAGT TCGTCGACGA CAAGGTCGCG CTGAGCTACA CGGGCGTGTG GAATGCGCTC GACTCGCTCG AAAAGATCGG CGACGACCTG CTCATCCTCC CGCCTCCGGA CTTCGGCCAG GGGCCCAAGA TCGGCGGTGG CTCATGGCAG TGGGGCATCA CGGCCGGATG CGAGCAAGCC GACGGCGCGC GCCAGTATCT GCGGTTCAGT TTCCAGGACA AGTACATCGC GCAGTTCGCG GACAGCCAGA TCGTCATCCC CGCGACGGCC GGTGCCGAGG AGCTCTCGAA GTACTTCACG GCCGATGGCG CTCTGCGCCC CTTCGTCGTG CTCTCCCAGA AGTTCGCCCT CGCGCGGCCC GCGACCCCGG CCTATTCCGT GATCTCGTCG ATCTTCGAGA AGGCGACAAA GGACATCATG AACGGCGCCG ATGTGAAATC CACACTCGGC AGTGCCGTGG AGGATATCGA CGAGAACATC ACGGCCAACG ACAACTACGG CTCCTGA
|
Protein sequence | MKRKSFTGIL AVTAAVALLL AGCGQAGNST TTADGRIQIP MWTHSAGNPA ELAVYKQIIS DFNESQDRYE VVQQDFPQVT YNDAIVAAAA AGDLPCLIDM DGPVMPNWAW SGYLQELNLP KQLTDSLLPT AVGTYKGKIY SAGYWDAALA IFARKSVLDK NDIRIPTVDR PWTKDEFDSA LATLQQAGYD TPLDIGAEDT GEWWSYAYSP MLQSFGGDEI NRDTYRTAEG ALNGPAAVNF FTWFQDAFKK GWASNSGTIG NQEFVDDKVA LSYTGVWNAL DSLEKIGDDL LILPPPDFGQ GPKIGGGSWQ WGITAGCEQA DGARQYLRFS FQDKYIAQFA DSQIVIPATA GAEELSKYFT ADGALRPFVV LSQKFALARP ATPAYSVISS IFEKATKDIM NGADVKSTLG SAVEDIDENI TANDNYGS