Gene Information |
Locus tag | Franean1_3464 |
Symbol | |
ID | 5671835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4093947 |
End bp | 4095536 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242352 |
Product | extracellular solute-binding protein |
Protein accession | YP_001507772 |
Protein GI | 158315264 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
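The coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates (so length = end - start + 1) and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length(4093947, 4095536)
print(length)                  # 1590
print(protein_length(length))  # 529
```

Both results match the Gene Length (1590 bp) and Protein Length (529 aa) fields of the record.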
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.739249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGCG GACGCCCGCC AAGACCCCTC CGATCATTCT GGTCCGCCCG GCGGTTGCGG ACCGCGGCGG CGGCCGCGCT GGCCGCGCTG GCGGTCAGCA CGGTCGCCGC CTGCGGCGGT TCCACCGCCG CCGACGGGAC CGCGCCGGCG GCGCAGGGCG GGACCCTCAC CGTCCTGTGG CCCGCCGAAC CACTCGACCT GTCCACCGAC AGCGGCTTCG GACTACAGAT GATCTCCGGA TCGATCGAGC GCCTCGCCGT GTACGACGCC CTGGTCGCGA TCACCCCGGC CGCCGAGCTG GACTACCGCC TGGCCACCTC CCTCGACTCC GACGACTCGC TGACCTGGAC GCTGCGGCTG CGCGAGGGGC TGCGGTTCAG CGACGGCACC CCGCTGGACG CCGCCGCGGT CCGGGACAAC TGGACCCTGC TGGCCGATCC GGCCCGCAAG TCACCCAGCG CCAAGATCGC ACAGCGGATC GGGTCATTCA CCATCGTGGA CCCGACAACC CTGCGGATCA CGCTGAAGGA GGCCGACGGC CAGTTCCCCC GGCTGGTCGC GCAGACCCCG CTGACCTTCA TCGGCTCCCC CACCGCGCTG CGCGCCAAGG GCGACGGGTT CAAGACCGCG CCGGTCGGCG CCGGGGCCTT CACCGTCCGG GAGTGGCTGC GCAACGACCA CCTGACTCTG GTCCGCAACC CGACCTCGTC GGTCCAGGCG CACTACGACA CGATCGTCGT CAAGTCCGTC CCGGACGAGA CGCAGCGCTA CAACACCCTG CTCGCGGGGG GCGCGGACAT CGCCTTCTCC GCGAACCTGC GCACCGGCAT CACCGCGGTC GCGGCCGGGC TGGTCACCGA GAAGGCGTTC AGCGACGGCG GGCTCAACCT GCTGTTCAAC ATCACCAAGG CCCCCTTCGA CGACATCCGG GCCCGCCGGG CGGTCTCCTA CGCCCTCGAC GCGCAGGCGC TGAACAAGGC CCTGTTCGAC GGGACCGCCG CCGTGCCGTC CAGCTTCCTG CGCGACGACT CGCCGCTGCA CAGCGACGTG CCGCTGCCCC GCCCGGACCG GGCGAAGGCC CAGGCCCTGT TCGACGAGCT CGCCGCCGCG GGCAAGCCGG TGCAGTTCAC CATCATCTCG CCGCTGAACT TCAGCAACGT CGCCGAATGG GTGCAGTCCA GCCTCGGTGG CTTCCGGAAC GTCAGCGTGA AGGTCGACGC GATGGCCCAG ACGCTGCCCG TGCTCCAGGG CGGCTTCCAG GCCACGCTCA CCGGCACCCC GCGGTTCGTG GACCCCTACC CGCAGCTGGC CCTGAACCTG GGCACCGGCG GCCCGAGCAA CTACGGCAAG TTCTCCGACC CGGCCCTCGA CGCCGCGCTG CGGGAGGGGC AGCAGTCCCG GGACACGGCC GTCCGGGTCC GGGCCTACGA GACCGCGCAG CGGATCATCG CCGAACAGCT GCCGCTGGCC GGCCCGCTGT ACCGCCTGCC GGGCCAGTAC CTACACGCGT CCACGGCCTT CGGTACGGGC AAGCTCCCGA TCATCAACGA CGGCGTGCTC GACATCACCC GGCTCACCGG GGCGGGGTGA
|
Protein sequence | MRRGRPPRPL RSFWSARRLR TAAAAALAAL AVSTVAACGG STAADGTAPA AQGGTLTVLW PAEPLDLSTD SGFGLQMISG SIERLAVYDA LVAITPAAEL DYRLATSLDS DDSLTWTLRL REGLRFSDGT PLDAAAVRDN WTLLADPARK SPSAKIAQRI GSFTIVDPTT LRITLKEADG QFPRLVAQTP LTFIGSPTAL RAKGDGFKTA PVGAGAFTVR EWLRNDHLTL VRNPTSSVQA HYDTIVVKSV PDETQRYNTL LAGGADIAFS ANLRTGITAV AAGLVTEKAF SDGGLNLLFN ITKAPFDDIR ARRAVSYALD AQALNKALFD GTAAVPSSFL RDDSPLHSDV PLPRPDRAKA QALFDELAAA GKPVQFTIIS PLNFSNVAEW VQSSLGGFRN VSVKVDAMAQ TLPVLQGGFQ ATLTGTPRFV DPYPQLALNL GTGGPSNYGK FSDPALDAAL REGQQSRDTA VRVRAYETAQ RIIAEQLPLA GPLYRLPGQY LHASTAFGTG KLPIINDGVL DITRLTGAG
|
| |
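The gene sequence above is shown in blocks of ten bases and already reads in the coding direction (it starts with ATG and ends with the stop codon TGA), so the protein can be reproduced by removing the spaces and translating codon by codon. The following is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace used for 10-base grouping and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGAGACGCG GACGCCCGCC AAGACCCCTC"
print(translate(prefix))  # MRRGRPPRPL
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared against the GC content and Protein sequence fields of the record.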