Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3559 |
Symbol | |
ID | 5671928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4223260 |
End bp | 4224840 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242445 |
Product | extracellular solute-binding protein |
Protein accession | YP_001507865 |
Protein GI | 158315357 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.43971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTTG CTCTCAACCG TCGCGCGTTC GGGCGTGCCG GCCTGCTCGC CGTCGGCGCC CTCACGGCGC CGGCACTGCT CGCGGCCTGT GGCGACGACG CCCCCGCCGC ACCAGGCGCG GGCACGGGCG CGGCGGGGGC GACAGCGGCC CCGCGCCGCG GCGGGAAACT GCGGGCCGCG TTCGCGAGCA GTCCGGCCGA CACCCTGGAC ATCGTGAAAG GCCACGACGT GCTGGGCTCG GTGCGCGGCC TCGCGGTGTA CGAACGCCTC GGCGACGCCC ACCCGGACGG CTCGGTCTCC CCGCGCCTGT TCGAGAGCCT GACGCCCAAC GCCGACGCGA CCGTCTGGAC GCTGAAGCTG AAGCCGGGGA TCACCTTCTC CGATGGCAGG CCGTTGACGA CGGCCGACGT GCTGGCCAGC TTCGCGACGT TCACCGGCAC CGAGAGCGGC GCCGCGATCG CCGCGTTCGA CCCGAAGGAG AGCAAGGCGA CGGACGCGCG CACCGCGACC ATCGCGCTGA CCGCCCCCGT GTACGACCTG CCGGCCCGGG TGAGCGGCGT CGTCCTGGTG ATCATGCCGG AGGGCAAGCC GGCGTCTCAG CTCGGTGACG TCGTCGGCAG CGGGCCCTAC GAGATCGCCT CGTTCGTGCC CGGCCAGCGG ACGGTCCTGC GCAGGCGCAC CGACTACTGG GACGGCGACG CCCGGGGCTA CCTGGACGCG ATCGAGCTGG TGGCCGCGCC GGACGCGAAG TCGCGGCTGT CCGCGCTGCG GGCGGGCCAG GTGGACTGGG CCGACGACAT CGCCTACCTG GACGCCTCGA CCCTGCGGGA GGACCGCGCG ATCACGATCC ACCGCGGCGC GGCCGAGCAG GGCCTGGCCT GGTTCCTGAA CATGGCGGCG CCGCCGTTCG ACGACGAGCG CGTCCGGCAG GCCCTGCGGT ACTCGGTGGA CCGCCAGAAG CTCGTCGACA CCACGCTGTT CGGATTCGGC TCCGTCGGCA ACGACCTGTG GGGCAAGGGC CTGCCGAACT ACAACGGCTC GATCCCGCAG CGGCCGCACG ACCCCGCGAA GGCGAAATCG CTCCTGCAGG AGGCCGGGGT GGCCACCCCG GCCAAGGCCA CCCTGCTGAC CTCGCCCATC GGCCCGGGTC TCGTCGAGGC GACCCAGCTC CTCGCCGACC AGGCCCGCGA GGTCGGCTTC GACATCAAGG TCGAGGTCAT CCCGCCGGAC GTCTACTTCG CCCGGCCCGA GGAGTGGGCG AAGGCGTCGG GCGTGGCGTT CGCCCAGGTC GGCGCGTTCA CCGACATGGC CCCGCTGGTC TACCTGTCCG ACGGCCCGTT CAACTTCGGC TGGCGCAAGC CCGACTGGGA CGCCGGGTTC GCCGACGGAG TGGGCGAGCT CGACGCCGCG AAGCGCAAAG CGACGTTCGA CGGCCTGCAG CAGCAGCTCT GGGATTCCGG CTCCGACCTC GTGTGGGGGT TCGCGCCGAG GCTCGTCGCC GCCGCGCCGT CCGTCGGGGG CGTCGACTCC AGTCCCAACT TCGGCATCCC GGACCTGGTC TTCATCCACC GCACGGGGTG A
|
Protein sequence | MNVALNRRAF GRAGLLAVGA LTAPALLAAC GDDAPAAPGA GTGAAGATAA PRRGGKLRAA FASSPADTLD IVKGHDVLGS VRGLAVYERL GDAHPDGSVS PRLFESLTPN ADATVWTLKL KPGITFSDGR PLTTADVLAS FATFTGTESG AAIAAFDPKE SKATDARTAT IALTAPVYDL PARVSGVVLV IMPEGKPASQ LGDVVGSGPY EIASFVPGQR TVLRRRTDYW DGDARGYLDA IELVAAPDAK SRLSALRAGQ VDWADDIAYL DASTLREDRA ITIHRGAAEQ GLAWFLNMAA PPFDDERVRQ ALRYSVDRQK LVDTTLFGFG SVGNDLWGKG LPNYNGSIPQ RPHDPAKAKS LLQEAGVATP AKATLLTSPI GPGLVEATQL LADQAREVGF DIKVEVIPPD VYFARPEEWA KASGVAFAQV GAFTDMAPLV YLSDGPFNFG WRKPDWDAGF ADGVGELDAA KRKATFDGLQ QQLWDSGSDL VWGFAPRLVA AAPSVGGVDS SPNFGIPDLV FIHRTG
|
| |