Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0426 |
Symbol | |
ID | 5668849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 501353 |
End bp | 502888 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641239358 |
Product | extracellular solute-binding protein |
Protein accession | YP_001504797 |
Protein GI | 158312289 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGCA GTAAGTTAGC ACTCGTCGCC TCGGCGGTGG CGCTCGCCGC GCCACTCGCC GCGCTGCTGG CCGGGTGCAC CGGAGAAGCG GCGCCAGGTG CCGAGTCGTC CGCCGGCGCC ACCAAGAACA CGCTCACGCT CGCCATGTCG GCCGACATCA CCGGCTGGGA CCCCTCGAAC CAGCCAGGAT ACCAGGGCTG GGGACCGGAA GCCATCTGGG ACACCCTGAT CAAGTGCGAT GCCTTCGGCA AGCCGGAGGC CGACATCGCC GACAAGTGGA CGGTCAGCGC GGACGGCAAG TCCTTCACCG CGCACATCCG CGAGGGCCAG AAGTTCTCCG ACGGCACGCC GGTCGACTCG GCCGCCATCG CGGCATCGTT CCAGTACCTC ACCTCGCACG GCGGGTCCCA GGGCGACTAC AAGGGCCTCA AGACCGACGC TCCGGACGCG CAGAACATCA CCCTCACCTG GCCCGAGCCG CAGCCCATGA TCATTCAACG GGTCTGCAAC CCGAAGATCA CCACCAAGGC GCTGCTCGAC TCCGGCAAGG TCAACGACAA GCCGGTCGGA TCCGGTCCGT ACGTCCTGGA CGAAACCGCC ACTACCCGCG GCTCGGTGTA CACGTTCACC AAGAACGAGA GCCACTGGAA CACTGCCGCT TACCCGTACA AGAAACTGGT TTTGAAGGTC ATCGAGAGCG AGACCGCGCG CGTCAGCGCC CTGAAGACCG GGCAGGTCGA CGGCTCGCTG ATCACGGCGG CCAGCTACAA CGAGGTCGAG GCCTCAGGTC TGAACGTGGT GACGATGCAG GGCCAGACGA CGCGCCTGTT GCTCACCGAT CACCTGGGTA AGGAGGTCCC GGCTCTGGGC AGCCTGAAGG TGCGCCAGGC GATCAACATG GTCTTCGACC GTGACGCCAT GGTGAAGAAC CTCTACCAGG GTCACGGCAA ACCGGCGTAC CAGATCTTCC GGCCCGGCAG CGACGCGTAC ATCGACGGCA TGGCGGACCC GTACCCCTTC GATGTCACCA AGGCCAAGGC ACTGATGGCC GAGGCCGGCT ACGCCAGCGG CTTCGACCTG AGCCTGCCGA CCATGGCCGG TCAGAACCAC GAGACCCTCA TGCCGTACGT GACGCAGCAG CTCGGCCTGC TCGGCATCAA GGTCAAGCAG GTGCCGCTCT CCGGCGCGAA CGCGATCGGT GACCTGCTCA GCGGCACCTA CCCGGTGGTC CTGTGGCAGC TCGGCAACTT CGGCCAGTCG CTGCTCGACA TCGACGTCGT GGTGCGCTCG ACCGGCTACT GGAACCTCGA GCACCAACCC GACGCGACGG TGGATGGTCT GTGGGAGAAG ATCCTCACCG GCGACGAGGC CACCCGCAAG ACCGCCCAGC AGGACATCAA CAGGTACGTC ATTGAGCAGG CCTGGTTCGC GCCGATGGTC AACCCCGACG GGTTCTACGC GCACAGCCCG GACGTGAAGA TCGATCACGT CTCGGACCAC GAGGCGCTGA CCCCGAAGCT GCGCGACTTC AAGTAG
|
Protein sequence | MRRSKLALVA SAVALAAPLA ALLAGCTGEA APGAESSAGA TKNTLTLAMS ADITGWDPSN QPGYQGWGPE AIWDTLIKCD AFGKPEADIA DKWTVSADGK SFTAHIREGQ KFSDGTPVDS AAIAASFQYL TSHGGSQGDY KGLKTDAPDA QNITLTWPEP QPMIIQRVCN PKITTKALLD SGKVNDKPVG SGPYVLDETA TTRGSVYTFT KNESHWNTAA YPYKKLVLKV IESETARVSA LKTGQVDGSL ITAASYNEVE ASGLNVVTMQ GQTTRLLLTD HLGKEVPALG SLKVRQAINM VFDRDAMVKN LYQGHGKPAY QIFRPGSDAY IDGMADPYPF DVTKAKALMA EAGYASGFDL SLPTMAGQNH ETLMPYVTQQ LGLLGIKVKQ VPLSGANAIG DLLSGTYPVV LWQLGNFGQS LLDIDVVVRS TGYWNLEHQP DATVDGLWEK ILTGDEATRK TAQQDINRYV IEQAWFAPMV NPDGFYAHSP DVKIDHVSDH EALTPKLRDF K
|
| |