Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6358 |
Symbol | |
ID | 5674674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7718556 |
End bp | 7720115 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245207 |
Product | extracellular solute-binding protein |
Protein accession | YP_001510602 |
Protein GI | 158318094 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCCCA AAGTACGATT ACTCGTCACC GCTGCCGTCT GTTGCGCAAC GCTGGCCCTG GGCGCCTGCG GGGGAGGTGG CGACACCGGC CCGTCCTCGG GTGCCTCCGG CGAACCGGTT GCCGGTGGCG AGGGAAGAAT TCTCACCCTC AGCGATCCGC GCAGCCTCGA CCCGGCGGCT CTCGGCAACG CCTACGCAAC CACCGGTGTT GTGGGAAACG CGCTGTACGG AACGCTGATG ACCGACCCGG GCGGCAAAAT ACGGTACTCG ATGGCCGAGT CCTTCCAGAC CACCGACGCC GGGGCGACAT TCGAGCTGAA ACTGCGGTCG GGTCTGGTGT TCTCCGACGG AACCTCACTG GACGCCGAAG CCGTGAAGTT CAACTGGGAC AGGCTCAAGA ACCCGGCCAC CGCCGCCATC TCCCGGTCGG AGGCGTCGAT GATCGCCTCA TCCGACGTGG TCGATGACAC CACCTTGAAG ATCACCATGG CCACGCCGGT GCCGAAGTAC GCCCAAGCCG TCCTCACCTC GTCCCTGAAC TGGATCGCCT CGCCGACCGC CCTGGAGAAG GGGCCGCAGG CCTTCGACGC GAACCCGATC GGCGCCGGGC CATTCACCCT GCGGAGCTGG ACACGCCAGG CCGCCATGGA ACTGGTCAAG AACCCCCGCT ACTGGGACGC CCCCAAGCCC TACCTCGACC GTCTCACCCT CCGCGCGGCC CTCGACAGCA GCCAGCGCTA CAACACGGTG CTCACCGGGG GCGCGGACGC GGCCGTCGAG TCGAGCTGGG TCAACCTCGA CAAAGCCGAG CAGGCCGGCC TGCCGACGAA CCTCATACCG ACCGGCGGCG GCATCTTCAT GGCGCTGAAC ACACGCAGGG CACCCTTCGA CGACGTCCGC GCCCGCCAGG CCCTCGCCGC GGCAATCGAC AGGGACGCAC TCAACCAGGC TGTCTACAGC GGGACCGGCG AGCCCGTCGA CACACTGTTC AGTAAGGACT CTCCTTACTA CTCGGACACG CCGCTGGCGA CAACGGACCG TGCACGGGCG CAACGGCTCC TCGACGAGCT GGCCGCCGAC GGCAAACCGC TGTCCTTCGT CTTCTCCAGC GTCCCCACGA CGGATGGCAA GGCGATCGCG GAGAACATCC AGGCCCAGCT CAGCAGCTTC AAGAACGTCA CCGTCAAGAT CAAGACCATC GAGGTCGCGG AGCTGGCCGC GCTGCGCACC ACCCACGACT TCGACGTGCT CGTCTCGTCG GCCTTCTTCC GGGACCCCGA ACCGCGGCTG TGGACGACCT TCCACGGGAC CTCGGCGGCG AACCTGCCCG GCATCAACGA CCCGGCACTC AACGAAAGCC TCGCGGCCGC GCGCACCGCG ACTTCGGAGC CCGAGCGCGA ATCCGCCTAC GGGACACTGC AGGAACGGCT GGCAGAGCTG ACCCCGGTGG TCTTCCTCGC GCAGGCGGCA CCCAGCGCCT TCTCGAGCAA GAACGTCGGC GGACTCGTAC AGTACGGCCT CGGCTCACTT CAGCCCGAGG AACTCTGGAT TCAGCGCTAG
|
Protein sequence | MIPKVRLLVT AAVCCATLAL GACGGGGDTG PSSGASGEPV AGGEGRILTL SDPRSLDPAA LGNAYATTGV VGNALYGTLM TDPGGKIRYS MAESFQTTDA GATFELKLRS GLVFSDGTSL DAEAVKFNWD RLKNPATAAI SRSEASMIAS SDVVDDTTLK ITMATPVPKY AQAVLTSSLN WIASPTALEK GPQAFDANPI GAGPFTLRSW TRQAAMELVK NPRYWDAPKP YLDRLTLRAA LDSSQRYNTV LTGGADAAVE SSWVNLDKAE QAGLPTNLIP TGGGIFMALN TRRAPFDDVR ARQALAAAID RDALNQAVYS GTGEPVDTLF SKDSPYYSDT PLATTDRARA QRLLDELAAD GKPLSFVFSS VPTTDGKAIA ENIQAQLSSF KNVTVKIKTI EVAELAALRT THDFDVLVSS AFFRDPEPRL WTTFHGTSAA NLPGINDPAL NESLAAARTA TSEPERESAY GTLQERLAEL TPVVFLAQAA PSAFSSKNVG GLVQYGLGSL QPEELWIQR
|
| |