Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4222 |
Symbol | |
ID | 5672577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5028475 |
End bp | 5030040 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641243095 |
Product | extracellular solute-binding protein |
Protein accession | YP_001508512 |
Protein GI | 158316004 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTTCA GTAGGTCTCG ATTAATCGTT ACGGCCGTTG TGTGCGGTGC GGTTCTGGCT CTGGGGGCCT GTGGTGGTGC TGATCCCGCG TCGCCGACGG GCGGTGCCAC CGGCGAGCCG GTCGCGGGTG GTCATGGCCG CATCCTGATG CTCAGCGACC CCCGTAGCCT GGACCCGGCG ACGCTCGGCA ACGCCTACGC GACCACCGGC GCCCTCGGTA ACGCCCTGTA CGGCACCTTG ATGACGACCG ACGATGCCGG TGAGATCCAG TACACGATGG CCGAGTCGTT CACCACCACC GACGGCGGCG CGACCTTCAC CCTGAAACTG CGCCCCGGCC TGACGTTCTC CGACGGCACC CCGCTGGACG CCGAAGCGGT GAAGTTCGAC TGGGACCGCC TCAAGGATCC GGCCACCCGC GCGACCAACC TGTCCGAAGC ATCGATGATC TCCTCGACCG AGGTCGTCGA CAGCACCACA CTGAAGATCA CCATGGTGGC GCCCGCACCG AAGTACGCCC ACTCGGTCAT CACCTCCACC CTGAACTGGA TCGCCTCACC CGCGGCTCTG CAAAAGGGCG CGCAGGCCTT CGACGCGGCC CCGGTCGGTG CCGGGCCGTT CACCCTGACG AGCTGGACCC GCCAGGCAGC CATCGAACTG GCCAGGAACC CCCGCTACTG GGACGCACCC AGGCCCTACC TCGACCGTCT CACCCTGCGC ACCACCTCCG ACACCGGCCA GCGCTTCAAC ACGGTGCTCA CCGGTGGCGC GGACGTGGCC ATCGAGTCGA ACCCGGTCAA CATCGAAAAG GCCACCGACG CCGGCCTGCC CACCACCGTC ATGGCCCTCA GCGGTGGCAC CTTCATCGCG CTGAACACCC GCCGGGCACC CTTCGACGAC GTCCGCGCCC GCCAGGCCGT CGCCGCCGCC CTCGACATGG ACGCGCTGAA CCTCGCCGTC TACAACGGCA AGGGCGAACC TGTCGACACC CTGTTCAGCG ACACCTCGCC GTTCCACTCG GACACACCAC TGCGCACGAC GGACAAGGCG ACCGCCCAAC GGCTCTTCGA CGAACTCGCC GCCGAAGGCA AGCCGGTGAC CTTCACCTAC TCCAGCGCTC CCACCACCGA GAACAGAAAC ACAGCCGAGA ACATTCAGGC CCAGCTCGGC GCTTTCAAAA ACGTCAAAGT CAACGTCAAG GTCATCGAAG TGACCGAACT CGCCGCGCTA CGCACGACCC ACGACTTCGA CGCGGCCACC TCGTCGGCGT TCTTCCAGGA CCCCGAGCCA CGCCTGTGGA CGGCCTTCGC CGCCAGTTCG GCCGCGAACC TGTCCGGGAT CAACGACCAG GAACTCAACG ACGCCCTCCT CGCCGGCCGG ACCGGTACGT CGGAACAGGA ACGCGCAGCC GCCTACAAGA CGGTGCAGCA GCGACTCACC GAGCTGTCCC CGGTGGTCTT CCTCACTCGA GCCGAACCCA GCGCCATCGC GGGAAAGAAC GTGGGCGGCC TCATCCAGTA CGGACTCGGA TCTCTGCTGC CCGACCAGAT CTGGATCCAG AAGTAG
|
Protein sequence | MMFSRSRLIV TAVVCGAVLA LGACGGADPA SPTGGATGEP VAGGHGRILM LSDPRSLDPA TLGNAYATTG ALGNALYGTL MTTDDAGEIQ YTMAESFTTT DGGATFTLKL RPGLTFSDGT PLDAEAVKFD WDRLKDPATR ATNLSEASMI SSTEVVDSTT LKITMVAPAP KYAHSVITST LNWIASPAAL QKGAQAFDAA PVGAGPFTLT SWTRQAAIEL ARNPRYWDAP RPYLDRLTLR TTSDTGQRFN TVLTGGADVA IESNPVNIEK ATDAGLPTTV MALSGGTFIA LNTRRAPFDD VRARQAVAAA LDMDALNLAV YNGKGEPVDT LFSDTSPFHS DTPLRTTDKA TAQRLFDELA AEGKPVTFTY SSAPTTENRN TAENIQAQLG AFKNVKVNVK VIEVTELAAL RTTHDFDAAT SSAFFQDPEP RLWTAFAASS AANLSGINDQ ELNDALLAGR TGTSEQERAA AYKTVQQRLT ELSPVVFLTR AEPSAIAGKN VGGLIQYGLG SLLPDQIWIQ K
|
| |