Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2067 |
Symbol | |
ID | 5670468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2490287 |
End bp | 2492089 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240989 |
Product | extracellular solute-binding protein |
Protein accession | YP_001506410 |
Protein GI | 158313902 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.153764 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACGCCT CTCCGGGGCG GCGGCTCGGT CATGCCCGAG GGATCGCCGC CCTGCTCACC GCCCTCCTGA CCACGGTCGC CCTGCTCACT GTCGCGGGAT GCAGCGGCGA GTCGGACCCG ACGCCGGGCC CGATGGCGGC GAGCCCGACC ACGCCGCCGA CGACCGCCTC CCCGCTCGAG AAGCCCGGCG GCACGCTGCG CCTGCTGACC GGGCGGATGC CGACCGGCGA CCCCGGGTGG GCCGACGAGC TGGGGGAGCG GGCGTTCGCC CGGCTGGTGA CCCGCCAGCT CTACAGCTAC CCCGCCGACG CCGACACCGC CAGGTCGACG ATCCCGCGCC CCGACCTCGC GGCCGGCGCC CCCGTGGTGA CCATGGGCGG CACCGTCTAC ACCGTGCGGC TGCGCTCGGC GGCCCGGTGG AACACCCCGA ACCAGCGCCG GATCACCGCG ACGGACGTCG CCCGCGGCCT CAAGCGGATG TGCGCGCCGC CGTCGCCCTC CCCGCTGCGC GGGTACTACG CGGCGACGGT CGTCGGCTTC GCCGAGTACT GCGCCGAGCT CGCCGCAGCG CCGGTGGCCG ACGCCCCCGC GCTGATCGAG AGCGGCACCG TCCCCGGGAT CGAGGTCATC GGCGACGACA CCCTCGCGTT CCACCTGATC AAGCCGGTCA ACGACTTCGT GGACATCCTG GCCCTGCCGG CCAGCTCGCC GGTGCCGCTG GAGGCCCTGG CCTACCCGCC GGACTCCCAG CAGTACCTCG ACAACCTGAT CTCCGACGGG CCGTACCGGT TCGTGTCGGA GCCCGGCGGT GGCTACCGGC TGTCCCGCAA CCCGGCCTGG AGCGGCTCCT CGGACGGCAT CCGGCGGGCG CTGCCCGACC ACATCACCGT CACCGACGGG CTCGACCCGG CGACCATCAC GGCGCGCATC GAGGCCGGTG ACGCGGACAT GGCCCTGAGC GGCGACATCC CCGCCGACGA CCTGGCCCGA CTGGTCGAGA GTGCCGACAA GAAGCTCGTG GTCGCTCCGA CCGGCCCGGT CGTCGCCCTC GTCGTCGGAC TCAACGGGCC GTCCGCGGCG GCCCTGCGCG ACCAGCAGGC CCGGGAGGCG CTGGCCTACT GCATCGACCG AACGGCGGTG GCCGCCGCGC TGGGTGGCCC CATGCTCGCC ACGGCGACGG CCCAGCTCCT GCAGTCGCCG ATGACCGGCT ACGAGACGTA CAACCCCTTT CCGGCCGGGG ACGGCTCCGG GGACTCACGG CGCTGCGCCG ACGGCCTCGC GAACAACCCG GGCGGGAAGG TGACGGCGCT GTCCCTGCTG ACCACGGACA GCGCCACCGA CACGGCGGTG GCCGAGGCGC TGCGCGCCGC GTTCGCCCGC GCCGGAATCC GCCTCGACCT GCGCATCCGC ACCGGCGCGC AGTACACGGC GGCCGCGTCG AGCCCTGGCG GGCAGTTCTG GGACCTCGCC CTGACCACGA TCACCCCGGA CTGGTTCGGT GACGCCGGTC GCACCGTCTA CGAGCCGCTG CTGGACGAGG CCTGGGTGGG CGCCCGGCCG GCCGACGGCG GCTACCGCCG TCCGGACCTC CTCGCCCGCT ACGAGTCCGC CGTGACGGCC TCCTCCGAGG ACGACGCCGC CACGGACTGG GCCGGGCTGG AGCGAACGGT GCTGAACGAC GCCGCGATCG TGCCCCTCGC GGTCACCCAC ACGTTGCGGC TGCGTAGCTC GGCGGTACAG GCGTTCACGA TCGTGCCGTC GCTGGGAACC GCCGATCCCA CAGCGGTTTC GCTCGGTCCC TGA
|
Protein sequence | MNASPGRRLG HARGIAALLT ALLTTVALLT VAGCSGESDP TPGPMAASPT TPPTTASPLE KPGGTLRLLT GRMPTGDPGW ADELGERAFA RLVTRQLYSY PADADTARST IPRPDLAAGA PVVTMGGTVY TVRLRSAARW NTPNQRRITA TDVARGLKRM CAPPSPSPLR GYYAATVVGF AEYCAELAAA PVADAPALIE SGTVPGIEVI GDDTLAFHLI KPVNDFVDIL ALPASSPVPL EALAYPPDSQ QYLDNLISDG PYRFVSEPGG GYRLSRNPAW SGSSDGIRRA LPDHITVTDG LDPATITARI EAGDADMALS GDIPADDLAR LVESADKKLV VAPTGPVVAL VVGLNGPSAA ALRDQQAREA LAYCIDRTAV AAALGGPMLA TATAQLLQSP MTGYETYNPF PAGDGSGDSR RCADGLANNP GGKVTALSLL TTDSATDTAV AEALRAAFAR AGIRLDLRIR TGAQYTAAAS SPGGQFWDLA LTTITPDWFG DAGRTVYEPL LDEAWVGARP ADGGYRRPDL LARYESAVTA SSEDDAATDW AGLERTVLND AAIVPLAVTH TLRLRSSAVQ AFTIVPSLGT ADPTAVSLGP
|
| |