Gene Information |
Locus tag | Franean1_3398 |
Symbol | |
ID | 5671769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4028176 |
End bp | 4029762 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242286 |
Product | extracellular solute-binding protein |
Protein accession | YP_001507706 |
Protein GI | 158315198 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
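The length fields above are consistent with the coordinates. A minimal Python sketch of that arithmetic follows. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(4028176, 4029762))  # (1587, 528)
```

The strand does not affect the result, because the length depends only on the span between the two coordinates.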
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.697295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTTCGCA GAGAGAAGTT ACTCACCACC GTCATCGCCT GTTGTGTGGC GCTCGCCGTG GGCGCCTGCG GCGGCGGTGA CGGCTCGTCC ACGACGACCG GCACCACCAC CACCGCCACC GGACCCACCG GCGACCCGGT CGCCGGCGGC TCGCTGCGGG TCATGCAGCT CCGCGAGCCC CGTTCGATGG ATCCGGTCAC CATGACGAAC GCGTGGCAGG TCAACGCGTT CGTGGGCAAC GCCCTGTACG GCACGTTGAT GATCAACGAT GAGCACACCA ACGAGATCAA GTACACGATG GCGGAGTCGT TCGCCACCAC CGACAAGGGC GCCACGTTCG CGCTGAAACT GCGACCGGGC CTGAAGTTCA CCGACGGCAC CCCGCTCGAC GCCGCCGCGG TGAAGTTCAA CTGGGATCGC CATCGGGATC CGGCGATGGC CTCGCCCTAC CTGCCGGAGG CGTCCCTCAT CGCCTCCACC GACGTCGTGG ACGCCACGAC CCTCAAGGTG ACGATGACCG AGCCGGTGCC CAGCTTCCCG GCCGCGCTGC TCACCACGTC CATGAACTGG ATCGCCTCCC CCACCGCCCT GCAGAAGGGC AAGGCGTCCT TCGACGCGAA ACCGGTCGGC GCGGGGCCGT ACACCGTGCG GACCTGGACC CGCCAGGACC GCATCGAGCT GGCGAAGAAC CCCGGCTACT GGGACGCCCC CCGGCCCTAC CTCGACAGCA TCACCATCCG CACGTCGGAC GACTCGGGCC AGCGGTACAA CACGCTGATC AGCGGCGGCG CCGACGTGGT CATCGACACG AACGCGGACA GCGTCGCCAA GGCCGGGAAG GCCGGCTACT CGACGGACGT CGCGCCGCTG AGCGGCGGCC TGTTCCTCGC GATGAACATG CGCAGGGCGC CGTTCGACGA CCTGCGGGCC CGGCAGGCGA TCGCGGCCGC CGTCGACCTG GACGCGCTGA ACCTGGCCGT GTACAACGGC GCCGCCCAGA CCGCCGACAC GCTGTTCACC AAGTCCTCGC CGTTCTACTC CGACAAGCCG CTGCGGACGT ACGACCGGGC GAAGGCGCAG CAGCTGTTCG ACGAGCTGGC CGCCGACGGC AAGCCTGTCA CCTTCACGTT CACCAGCTTC GCCTCGTCGG AGATCAGGGA CACGACCGAG AACGTCCAGG CCCAGCTCAG CTCGTACAAG AACGTCAAGG TTCAGGTGCG GGTGGTCGAC TTCTCCGAGT ATCCCAAGAT CCAGGCGAGC AACGACTTCG ACATGGTCGT CTGGTCGGCG AACTTCATCG ACCCGGACCC ACGGATGTGG ACGGCGTTCC GCAGCGACTC CCGCGGCAAC TTCCCGGGCG TCAAGGACGA GCGGATCGAC GCCGCCCTGA AGGCGGGCCG CACGGCGACG ACCGAACAGG AGCGCAAGGC CGCGTACGCC ACCCTCCAGG AGCGGCTGAC CGCGCTGGTT CCCGCGGTCT TCACCGTCCG GGTGAACCCG AGCGTCATGG CCGCCAGGAA CGTCGGCGGC GTCACCCAGT ACGGCCTCGG ATCACTCCTG CCCGAGCCCC TGTGGCTCCA GCCATGA
Protein sequence | MFRREKLLTT VIACCVALAV GACGGGDGSS TTTGTTTTAT GPTGDPVAGG SLRVMQLREP RSMDPVTMTN AWQVNAFVGN ALYGTLMIND EHTNEIKYTM AESFATTDKG ATFALKLRPG LKFTDGTPLD AAAVKFNWDR HRDPAMASPY LPEASLIAST DVVDATTLKV TMTEPVPSFP AALLTTSMNW IASPTALQKG KASFDAKPVG AGPYTVRTWT RQDRIELAKN PGYWDAPRPY LDSITIRTSD DSGQRYNTLI SGGADVVIDT NADSVAKAGK AGYSTDVAPL SGGLFLAMNM RRAPFDDLRA RQAIAAAVDL DALNLAVYNG AAQTADTLFT KSSPFYSDKP LRTYDRAKAQ QLFDELAADG KPVTFTFTSF ASSEIRDTTE NVQAQLSSYK NVKVQVRVVD FSEYPKIQAS NDFDMVVWSA NFIDPDPRMW TAFRSDSRGN FPGVKDERID AALKAGRTAT TEQERKAAYA TLQERLTALV PAVFTVRVNP SVMAARNVGG VTQYGLGSLL PEPLWLQP
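The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any length or composition check. The Python sketch below shows one way to do that and to compute a GC fraction. The helper names (`normalize`, `gc_fraction`, `ends_with_stop`) are hypothetical. The stop codon set TAA, TAG, TGA is the one used by translation table 11.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the last three bases form a stop codon (table 11 stops by default)."""
    return normalize(seq)[-3:] in stops


# First two blocks and last two blocks of the gene sequence above.
print(len(normalize("ATGTTTCGCA GAGAGAAGTT")))  # 20
print(ends_with_stop("TGTGGCTCCA GCCATGA"))     # True
```

Passing the full gene sequence string to `normalize` and `gc_fraction` is the intended way to compare against the Gene Length and GC content fields of the record.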