Gene Information |
Locus tag | Franean1_2459 |
Symbol | |
ID | 5670855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2926945 |
End bp | 2928102 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241376 |
Product | extracellular solute-binding protein |
Protein accession | YP_001506797 |
Protein GI | 158314289 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
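The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2926945
end_bp = 2928102

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1158 385
```

Under these assumptions the result agrees with the reported Gene Length (1158 bp) and Protein Length (385 aa).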
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0529514 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.605436 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAGAC GCGCCCGTGT CGTCGCCGTT CTCGTCGCGC TCGGGATCGG GCTGGGGCTG GCCGCGCTGC CGGGGTGCGC GACGACGGGC AGTCCGCTGC CCCGCCAGGC TCGGCCGCTG CCGCCCGCCG ACGGCGTCGC CGCCCCGGCC CGTCCCGAGG CGGCCGCCGG CTCGGCGGGG CCCCCGTCAA CACCAGGCCA GCCGGGTCCG TCCGCGCAGC CCTGTATCGC CCGCGCGAGT GTGCCCCCGC TCGCCGCCCT GCCGCCGCCG GGAGCGCGCG GATCGCGGAT CGAGGCGATC CGGCGGTACG GCTACCTTCG CGTCGGGGTG ACCACGGTGG CCCCGCCGTT CGGGTCGATG AACTGGCGCA CCATGGAGGT GGAGGGCTTC GACCCGGCCA TCGCCGGCGA GATCGCCGGG GCCATTCTCG GCGACCCCGA ACTGGTGCAG TTCCGCGCGG TCGACACCCG GGACAGGGAG GCGCTGGTCG CCGACGGCAC CGTCGACATC GTCACCGGCA CGATGACGAT GACCTGCGCC CGCAAGGAGC GGGTCCGCTT CTCCGGGGTG TACTACGAGG CCGCGATGCG CATCCTGGTG CCGGCCGGCG CGGGCCTGCG CACCGTCGGT GACCTCGCTG GACGGCCGGT CTGCTCGTCG CAGGGAAGCA CCTCGTTCGA GAAGGTCGCG AACCTGGTGC GCGGCCCCGG CCGGCCGGTC GCGGTGAACC GGGTGGGGAT CGTGGACTGC CTGGCCGCGC TCCAGCGCGG CGAGGTGGAC GCGGTGGCCA CCGACGACAC GATCCTCGCC GGGATGCGCG CCGAGGACGC CACCGTCACC GTCCTCGGTC CCGAGGCGTT CGACGGCGTT CTGGGCCCGG CGGGCCGCGC CGGCCTCGAC GAGCCCTACG GTGTGGCGAT CGGCCGGACG GATCTCGCGC CGGGCCTCTC CCCGGCCGAG GCCGCGGCGA ACCGGGCCGC CGACGACGCC TTCGTCGCCT TCGTCAACCG GGTGCTGCTC GACATGATGA CCGGCCCGAC CTGGAACAGG CTCTACACCC GCTACCTGCG CGACGTCCTG CGGGTCCCGG GCGTGCCCCC GAACGCCATC CAGCCGAACT GGCCGGACGG CGTGCTCGTC ACGGGCGGCG GATCGTGA
|
Protein sequence | MNRRARVVAV LVALGIGLGL AALPGCATTG SPLPRQARPL PPADGVAAPA RPEAAAGSAG PPSTPGQPGP SAQPCIARAS VPPLAALPPP GARGSRIEAI RRYGYLRVGV TTVAPPFGSM NWRTMEVEGF DPAIAGEIAG AILGDPELVQ FRAVDTRDRE ALVADGTVDI VTGTMTMTCA RKERVRFSGV YYEAAMRILV PAGAGLRTVG DLAGRPVCSS QGSTSFEKVA NLVRGPGRPV AVNRVGIVDC LAALQRGEVD AVATDDTILA GMRAEDATVT VLGPEAFDGV LGPAGRAGLD EPYGVAIGRT DLAPGLSPAE AAANRAADDA FVAFVNRVLL DMMTGPTWNR LYTRYLRDVL RVPGVPPNAI QPNWPDGVLV TGGGS
|
| |
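The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch using only the Python standard library, not the database's own tooling: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is the standard amino acid assignment that NCBI translation table 11 also uses, and only GTG and TTG are handled as alternative start codons (a subset of the starts permitted by table 11).

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); table 11 uses the
# same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these alternative bacterial start codons are handled here.
ALT_STARTS = {"GTG", "TTG"}


def clean(seq):
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # an alternative start codon still initiates with Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
head = "GTGAACAGAC GCGCCCGTGT CGTCGCCGTT"
print(translate(head))  # MNRRARVVAV
```

The output matches the first ten residues of the protein sequence, which shows why the protein begins with M although the gene begins with GTG. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.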