Gene Information |
Locus tag | Franean1_4942 |
Symbol | |
ID | 5673281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5932290 |
End bp | 5933315 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243796 |
Product | extracellular solute-binding protein |
Protein accession | YP_001509212 |
Protein GI | 158316704 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
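The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5932290, 5933315)
print(length)                     # 1026
print(protein_length_aa(length))  # 341
```

Both results match the Gene Length (1026 bp) and Protein Length (341 aa) rows.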
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.997084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAGGG TCGGACGCGG GCCAGGCCAC CGAAGCCGGC GGATCACATC GAGGATCACA CTGAGAACCA CGTTCAGAGC AATGATCTTC GCCGCCGTAC CGCTGCTCGG CACCACCGCG GCCTGCGGCA CCGTGACGAG CGCGCTGCCG GCGGCCGGCG ACCCCCTTCC GGCCGCTGTC ACCCGGCAGT CGCCGCTGCC GGCACGCTCC ACCGCGTCCC CGGGGTGCGG TGACCGCACC GCCAGCCTGC GCCCGCCGGT CACGCTCCCC CCACCCGGCC AGCCTGACGG CCCCACCCTG CGCAAGATCG CCGACCGCGG CTACCTCACG GTGGGGGTCC GTCCCGACAC GCCGCCGTTC GGGTCGCGTA ATCCGGACAC CGGGGAGTTC GAGGGGTTCG ACGTCGACCT GGCCCTGCTG GTGGGCCGGG CGCTGTTCGG CGCGGACGGC CATGTCCGGT TCCGGGCGGT GACCGGCGCC GACCGGGTCG CTCTCGTCCG CGACGGCACC CTCGACCTGG TCGCCGCAAC GCTGACCATC ACCTGCGACC GGGCGGACGA GGTCGACTTC TCCGCGCCCT ACTACCTGAC GGCCAAAGCC GTTCTGGTGC TGGAGGACGC GCCCTACCAG GGCCTCGCGG ACCTCGGCGG TCGGCGGGTG TGCGCGGCGG CCGGGACCAC CTCGCTGCAG CAGGTTGTAG ACGCGCCGTC GCGGCCGATC CCCATCCAGC TTGCCAGCCC GGCCGACTGC CTGGTCGCGA TGCAGGCGGG AACGGTCGAG GCGATCGTGA ACGACGAGGC CGTCCTGGTC GGGATGGTCG AGCAGGATCC GGAGACCCGC ATCGTCGGTG TCGGCGCCTT CGACGTGGCC ATGGGCATCG CCGTCGCCAA GGACGCGCCG GACCTCACCC GGTTCGTGAA CGGCGTGCTG GAGCAGGCCG AACGCGACGG AACCTGGACG GTGATCCACC GACGATGGCT GGACGGGGTC CGCCAGCCGC CGCCGGCCCC CGTCTACCGG GACTGA
|
Protein sequence | MRRVGRGPGH RSRRITSRIT LRTTFRAMIF AAVPLLGTTA ACGTVTSALP AAGDPLPAAV TRQSPLPARS TASPGCGDRT ASLRPPVTLP PPGQPDGPTL RKIADRGYLT VGVRPDTPPF GSRNPDTGEF EGFDVDLALL VGRALFGADG HVRFRAVTGA DRVALVRDGT LDLVAATLTI TCDRADEVDF SAPYYLTAKA VLVLEDAPYQ GLADLGGRRV CAAAGTTSLQ QVVDAPSRPI PIQLASPADC LVAMQAGTVE AIVNDEAVLV GMVEQDPETR IVGVGAFDVA MGIAVAKDAP DLTRFVNGVL EQAERDGTWT VIHRRWLDGV RQPPPAPVYR D
|
| |
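The Sequence rows can be related to the Translation table and GC content rows with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted as shown (blocks of ten bases separated by spaces), and it uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record above.
fragment = "ATGCGGAGGG TCGGACGCGG GCCAGGCCAC"
print(translate(fragment))  # MRRVGRGPGH
```

The translated fragment matches the first ten residues of the Protein sequence row. Passing the full Gene sequence value to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content row.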