Gene Information |
Locus tag | Franean1_5736 |
Symbol | |
ID | 5674062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6970688 |
End bp | 6971668 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641244589 |
Product | NLP/P60 protein |
Protein accession | YP_001509992 |
Protein GI | 158317484 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
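The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 6970688
END_BP = 6971668


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(START_BP, END_BP)
    print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the reported Gene Length (981 bp) and Protein Length (326 aa).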
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGGGA TACCGCTGCG TGTGTCGGCG TCGCAGCGTC GCGTAGCGAC TCTTGCCCTC ATGACGACCG GCGGCTTCGC CGCCGCGTCG ACTGGCTTCG CGCTCGCTAC CACCGGCGAA GCTGCCTCCG CGGCATCACC GGTGCCCACC ATCCGCACGG CTGGCATCCT CCCTGCGGGC GCCCAGCCGG CTGCAGCGGA CGGTGAGGAA GCGGCGCTGG CCGCCGCCAT CGCTGATCCG TCGGAGAACT TCACCGGTCT CGCATTCGCT CCCGACCGGG CGACCGTCGC GCCCAACGGC GAAGTGATGT TCACCGTCCG TGCGACCAGG GCCGACGGCG CACCACTAAT CGGGTCCGCC GTCCGCATCG TGTCCGTCAA CGGACCGAAA TGGACCACCA CGGCCACACT CCGAACGGAC GCCGCAGGCG AGGCGCGGAT ATCTACTCGA CTGCTATCCA CGACTACCCT CACTGCCGTG TTTGACGGCT CCGGCGCGCT GCGTCCCTCA ATGGCCGGAA CAGCGACGGT CACCGTCCAG GCTCCACCCG CGGCAGCGAG GTCCGCCCGC ACCGGTGGAG GGCAGCCAGG TGCCGTACCC ACACCGATCT ACGGCTCGGT CCCGGCAAGC GAGATCGGCG CGAAGGCCGT TTATCTTGCG TCCCTGCAGA ACGGCAAACC GTACGTCTAC GGCGCCGCAG GACCTTACGC CTTCGACTGC TCGGGATACG CCCAGTACGT CTACCGGCAG CTTGGACGGA ACCTGCCGCG TACCGCCCAG CAGCAGTTCC AGGCCACGAT ACGTATACCG AAATCCGGGA AGCAGCCCGG GGACCTCATC TTCTTCGGCA CTCCATCAAA CATCACCCAC ATGGGCATCT ATGCCGGAAA CGGCTATATG TGGGCTGCTC CAAGGACCGG AAGCAACGTC AAGCTGCAGC CCATCTACAG CTCCACCTAC TATGTAGGCA GAGTTCGGTA A
|
Protein sequence | MGGIPLRVSA SQRRVATLAL MTTGGFAAAS TGFALATTGE AASAASPVPT IRTAGILPAG AQPAAADGEE AALAAAIADP SENFTGLAFA PDRATVAPNG EVMFTVRATR ADGAPLIGSA VRIVSVNGPK WTTTATLRTD AAGEARISTR LLSTTTLTAV FDGSGALRPS MAGTATVTVQ APPAAARSAR TGGGQPGAVP TPIYGSVPAS EIGAKAVYLA SLQNGKPYVY GAAGPYAFDC SGYAQYVYRQ LGRNLPRTAQ QQFQATIRIP KSGKQPGDLI FFGTPSNITH MGIYAGNGYM WAAPRTGSNV KLQPIYSSTY YVGRVR
|
| |
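The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a sketch under stated assumptions: the sequence is given in the coding orientation (as displayed above), whitespace between the 10-base groups is ignored, and the codon assignments of translation table 11 are taken to match the standard code for elongation (table 11 differs mainly in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical.

```python
from itertools import product

# Compact codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 21 bases of the gene sequence shown above.
    print(translate("ATGGGTGGGA TACCGCTGCG T"))
```

Passing the full Gene sequence field to `translate` and `gc_percent` should allow comparison against the Protein sequence and GC content fields of this record.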