Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6528 |
Symbol | |
ID | 5674843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7938319 |
End bp | 7939497 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641245376 |
Product | NLP/P60 protein |
Protein accession | YP_001510771 |
Protein GI | 158318263 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.203343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGGG CAGCCCAGCC CGAACCCGAC AGCTCACCTC GCAGGCGTAC GGGAAGGAAC GCAACGTTGT CTGCGCAAAG TGTGCAGCTT GATGCTGAGA GACGCCGGCA GGAAGGGCGC GGTCGCCACC GTGCGCCGTC CGCCCCCACG GCGTCCAGCC GAGCCAGAGC CAGGGCCCGT ACCGTCGCGG CCATCACCAC CGGAACCGTC GCGGTCTCCG GGGTGGCGCT CGCCGGGTGC GCCCAGGATG TCAACTCGGA CGTCGCGCTG GACGACGGCA CGAACACCGC CCCGATGACG CTCGCGACCC AGATCGGGTC CCGGCTGGCG GTCGACGGAG CCATCCAGGC CGCCACCGCT ACCGGCGGCG GTTCGGGCAC CGTTCTCGAC GCCACCACGG ACATCTCCGC GCCGACTCTC TCGTCGAAGA TCGACGTCGG CCTGCGCGTG ACGAACCCCG AGGTCACGGT CAACGCGGAC GAGCCCGTCA ACATCGGCTT CTCGCTCTAC AACGAGGAGA CCCAGGCCCC GATCCCGGAC CAGCTCATCA AGGTCCAGGT CAAGCTGCCG ACCGGGTGGG CGACCTTCCT GCACCTGACG ACGGACGACC GCGGCTTCGC GTCGTACACC GCGAAGGTGC TCACCACCAC GAATGTCACG GCGATCTTCG ACGGCACTGA CGCCCTCCAG TCGGCGCACT CGGAGAACGA CGCCACGCTG CACGTCCGCC CGGCTCCGCC GCCGGTGCCC GCACAGGCCT CCCGCAGCGC GGACCGCACG GGCGTGAACG TCAACACCCC GGTGGTCAGC GTCAACCTGC CGACCAACAC CCTCGGTGAG AAGGCCGTCT ACCTGGCCTC GCTACAGGCC GGCAAGCCGT ACGTCTACGG CGCCGAGGGT CCCAACGCCT TCGACTGCTC CGGTCTCGTG CAGTACATCT ATCGGCAGCT CGGCAAGAGC CTGCCGCGTA CCACCGACCA GCAGTACGCG GCCACCACCC ATATCTCCCA GTACAACAAG GCGCCCGGCG ACCTGATCTT CTTCGGGAGC CCCGGAAACA TCTACCACAT GGGCATCTAC GCCGGCGACG GCAAGATGTG GGTCGCGCCC CGCACTGGTG ATGTCGTCAA GCTTCAGACG ATCTACACGA CCTCGTACTT GGTCGGCCGC GTCACCTGA
|
Protein sequence | MDRAAQPEPD SSPRRRTGRN ATLSAQSVQL DAERRRQEGR GRHRAPSAPT ASSRARARAR TVAAITTGTV AVSGVALAGC AQDVNSDVAL DDGTNTAPMT LATQIGSRLA VDGAIQAATA TGGGSGTVLD ATTDISAPTL SSKIDVGLRV TNPEVTVNAD EPVNIGFSLY NEETQAPIPD QLIKVQVKLP TGWATFLHLT TDDRGFASYT AKVLTTTNVT AIFDGTDALQ SAHSENDATL HVRPAPPPVP AQASRSADRT GVNVNTPVVS VNLPTNTLGE KAVYLASLQA GKPYVYGAEG PNAFDCSGLV QYIYRQLGKS LPRTTDQQYA ATTHISQYNK APGDLIFFGS PGNIYHMGIY AGDGKMWVAP RTGDVVKLQT IYTTSYLVGR VT
|
| |