Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5004 |
Symbol | |
ID | 5673343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6001097 |
End bp | 6002140 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243858 |
Product | NLP/P60 protein |
Protein accession | YP_001509274 |
Protein GI | 158316766 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.549544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCCGTC GTGGAATTGC CTGGCTGCCC GAGGACTGGG AGCAGTTCGT CAAGGGGCGC CAGCGCCCCT CGAACGCCCG ATCGGGTACG GGCGCCCGGC GGCGCACCTC CCAGCCGCCT CGTCGGGCGG CGACGATCGC CGCGTTCACC ACGGGCACGC TCGCGGCCTC GACGGCGGCC TTCGCCGCCA CGGTGCCGGC CACCCCGGAC GGCACCGCGT CCATCGACCC GGGCGCGACC CAGACGGCCT CCGTGCTGCC GGGCCAGGGC GGGACGGACG CGCCGGACGC CGCGGCCGCC GCCGCGCCGC TGGCGGGGGC GATCGCGTCC TCCGAGCCGA TGTTCACCAA GGTCTCGCTG ACGGCGGACA ACACCACGGT CGCGCCGAAC GCCCCGGTGG TGCTGACCGT CCGGGCCAAT GAGGCTGGCG GCGCTCCGCT GGCCAACCAG CGGGTCCGCA TCGTCGTGGT GAACGGGCCG AAGTGGCAGA CCTCGACGGC GCTGACGACC GACACCAACG GTCAGGCCCA GATCACCGCG CGGCTGCTGA CCACCACGAC GATCACCGCC GTCTTCGACG GCACGAGCGC GCTGCGGCCC TCGCTGGCCG GCGCGGCCAC GATCACGATC AAGGCGCCGA CGGCGCCCCG CCCGGCCGTC AGCCAGGCCA TCCCGTCGGT GATCCCGGGC AGCTCGATCG GGGAGAAGGC CGTCTACCTG GCCTCGCTAC AGAAGGGCAA GCCGTACGTG TGGGGGGCGG AGGGCCCCTA CTCGTTCGAC TGCTCCGGCC TGGTCCAGTA CGTCTTCAAG CAGCTCGGAC GTTCCCTGCC GCGCACCGCG CAGGGCCAGT ACAACGTGTC GACCCACGTG CCGCAGTCCG GTAAGCGCCC CGGCGACCTG ATCTTCTACG GGACCGAGGG CAACATCTAC CACGTGGGCA TCTACGCCGG GAACGGCTAC ATGTGGGCCG CGCCGCAGAC CGGCGGTGTG GTGTCGCTCC GCCCGATCTA CAGCTCCACC TACAAGGTCG GCCGGATCAT GTGA
|
Protein sequence | MPRRGIAWLP EDWEQFVKGR QRPSNARSGT GARRRTSQPP RRAATIAAFT TGTLAASTAA FAATVPATPD GTASIDPGAT QTASVLPGQG GTDAPDAAAA AAPLAGAIAS SEPMFTKVSL TADNTTVAPN APVVLTVRAN EAGGAPLANQ RVRIVVVNGP KWQTSTALTT DTNGQAQITA RLLTTTTITA VFDGTSALRP SLAGAATITI KAPTAPRPAV SQAIPSVIPG SSIGEKAVYL ASLQKGKPYV WGAEGPYSFD CSGLVQYVFK QLGRSLPRTA QGQYNVSTHV PQSGKRPGDL IFYGTEGNIY HVGIYAGNGY MWAAPQTGGV VSLRPIYSST YKVGRIM
|
| |