Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2138 |
Symbol | |
ID | 5670538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2566594 |
End bp | 2567565 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241059 |
Product | sortase family protein |
Protein accession | YP_001506480 |
Protein GI | 158313972 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3764] Sortase (surface protein transpeptidase) |
TIGRFAM ID | [TIGR01076] LPXTG-site transpeptidase (sortase) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGCGA CGAATCGCCG GCACCTGGCG GATTCGCTGA CGGATCTGCC CGTCCAGCCC GGAGCCGGAC GTCCTCCGGC CCCGGCGTCG CCGACCGCGG ATCCGCACCT GGGGGAGCCG GCGCATCGGG GCGCGCGGGC GGCACCCGGA GGGCCTGGCC GGAGCCGGGG CGCTCGGCGG CCCGGGCTGG TTCGCCACAC CCCCTGGCCA CGGCGGGTCC AGCCCGATCG AGTGGACGGC CGGAGACGCC GGCGTAACCC GGTGGCGGGG CTGGCGGATC GGCCGGTTGG CGGTCGGGTC TCCCGGGGCC TCGGCGAGGT CATGATCACA GCGGGTCTGG TGGTGGTGCT CTTCCTCGCC TACCAGCTGT GGATCACCGA CATCTTCGCG GCCAGGACGC AGGACCGGCT CCGAAACGAC CTGACCACGG CGTGGTCTCG ACAACCGCAT CCTCGGGCTC CCGCCGAGGC TGCGAAACCA CGACCGGTCG TGCCGCCCGT CGAACTGGGC GAGGGAGTCG CTGTCCTGCG GGTCCCCCGC TTCGGTGCCG ACTACGCACC CGTGGTTGTG GAAGGTGTGT CGGTGGCGGC GCTACGCCGC GGACCTGGGC ATTTCCCGGG CACTGCCATG CCGGGCGACG TAGGGAACTT CGTCGTGTCC GGTCACCGCA CCACGTATGG AAAGCCGTTC AGCCGGCTGG ACGAGCTGAG AGTGGGCGAT CCGCTCGTGG TGGAGGTGGC AGACCGGTAT TTCACCTACC GGGTCACCGG CTCGGAGGTC GTGGACCCCC ATCGGCTGGA CGTGACCTAC CCGGTTCCAG GGCACGCCGG AGTCGCTCCC ACCAGGGCGT TGATGACACT GACCACCTGC CATCCACGAT TCTCGGCGCG GAGTCGACTC ATCGTCTTTG CCAACCTCGA CGAGACCACG GACAAGTCCG ACGGACCACC TCGCGCGCTC GCGGACGAAT AG
|
Protein sequence | MPATNRRHLA DSLTDLPVQP GAGRPPAPAS PTADPHLGEP AHRGARAAPG GPGRSRGARR PGLVRHTPWP RRVQPDRVDG RRRRRNPVAG LADRPVGGRV SRGLGEVMIT AGLVVVLFLA YQLWITDIFA ARTQDRLRND LTTAWSRQPH PRAPAEAAKP RPVVPPVELG EGVAVLRVPR FGADYAPVVV EGVSVAALRR GPGHFPGTAM PGDVGNFVVS GHRTTYGKPF SRLDELRVGD PLVVEVADRY FTYRVTGSEV VDPHRLDVTY PVPGHAGVAP TRALMTLTTC HPRFSARSRL IVFANLDETT DKSDGPPRAL ADE
|
| |