Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4885 |
Symbol | |
ID | 5673225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5860221 |
End bp | 5861414 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243740 |
Product | peptidase M50 |
Protein accession | YP_001509156 |
Protein GI | 158316648 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.333233 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGACC AGCCCGGCGC GGCGCGGCAC GGCGTCCCCG GTGGAGCGGA GCCGCCGGAG CGGCGGCCGG CGCCGCGTCC GCCGGGTATG CGGGTCGGGC ACGTCCGCGG TGCCCCGATC TTCGTCTCAC CGCTGGCCGT GGTCTTCGCG GCGCTGGTCG CGTCCCTGCT CATCGACCCG ATCCGGGACC ATCTGACGAT CGCCGCGACC GACGGGCATG TGCTCGTACT GTCCCTGCTC GTCTCGGCGG GGTTCCTCCT GTCGCTGCTC GCGCACGAGA TCGGGCACGC GCTGACCGCG CAGCGGTTCG GGCTGCACGT CCGGTCGGTG ACCCTGCACG GCTTCGCCGG TTTCACCGAG TTCGAACCCG AGCCGGCCAC CGCCGCCCGC GAGTTCCTGA TCGCCTTCGT CGGGCCCGCC GTCAACGGCG TGATCGGGGG GCTGTGCTTC CTGGGCCTGC TGGCCGTGCC GGATCACGGC AGCGTCGGGA CAGTGCTCTT CTTCATCGGC TTCACCAACG CCGCGCTGTT CGTGTTCAAC CTGGCGCCGG GCCTGCCTCT CGACGGCGGG CGGGTCGTCG TCGCGGCGGT GTGGGCCATC GGGCACGACA AGCTGCGTGG GCTGCGGGCC GGCGCCTACG GCGGCTTCAT CGTGGCTGGT GCGCTGGTGG TCTGGGGCGC GACGAAGGAC TCCGACGGCT TCGGTGCGCT CTACGCCTAC GTCCTGGCCG GGTTCGTCGC CTTCGGCGCC TACCAGTCGC TGCGGTCGGC GAAGGTCCGC GAGCGGCTGC CCGGGCTGTC GGCGGGCCGG CTCGCCCGCC GCACGCTGCC GGTGGAGGCC GCGGTCCCGC TCGGCGAGGC GCTGCGCCGG GCCCAGGAGG TGGGCGCCAC GGCCGTCGCG GTGATCGACC GGGACGGTAC CCCCATCAAG ATCATGAATG GCGCCTCCGT CGACGCCCTG CCGGAGCACC GCCGGCCGTG GATGGCGGTG GACGAGGTCA GCCGGAGCAT CGGCCCGGGG ATGACCCTGC AGGCCGAGCT CGAGGGCGAG GACCTGCTGG CGGCCGTCCA GCGGGCCCCG GCCGCGGAGT ACCTCGTGGT CCAGCGCGGC CGGCCGGTCG GCGTGCTGGC CATGGTCGAT CTCGTGGCCA GGATCGACCC GGCGGCCGCG ACCCGTATGG TCTCCCATCG GTGA
|
Protein sequence | MEDQPGAARH GVPGGAEPPE RRPAPRPPGM RVGHVRGAPI FVSPLAVVFA ALVASLLIDP IRDHLTIAAT DGHVLVLSLL VSAGFLLSLL AHEIGHALTA QRFGLHVRSV TLHGFAGFTE FEPEPATAAR EFLIAFVGPA VNGVIGGLCF LGLLAVPDHG SVGTVLFFIG FTNAALFVFN LAPGLPLDGG RVVVAAVWAI GHDKLRGLRA GAYGGFIVAG ALVVWGATKD SDGFGALYAY VLAGFVAFGA YQSLRSAKVR ERLPGLSAGR LARRTLPVEA AVPLGEALRR AQEVGATAVA VIDRDGTPIK IMNGASVDAL PEHRRPWMAV DEVSRSIGPG MTLQAELEGE DLLAAVQRAP AAEYLVVQRG RPVGVLAMVD LVARIDPAAA TRMVSHR
|
| |