Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0056 |
Symbol | |
ID | 5668482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 69424 |
End bp | 72531 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641238985 |
Product | lantibiotic dehydratase domain-containing protein |
Protein accession | YP_001504430 |
Protein GI | 158311922 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATGC CCCCGCGCTA CCAGCACACC GGCTTCATCC AGGTCCGCGC CTCGACGGAC CCGGGCGGCC TCGACCTGCC GACTGATCTC GATCTGTCCG ACTCCGCTGC GGTCGAGCAG GAAGGCCGGG CATGGCTGGC CAAGACCTGG GCTCGCCCGG AGGTACGGGA GGCGCTGCGG CTTGCCAGCG CCGATCTCTG CGCCCGCATC GAGCAGATCC TTGACGACGG CGCCGGGCGG CGTTCCGGGG CGGAGATCCA GCGTGCCCTC ACCGCGGTGG CGTCCTACCT GCTGCGGTGG CAGCGGCGCG CGACGCCGTT CGGAATGTTC GCCGGCATCA GCACCGCGAC CGCCGGGCCG GCAACCGCGG AGGTGGGACG CGCACATCAA GCCGTGGCAC GCGCGGACGC CGAGTGGGTC ACGGCGCTCG CCGATCAACT CGAACGACAT CCGGAGCTGC GTGGGCGGTT GACGGTGGTG ACCGACAGTG CGCGGATCGT GCGCGACGGT CGGGTCATCG TGCACCGGCG GGCCGAGGTG GGCGCCCCGA GCCCGGGGCC GCTGCGGGAG TCGTCGGTAC GGCTGACCCG GCCGGTCTTC TTCGCCTTGG CGGAGGCCGG CAACCCGATC CGGCTGGACG CGCTCGTCGC GCGGACGGCG ACCAGGTTCC CCACCGCCTC CCCGGACAAG ATCCGCGCGC TGGTGCATGG CCTGGTCGAC GGCGGCCTCC TGATCACGAG CCTGCGACCA CCGACGACAG GCGTCGATGC CCTCACCCAC CTGGCCGACG CACTACGTGC CGCAGGCCTC GAGGGCCTGG CTGACGTGAG TGACCTGCTG CGGGAAATCG ACTGCCTCCG CGCCATGCTG GCCGCCCACA ACCGCATCAC TGATCCGCAG CCGGCGGCGC AGACGCGCCG GACGGTCGCC GCGCGGATGG CCTCGCTTGC ACCGGGAGCA GGTCAGATGC TGGCGGTCGA CACGCGGCTC GACGCGAACA TCGCCGTCCC CGGGCGGGTC CTCACCGAGG CAGCGCTGGC GGCGAGCGTG CTGCTGCGAC TGTCCCGCCA GCCCTTCGGC TCGACAGCCT GGCTGGACTA TCACGCCCGG TTCCGGGCCC GCTACGGCCC CGGTGCCCTG GTGCCGGTGC GGGAGCTGGT CGCCGAATCG GGGCTGGGCT ATCCCGGCGG GTACCTCGGT GCGCCGCGAG CGAGACCTGC CTGGCGGATG CTGACCGACC GTGACACCGC GCTCCTGGCG CTGATCCAGC GGGCCGCCGT GGACGGGGCC GAGGAGATCG CGTTGACCGA CGCGGACGTC GAGGAACTGG CTGTCGGGGA GCGCGGGGAC GTCGTGCCGC CGCAGCGGGT CGAGCTCGGC GTCGAGGTGT GGGCGGTCTC GACTCGCGCG ATCGACCGCG GGGACTTCCG GCTGCGGGTC ACCGCCGCCC CGCGGACCGG TACCAGCATG GCCGGACGGT TCGCCTACCT GCTCGGCCCA GCCGACCGCG ACCGGCTGGC CGCGACGTAC GCGGCCAGTG CTGAGGCTGG GGAGCGGGAC CAGCCCTTGG CGGCACAGCT GTCCTTCCCG CCGCGTCGCC CGCACAACGA GAACGTGGTG CGCGTCGAAC CGCTGTTGCC GGCCGTCATC TCTCTCTCGG AGCATCCTGA CCCCGCCCAC GCCAAGGTCA GGTTGATCGA TCTGGATGAC CTGGTGGTTA CCGCCGACGC CGCGCAGATG TACCTGGTGC AGCGCTCGAC CGGCCGGCGG GTGATCCCTC GGATTCCGCA CGCGCTGGAC ACCAGCGTGC AGAGCCCACC GCTGGCCCGT TTCCTCGCCG AGGTCGCCGA CGCCCGTACC GCCGTGTTCG GCGGGCTCGA TCTCGGCGCG GCCCGTGTCC TGTCCTACAT CCCGCGCATC CGCTACGGCC GCACGGTCCT GGCCGTCGCC CGCTGGACGC TCACCTCCAC CGACCTGCCC CGCGGCCAGT CCGGCGACGG AAGGCAGGAA GCGCTGTGCG CCTGGCGGCA GCGGTGGCGT GTGCCCGCAC GGGTCGTGCT CTGCCACAGC GAGCTTCGCC TACCCCTCGA CCTGGATCGC CACCTGGACC GGGTGCTGTT CCTGACCCGC CTGGAGCGAG CCGGCCGCAT CGAGGTGCAG GAAGACGGTC CGCCCGACGC GCAGGGCTGG ATCGGGCGTC CGGCCGAGCT GCTGATCCCG TTGACCGCGA TCAGCCCACC GGCACGGCCA CTGCCCGTGA CGGCAGCGCC GGGGGCGGTG CTTCGTCCGG GTGACGCGGC CGTTCTACAC GCGCAGCTTG TCGGAAACCC GGCCCGGTTC GACGACATCC TCACCGGCCA TCTGCCGAGG CTCGCCGACA GCCTGGAAGG GCTGGTCTGT CTCTGGTGGG TCCGACGACA CCGCGACATG ATCCTTCCCG AGAGTGATCA GCACCTCGCC GTGTTCCTAC GCCTGCGCAG CACAGACCAC TACGGGCCGG TTGCCGCCGC GGTGGCCTCC TTCGCGGCCG ATCTGGAGAC TCGCGGCCTT CCCGGTCAGC TCACCCTGGC TTCCTCCCCG CAGCACCCCG CCCGCTACGG CGACGGTGAC GCGCTGGCCG CCGCCGAGTC GGTGTTCGCC GCCGACACCC TCTGCGCCAC CGCCCAGATC ACGGCGGCCC AGACGTCCGG GATCTCCGCG CAGGCCCTGG CCGCAGCATC GATGGTGGAC CTGGCCGCGG CCTTCGCCCC CGACCCGCTG GCCGGGTACC AGGCACTCGT CGGATGCCTT CGGCAGGAGC ACGGCGCCCT GGACCGCACC CTACGGGACC AAGCTCTCGA CCTGGCCGAT CCCGGCGGTA ACCACCAGGC GGTTCGGGCC CTGACCGGCG GTGAGGCAGT CGTCAGCGTC TGGAACGCCC GAGCCATCGT GCTGGCCTCT TACTACCAGG CCCTCGCGCG GCAGCGTGAT CCGCGGACAG TGCTGCGGAC CCTCCTGCAC GAGCACCATG TGCGCGCTGT CGGTGTGGAT CCGACCTTGG AGAAGGAGAC CGGGCGCCTC GCTCGCGCTT CCGCACTGCG CCACCTCGCG CTGGCCGGTG CCCGATGA
|
Protein sequence | MPMPPRYQHT GFIQVRASTD PGGLDLPTDL DLSDSAAVEQ EGRAWLAKTW ARPEVREALR LASADLCARI EQILDDGAGR RSGAEIQRAL TAVASYLLRW QRRATPFGMF AGISTATAGP ATAEVGRAHQ AVARADAEWV TALADQLERH PELRGRLTVV TDSARIVRDG RVIVHRRAEV GAPSPGPLRE SSVRLTRPVF FALAEAGNPI RLDALVARTA TRFPTASPDK IRALVHGLVD GGLLITSLRP PTTGVDALTH LADALRAAGL EGLADVSDLL REIDCLRAML AAHNRITDPQ PAAQTRRTVA ARMASLAPGA GQMLAVDTRL DANIAVPGRV LTEAALAASV LLRLSRQPFG STAWLDYHAR FRARYGPGAL VPVRELVAES GLGYPGGYLG APRARPAWRM LTDRDTALLA LIQRAAVDGA EEIALTDADV EELAVGERGD VVPPQRVELG VEVWAVSTRA IDRGDFRLRV TAAPRTGTSM AGRFAYLLGP ADRDRLAATY AASAEAGERD QPLAAQLSFP PRRPHNENVV RVEPLLPAVI SLSEHPDPAH AKVRLIDLDD LVVTADAAQM YLVQRSTGRR VIPRIPHALD TSVQSPPLAR FLAEVADART AVFGGLDLGA ARVLSYIPRI RYGRTVLAVA RWTLTSTDLP RGQSGDGRQE ALCAWRQRWR VPARVVLCHS ELRLPLDLDR HLDRVLFLTR LERAGRIEVQ EDGPPDAQGW IGRPAELLIP LTAISPPARP LPVTAAPGAV LRPGDAAVLH AQLVGNPARF DDILTGHLPR LADSLEGLVC LWWVRRHRDM ILPESDQHLA VFLRLRSTDH YGPVAAAVAS FAADLETRGL PGQLTLASSP QHPARYGDGD ALAAAESVFA ADTLCATAQI TAAQTSGISA QALAAASMVD LAAAFAPDPL AGYQALVGCL RQEHGALDRT LRDQALDLAD PGGNHQAVRA LTGGEAVVSV WNARAIVLAS YYQALARQRD PRTVLRTLLH EHHVRAVGVD PTLEKETGRL ARASALRHLA LAGAR
|
| |