Gene Information |
Locus tag | Franean1_1332 |
Symbol | |
ID | 5669743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1601904 |
End bp | 1605104 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240263 |
Product | lantibiotic dehydratase domain-containing protein |
Protein accession | YP_001505690 |
Protein GI | 158313182 |
COG category | |
COG ID | |
TIGRFAM ID | |
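The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1601904
end_bp = 1605104

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is assumed to be the stop
# codon, which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 3201 0 1066
```

Under these assumptions the computed values agree with the "Gene Length" (3201 bp) and "Protein Length" (1066 aa) rows.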
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.591859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACATCC GGGGCGGCCC GCTGGCCTGT GCGCCCCTCG ACGCCGCGCT TTTCCGTGCT GCCGTTATGT CGGACGAGTC AGCGAAGGCC CTGGCCGCAG GTTCCTGGCC GGGCGCGAAC TGCGACGACG TGGACCGCTG GTGCACCTGG CTGGCCGACG TCTGGGCGCA GCCGGCCGTC GCAGAGGCGA TCGATACCGC GAGTCCGGCG CTGGCTGTCG GCGTGGAGGA CGCGTGCCGG GGATCCAGAC CGTCCGTGCG GCGGGTCCGG CGGCTGGGGT TGGCGGTGGC GCGCTATCTG GTGCGGATGA GCCGACGTGC AACGCCATTC GGGTTGTTCG CCGGTGTGGC CCCACTGCGC TTCGACGTGG AGCCCGTACT GCGGTGGACT GGAAGTGATC GCCCCTACCC GCGTGTGGAC TCGGTGTGGC TGGCCGCGGT CGTCGCTCGG CTGGAGTCCA TGAGCAGTGT CCGTGCGGCG CTGACCGTCG TCGCCAACGA CTTGGCCATG GTGCGCGGCG AGCGGCTTGT CGTTCCGTGG CGCCCGCATG CACACCCGCC GACGGAGACC TTCCTTCCTG CCGAGGTCTC GGTCCGCCAC GGCCGACCGG TTCAGGCTGC CGTGGCGTCG GCAGCGGCCC CCGTCCGATT TGTAGATCTT GTCGATTCGG TGGTGGTCGC GCTCGGTCTC TCTCGGCCTG GCGTTGAGGA GATGCTGGCC CAACTGCTGA CGTGCGGAGT GCTCGTGAGC AGCCTGCGGC CACCTGCGAC GTGCACGGAC GGCGTTGCCC ACCTTCTCGA TGCGCTGAAC GGCATCGAGA ACACGCGCGT ACCGGCCGCC ACCGACCGGA CTGACGACAG CACCAGCGAT CCCTCACACG GCCTTGTCGG TGATCTTCGG ATGATCCAGG CGGAGCTAGG GGCGGCACGC GATGCGGACA GCTCGTTGGG ACGTCCACGT CTGGCCGCCC TTCGTGAGCG GATGCGGGCG GTATGCGCGG TGGTCGAGCA GCCGCTCGCG GTCGACCTGC GGTTGGACTG CACGGCGGTG TTGCCGGAAC TCGTGGCGGT CGAGGCCGCT GCAACGGCGG GAGCACTGTT GCGACTGACC CCGCACCCGG CTGGCAACCC GTCCTGGCGG GACTACCACC ACAGGTTCCT TGCCAAGTTC GGTGAATCGG CGCTGGTCCG AGTCGATCAG CTGGTTGACC CGGTGATCGG CCTGGGCTAC CCGGCGCACT TCGGCGTCGC GGAACCACCG CCTACCGGCG GTGTATCGCC TCGGGACGAG TACTTGCTGC AGCTCGCCCA GCGCGCGGCG TTCGACGGCC AGCAGGAGGT CGTCCTGGAC GAGGCCGCTC TTGACACCAT GTCGGCGCAG ACGGCTGGGT CCTGGCCAGA GCCGCATGCG GAGATATGCG TCGAGGTGCT GGCACCGTCG ATGGCGGCCC TGTCCCAGGG CGCGTTCACC CTGCTGGTCA CCGGTGTCAG CCGCACCGGG GCAGCGATGA GTGGCCGCTT CCTCGATCTG CTCCCCGAGG CTGACCGGCT CAGGATGATC GATGTCTTCC GTCGGTTACC GCCGACGGCG GAAGGCGCGG TGCCGGTGCA GCTGTCGTTC CCGCCAGCGC ATGCCCGGCT GGAGAACGTC ACCCGCACCC CCCACGTCCT GGCGGACTGG GTCTCACTCG CCGAGCACCG CGACAGCCAG CGCGGCAGGC TGCCGTGGAC GGACCTGGCG GTCACCGCGA ATGACGAACA GCTCCACCTG GTTTCGCTGT CCCGCGGTGC CGTCGTCGAG 
CCGTTGTTGA CGAATGCCGC CGCCCGGCAG ACCTTCCCGC CGCTCGCGCG GCTGTTGCAC GAGCTGCCCC GTGCCAGCAG CGGGGCCGTA GCCCCGTTCT CGTGGGGCGC GGCGAGCTGT CTGCCGTTCC TGCCGAGGGT TCGTCATGGC CGCGCGGTTC TGGCGCCGGC CCGATGGCGA ATCAACCCAT CCGATCTTCC GGGTCCAGGC GCGGGAGACC AGGAATGGGC CGACGCGCTC GCACGGTTGC GGGAGCGGTC AGGACTGCCC CGCTGGGTGA GCACCGGCCG GGCGGACGTA CGGCTGCGCC TGGATCTCGA CGAGCGGATG GATCGCGATC TGCTGCGTGC CGAACTCGAC CGCGCCGGCG CTCTGACCGC CCTCGAGGCT CCGGGCCCGG ACGACTATGG CTGGGCCGAC GGCCGAGCCC ACGAGATCGT CGTCCCGGTG GCGACCACCG CCGCCCCGCG ACCCGCGCCG GCCATCGTGA CGACGCCCGG CCCGATGCCA GTGAGCGACG CGACCTCCGG GATCCTGCCG GGATCGAGGG TGCTCTTCGC GAAGCTGTAC TGCCATCCGG ACGTCGTCTC CACCATCCTC ACCGACCATC TGCCCGCGCT GCTCGACGCC TGGGGCGTCC CTCCGCAGTG GTGGTTCATC CGATACCGCG ACCCCGCCTC ACACCTACGC CTGCGACTGC ACGTCGCGCC CGACGCCGAC GGCGTCACCG ACAGCTACGG ACGAGCGGCA GCTCGGGTCG GTGCATGGGC GGAGCAGCTG CGGGCACGAC GTCTCATCGG CGATCTCGTG CTCGACACCT ACCACCCCGA GACAGCCCGC TACGGCGACG GCGAAGCACT CGTCGCAGCC GAGCACCTGT TCGCCGCCGA CTCGGCCGCC GTCGTCATCC AGTTGGCCGC CCAGAACGCG AACCGGGCAC TTCACCCTGC GGCGTTGACC GCAGTGAGCA TGGTCGACCT GACGTGTGCG TTGCTCGGCG GCAGACAGAC GGGGATGCGG TGGCTGCTGG AAAACCGGCG GGCCTCCGGA GCACCCACCC AGCGGCCGGT GCTGCGCCAG GCCATCGCGC TGTCCCGCAC ATGTGATGAC GAGCCCGCGC CCAACGCCGC GATCCCGCAG CTCCCGGCGC CGTTACGGGC CGTGTGGGGT GTGCGGCGCC GGGCGGCCGA GCGGTATGCG GCCCAGCTCG CCGCGTTGCC GGGCCTGCCC GCCAGCAGCG AGGTACTGGG ATCACTGCTG CACCTGCACT ACGTCCGCTC TCATGGCATC GACTCGTCAG CGGAGCGCAC CTGCCACCGT CTCGCCCGCG CGGTCGCGCT CGCCTGGCAC AACACCGGCC AGCACCCCCC TCTCGACCTC GCCAGAGCGG AAGGCCCATG A
Protein sequence | MDIRGGPLAC APLDAALFRA AVMSDESAKA LAAGSWPGAN CDDVDRWCTW LADVWAQPAV AEAIDTASPA LAVGVEDACR GSRPSVRRVR RLGLAVARYL VRMSRRATPF GLFAGVAPLR FDVEPVLRWT GSDRPYPRVD SVWLAAVVAR LESMSSVRAA LTVVANDLAM VRGERLVVPW RPHAHPPTET FLPAEVSVRH GRPVQAAVAS AAAPVRFVDL VDSVVVALGL SRPGVEEMLA QLLTCGVLVS SLRPPATCTD GVAHLLDALN GIENTRVPAA TDRTDDSTSD PSHGLVGDLR MIQAELGAAR DADSSLGRPR LAALRERMRA VCAVVEQPLA VDLRLDCTAV LPELVAVEAA ATAGALLRLT PHPAGNPSWR DYHHRFLAKF GESALVRVDQ LVDPVIGLGY PAHFGVAEPP PTGGVSPRDE YLLQLAQRAA FDGQQEVVLD EAALDTMSAQ TAGSWPEPHA EICVEVLAPS MAALSQGAFT LLVTGVSRTG AAMSGRFLDL LPEADRLRMI DVFRRLPPTA EGAVPVQLSF PPAHARLENV TRTPHVLADW VSLAEHRDSQ RGRLPWTDLA VTANDEQLHL VSLSRGAVVE PLLTNAAARQ TFPPLARLLH ELPRASSGAV APFSWGAASC LPFLPRVRHG RAVLAPARWR INPSDLPGPG AGDQEWADAL ARLRERSGLP RWVSTGRADV RLRLDLDERM DRDLLRAELD RAGALTALEA PGPDDYGWAD GRAHEIVVPV ATTAAPRPAP AIVTTPGPMP VSDATSGILP GSRVLFAKLY CHPDVVSTIL TDHLPALLDA WGVPPQWWFI RYRDPASHLR LRLHVAPDAD GVTDSYGRAA ARVGAWAEQL RARRLIGDLV LDTYHPETAR YGDGEALVAA EHLFAADSAA VVIQLAAQNA NRALHPAALT AVSMVDLTCA LLGGRQTGMR WLLENRRASG APTQRPVLRQ AIALSRTCDD EPAPNAAIPQ LPAPLRAVWG VRRRAAERYA AQLAALPGLP ASSEVLGSLL HLHYVRSHGI DSSAERTCHR LARAVALAWH NTGQHPPLDL ARAEGP
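The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial code named in the record), GTG can serve as an initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this on the first two 10-base blocks only. The codon table is deliberately partial (just the codons present in that prefix) and the names are hypothetical, so it is not a general translator.

```python
# First two 10-base blocks of the gene sequence, copied from the record.
prefix = "GTGGACATCC GGGGCGGCCC".replace(" ", "")

# Partial codon table: only the codons that occur in this prefix.
CODON_TO_AA = {"GTG": "V", "GAC": "D", "ATC": "I", "CGG": "R", "GGC": "G"}

# Drop trailing bases that do not form a complete codon.
usable = len(prefix) - len(prefix) % 3
codons = [prefix[i:i + 3] for i in range(0, usable, 3)]

peptide = "".join(CODON_TO_AA[c] for c in codons)

# Under translation table 11, GTG used as the initiation codon is read as Met.
if codons[0] == "GTG":
    peptide = "M" + peptide[1:]

print(peptide)  # MDIRGG
```

The result matches the first six residues of the listed protein sequence (MDIRGGPLAC ...).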