Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2799 |
Symbol | |
ID | 5671188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3311180 |
End bp | 3314245 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241708 |
Product | lantibiotic dehydratase domain-containing protein |
Protein accession | YP_001507128 |
Protein GI | 158314620 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGACG TGGAGGTGCC GCTGTACCGC CATGCTGGCG GTGCGATGCT GCGGGCGGCG GTCCTTCCGC TGTCCCAGCA GCCGGAGGAC TGGCCGACGT TGTCCGATCC GGACTCGTGC CGGTCGTGGC TACGCGCCGT ATGGGCGCTG CCCGGGTTTG CCGATGCGAT CCGCTACGCG AGCGGATCAT TCGCCGCTCA GGTCGAAGCT GTTCTCGACG ACCAGGCTGC GGGCGGCAAG CAGGTTCGAC GGGTGACGTT GGCAGTCGTC CGCTACCTGC TGCGTTCCCT GGGACGGCCG ACTCCGTTCG GCTGGTTCGC CGGTGTCGCC GGGGTCCGTA TCGGCGATGA CGTGCGTGCG CGCTGGGGAT CGGCCCACCG GCTGGTCTTG CGGGCGGACA CGCTGTGGTT GGACGACGTT GTCGAGCGGC TGGAGATGCT GCCCGACCTG CTCGCCCATC TGGACGTCAT GGCCTGCGAC CTGTTGGTCG AACGCGGCGA TCGGATCGAG ATGCCTCGCG GCCCGGGGCG GGTGACGGTC CGTAACACGG CGGTGATGCG TCTCGTCCGT AACCTGGCGG CCAAGCCGAT CCCGTTCCAC CTCCTGCTCG ACCAGGTCGC CCTCGCGTTT CCTGCCGCCC CTGCCACGGC TGTCCGCCGA GTGCTCGGTG ACCTGATCAC TCAAGGCATC CTGATCACGG GTCTTCGTGC GCCTATGACG GTGGCTGACC CGCTGACGCA TGTGATCGCG GTGGTCGAGG GTGCCATGAT CGGCGATGGC GAGGCTGTGG CGGCCGTGCT GGGTGATCTC CGCGCCGCGC AGCAGGTGAT CCGGACGCAC AACGACGACC ACGTCGAGCC GGGCACGCAG GCACGGCTTC GGGAGGAGGC ATCGGAACGG ATGCGTCGCC TGTCGCCGGC CGGCCGGACA TCGCTGGCCG GTGACCTGCA CCTCGACTGC GACATCGCTG TTCCCACCGA CCTCGCCGAC GAGATGGCCC ACACGGTGGG TGCCTTGCTG CGGCTCACGT GCCAGCCACG TCCGGACCAG GGGTGGAACG ACTGGTGCCG CGAGTTCTGG GACCGCTACG GCACGGGAGC ACTCGTCCCC GTCCTAGACG CGGTGCATCC CGATACGGGC ATCGGCTGGC CGGCTGGCTT CCCGGGCAGC ATGCTCGCCG AACCCGAGGA CACGGTCAGC CACCGCGACC AAGAGCTGCT GCGCCTCGCC TGGGACACGG TTACCGCCGG ACATCACGAA CTCGTGCTCA CCGACACGCT CATCGCCACG ATCACAGCGG ACAAGCCGGT GGACCCACGG TGGATTCCGC CGCACGTCGA GCTGGGAGCC CGTGTCCACG CCACCAGCGT CCAGGCACTG GCGGCGGGGG ACTACACGTT CGGCGTCCAT CCTGCGTGGG CGTTCGGCAC GCTCACCTGC CGGTTCGGCG CCGCCGTGGG CCGTGCCGGC CTGGACACCG TCTTCGCCGC CGCGCCTTCG GCTGTCGAGG GCGCGCTGCC AGTCCAGATG TCGTTCCCGC CACTGTTTCC GCACTCGGAG AACGTCTCCC GGGTGCCCCG CTGGCTGCCC CAGGTGCTGC CGGTCGGTGA GCACCGGGCG GACGAGTCGA CCGTGATCCG CCTGGATGAT CTCGGGATCG TCGCGGTGGC CGACGGGCTT CACCTGGTCA GCATCTCGCG GCGCCAGGTC CTGGAACCGC AGGTCTTCCA CGCGCTCGCG CTGCGCAAGC AGGCGCCGCC GCTGGCACGG TTCCTCGCCA CGCTGACCCG GGGCTTTCTC GCCCGTTTCA CCGAGTTCGA CTGGGGACCG CTGGCCACCG GCCTGCCGCA CACGCCTCGG GTGCGTTACC GGCGCGCGAT CCTGTCACCC GCGACGTGGC GGATCAACAC CGCGGACCAC ACCGCACTAC GCGCCGACGG CCACTCCTGG GAGGAGGCAT TCGCCCGGTG GCGGCAGCGA GGGGCCTGCC CGGACATCGT GGAGCTCCAT GATGACCACC GGTCGCTGCG CCTGGATCTC ACCGTGGACG CGCACCTGGC GATCCTGCGC GAGCACCTGG ACAAGCACGG CCGCGCGACT CTGACCGAGA CCGACTCGGT CGAGGACACG GGATGGATGG GCGGCCACAT CCACGAGATC GTCCTGCCGT TCGTCCGCGC GGTACCGGCG GCGCCGAACC TCGTCACCGG CGCCCTGCCG CTCGTGACCA GCGCGAACGC CGCACACCGA CCGGCCTCAC CGCACGGCTC CTGGCTCTAC ACGCAGGTCT TCACCCATCC CGAACGTCTC GACGACATCC TCCGCACGCA CCTGCCCCGG CTGCTTGACC TGCTCGACGG AGACCGGTCG TTTTGGTTCG CCCGTTACCG CAGCGTCCGT GAGACCGACC ATCTGCGGCT GCGGATCCGC ACGGCCGGCC AGGAGGAGTA CGCCGCGGTG GCCTGTGCGG TCGGCCAGTG GGGGCAGCAG CTCTGCGACG CGGGCGCGGC GTCCCGGCTG ACCCTGGCGA CCTACCATCC CGAGATCGGC CGTTACGGCA GCGGGGCCGC GATGGACGCC GCCGAAGCGG TGTTCGCGGC CGACTCGCAC GCGGTCGCGA CCGCACTCCA ACTCCCGACG CCACTCGCCG TCCATCCGAT GGCGCTCGTC GCCATCGGCA TGGTGGACAT CGCCGACGGT TTCCACTCCG ACCCCGTCCA CGCGAACACC TGGCTGCTGG AGCACCTCGC CGCCAAAGCC ACGCCCGGCG CGGATCGGAC CGTCACCGAG CAGGTCACCC GCTGGGCAGC CCGAAGGACC CTGCCGGGCG ATACATCCCT GCCCGCCGCC CTCGTGCAGA CGTGGCAGGC TCGCCGGGAA GCGTTGATCC GCTATCGGCT CGCGCTCCCC GGGAACGCCG ACGCCGACCA GGTTCTGTCC GCGCTCCTGC ACATGCATCA CAACCGCGCC AGACTCATCG ATCGTGCTGA CGAGGCCACC TGCCGGCGCC TGGCCCGGCA GATCGCCCTC ACCCGACGCG CACACACCAC GGCGGACAGC CCGTGA
|
Protein sequence | MVDVEVPLYR HAGGAMLRAA VLPLSQQPED WPTLSDPDSC RSWLRAVWAL PGFADAIRYA SGSFAAQVEA VLDDQAAGGK QVRRVTLAVV RYLLRSLGRP TPFGWFAGVA GVRIGDDVRA RWGSAHRLVL RADTLWLDDV VERLEMLPDL LAHLDVMACD LLVERGDRIE MPRGPGRVTV RNTAVMRLVR NLAAKPIPFH LLLDQVALAF PAAPATAVRR VLGDLITQGI LITGLRAPMT VADPLTHVIA VVEGAMIGDG EAVAAVLGDL RAAQQVIRTH NDDHVEPGTQ ARLREEASER MRRLSPAGRT SLAGDLHLDC DIAVPTDLAD EMAHTVGALL RLTCQPRPDQ GWNDWCREFW DRYGTGALVP VLDAVHPDTG IGWPAGFPGS MLAEPEDTVS HRDQELLRLA WDTVTAGHHE LVLTDTLIAT ITADKPVDPR WIPPHVELGA RVHATSVQAL AAGDYTFGVH PAWAFGTLTC RFGAAVGRAG LDTVFAAAPS AVEGALPVQM SFPPLFPHSE NVSRVPRWLP QVLPVGEHRA DESTVIRLDD LGIVAVADGL HLVSISRRQV LEPQVFHALA LRKQAPPLAR FLATLTRGFL ARFTEFDWGP LATGLPHTPR VRYRRAILSP ATWRINTADH TALRADGHSW EEAFARWRQR GACPDIVELH DDHRSLRLDL TVDAHLAILR EHLDKHGRAT LTETDSVEDT GWMGGHIHEI VLPFVRAVPA APNLVTGALP LVTSANAAHR PASPHGSWLY TQVFTHPERL DDILRTHLPR LLDLLDGDRS FWFARYRSVR ETDHLRLRIR TAGQEEYAAV ACAVGQWGQQ LCDAGAASRL TLATYHPEIG RYGSGAAMDA AEAVFAADSH AVATALQLPT PLAVHPMALV AIGMVDIADG FHSDPVHANT WLLEHLAAKA TPGADRTVTE QVTRWAARRT LPGDTSLPAA LVQTWQARRE ALIRYRLALP GNADADQVLS ALLHMHHNRA RLIDRADEAT CRRLARQIAL TRRAHTTADS P
|
| |