Gene Information |
Locus tag | Francci3_2033 |
Symbol | |
ID | 3906750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2390630 |
End bp | 2393725 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637879370 |
Product | lantibiotic dehydratase-like |
Protein accession | YP_481136 |
Protein GI | 86740736 |
COG category | |
COG ID | |
TIGRFAM ID | |
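The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2390630
end_bp = 2393725
gene_length_bp = 3096
protein_length_aa = 1031


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # 3096 1031
```

Both computed values agree with the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.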
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.423471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.294126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTATCAGC ATCTCGACTC CCTGGTGGTG CGTGCCGCGG TGAACCCGCC CGTGGACGTG GCCGAGCGAT GGCCCGACCT GGTCGGACCG GCGGCCACAC CAGAGTCCTG GCGCCTCTGG TTACGGGAGG TGATGCAGAA TCGCGCCTTC GAGATGGCGC TGGAGCAGGC CAGCCCTGTG CTCGCTCGCC GCGTTCGGGA GATCCGCGAC GGACGCCAAG TGCCCCAGGC CGGGGTGCGG CGGGCAGTGC TGTCGGTGAT GCGTTACCGG CTTCGCGCGT CGGGACGGGC GACACCGTTC GGCCTGTTCG CGGGGGTCGC GCCGGTGCGT GTCGCTGACC ACGCCTCGGT GCGCGCAGGC GTGGCGCACC GTGCCGTCGC GCGGGTCGAG GCTGGCTGGC TCTGCGCGGT GGTCGACCGG TTGGAAGCCG ATCCGGTCGT CCGAGCGCGG CTGGATGTGG TGGCGAACAA CCTCGTGTTC GAGCGGGACG GCCATCTGGT GCTGGAGCAC CGTGCCGCAG GAGGCACCGA CGGTGTTCCT ACGCGCGTGC AGGTGCGGGC GCACGCACCG GTCCGCGCCG CGATGGGTAT CGGCCGCCGC TCGGTGCGGT TCTCCGACCT CGCGGCCAGG CTCGCGTCCG AGTTCGCTCC GGTCCCTGCC GACGTCGTCG ACAGACTTCT GGCCGATCTG GTGGCGCAGC GGTTCCTGCT CACGAACCTG CGGCCTCCGA TGACGGCGAG CGATCCTCTC CGTCATGTCG TGGGTGTGCT CCAGACGACC CTAAATGGCG GGCCGGTCAC CGTCGAGGCG GCCTGGGTCG CCGAACGCCT GAACGGGATT GTCGATGGGC TGCGGAGCCA CGACAGCGCG TCGAGTTGGA CCGTCGCGCG CAGCAGCCGC GAGCGCCTCG GCGAGGACAT GGCTGAGGTC CACCCTTCGG CGGAGCCGGC ACTCCGGGTT GATCTGCGGA TGGACTGGGC GCTGGCTGTG CCCCGCGCGG TCGCGGTCGA GGCCGCAGCG GCGGCGGGCG TGCTGGTGCG CCTGGCGCGG CGCTCGACGC TCAGCTCCGG CTGGGACGCC TGGCATGGCC GGTTCCTCGA ACGGTACGGG CCGTGCGCTC TGGTTCCCGT TTTCGACGCG GTCGACCCCG AGATCGGACT GGGTTATCCC GCCGGCTACG CCGGCAGTCC CGCACCCGCG GGCGTGACGT TCACCGATCG GGACGCGAAG CTGTTGGCGT TGGCGCAGAA CGCCGCGCTG CGCCGCGAAC GGGAGATCGT CCTCGACGAC AGGATGATCA GCGACCTTAC GGTCGTCGAC CAGGACGCCC ACGTTCAGCC CACCACGGAA CTCACCCTGC GCGTCCACGC CACGAGCACC CGCTCGCTCG ACGATGGCGC GTTCACCCTC GCGGTCGTCG GGGTGTCACG CAGAGCCGGC ACCACGACCG GACGACTCCT CGACCTCTTC GACGCGAAGG ACAGTGCGCG GATGCGCGCG CTGTACGCGG GTCTTCCCAC CGCGAGCCGC GACGCGCTGA GCGTGCAGAT CTCCGCGCCC GCTCGCTACA CGAACACCGA CACGGTGGGA CGCGCCCCCC AGATCATGGC GCACCGGCTC TCCCTCGGCG AGTACGACGA CAGCGAGGAC GGGGACTCGC TGCCTCTGGA CGACATCGTG GTGACGGCCG ATGTGCACCG GATCTATCTG CTCTCGCTCT CCCGCCGCCA ACTGGTGGAA CCGGTGGCGC TCAACGCGGT GGAGCCGGTG CGACGCGCGC ATCCGCTGGC GCGTTTCCTC 
GCCGAGGCGC CGGCCGCGCT GAGCGTTCCC TGCGCCCCCT TCGACTGGGG GGTCGCGGCC CAGCTACCGT TCCTGCCGGC GCTGCGGCAT GGACGAACAG TCCTCTCACC AGCGCGCTGG CTGCTGGGGA CCGCGGACCT GCCCGGCTCC GCGGCCGGCT GGACGGAGTG GGACCGTGCC CTGGCCAGCT GGCGGGAGCA GGTCAACCTC CCCTCGGCGG TCTACCTCGG CGACGGTGAC CAGCGCATCG GACTGGACCT GTCTGAGCCG GCCCACCGGG TACTGCTGCG CTCCCACCTC GACCGCTCCG GCGCCGCGCT GCTGCGCGCC GCCCCCGACA CCGACGCGGC CGGCTGGGTC AGCGGGCATG CCCACGAGAT CGTCGTCCCC CTGGCAGCGG TCGCCGCGCC GGCGGCGACA CCGGGCTGGT TCAGCCGGGC GCAGGTCGTT GGCCGCGATC ATGGGCATCT TCCGGGCTGC GACGCACGGT TCTCAGTCAA GATCTACGCC GGCCTCGGTT GCCAGGACGA CATCCTCACC CGCCATCTGC CGCACCTCGT CCACGAGCTG AACGGGCAGC AGGCCGCGGG CGAGGCGCGC TGGTGGTTCC TGCGCTACCA CGACCCCGAC GACCATCTGC GCCTGCGGCT CGCGGTCACC GCCGACGGTG TGGCGCCGAC CGCCGACCGG ATCGGTGCCT GGACCCAGCG GCTCCGCCGC GCCGGCCTGA TCTCCCGCGC GCAGTGGGAC ACCTACTTCC CCGAGACCTC ACGCTTCGGG GGCACCGCCG CGATGGACGC CGCGGAGGCG TACTTCGCCG CCGACTCAGC GGCGGCCCTC GCCCAGCTCG CCGCCTCCAG GGAGAAGAGC GGGCCCGACC GTCGCGCGCT GACCGCCGCC AGCATGCTCG CCATCGTGAC CGGCCTGATC GGCGACACCG CCGAGGCGAC GAGCTGGCTC ATCAGCCACA CCCGGACGGA GCCGTCCGCC CCAGCCCGCG CGCTGTACCA GCAGACCGTC GCCCTGGCGA ACCCGGCCGA CCCACGCGCG CTGGCCGCAC AGCCCGGAGG CGAGCACGTC CTCTCCTGCT GGGCGCACCG GCGCGAGGCA CTCACCGCCT ACCGACACGT CCTCCAGGAA ACCCGCGCGG AGGCCCCCAC CTCCCTGCTG CCTGATCTTC TGCATCTCCA CCATGTCCGC ATGGCCGGAG TCAGCCTGGC CGGGGAACGC GCCTGCCTGC ACCTGGCTCG CGCCGCCGCA CTGAGCTGGA CCGCGCGCAC GAGGAACCCA TCATGA
|
Protein sequence | MYQHLDSLVV RAAVNPPVDV AERWPDLVGP AATPESWRLW LREVMQNRAF EMALEQASPV LARRVREIRD GRQVPQAGVR RAVLSVMRYR LRASGRATPF GLFAGVAPVR VADHASVRAG VAHRAVARVE AGWLCAVVDR LEADPVVRAR LDVVANNLVF ERDGHLVLEH RAAGGTDGVP TRVQVRAHAP VRAAMGIGRR SVRFSDLAAR LASEFAPVPA DVVDRLLADL VAQRFLLTNL RPPMTASDPL RHVVGVLQTT LNGGPVTVEA AWVAERLNGI VDGLRSHDSA SSWTVARSSR ERLGEDMAEV HPSAEPALRV DLRMDWALAV PRAVAVEAAA AAGVLVRLAR RSTLSSGWDA WHGRFLERYG PCALVPVFDA VDPEIGLGYP AGYAGSPAPA GVTFTDRDAK LLALAQNAAL RREREIVLDD RMISDLTVVD QDAHVQPTTE LTLRVHATST RSLDDGAFTL AVVGVSRRAG TTTGRLLDLF DAKDSARMRA LYAGLPTASR DALSVQISAP ARYTNTDTVG RAPQIMAHRL SLGEYDDSED GDSLPLDDIV VTADVHRIYL LSLSRRQLVE PVALNAVEPV RRAHPLARFL AEAPAALSVP CAPFDWGVAA QLPFLPALRH GRTVLSPARW LLGTADLPGS AAGWTEWDRA LASWREQVNL PSAVYLGDGD QRIGLDLSEP AHRVLLRSHL DRSGAALLRA APDTDAAGWV SGHAHEIVVP LAAVAAPAAT PGWFSRAQVV GRDHGHLPGC DARFSVKIYA GLGCQDDILT RHLPHLVHEL NGQQAAGEAR WWFLRYHDPD DHLRLRLAVT ADGVAPTADR IGAWTQRLRR AGLISRAQWD TYFPETSRFG GTAAMDAAEA YFAADSAAAL AQLAASREKS GPDRRALTAA SMLAIVTGLI GDTAEATSWL ISHTRTEPSA PARALYQQTV ALANPADPRA LAAQPGGEHV LSCWAHRREA LTAYRHVLQE TRAEAPTSLL PDLLHLHHVR MAGVSLAGER ACLHLARAAA LSWTARTRNP S
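The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given in coding orientation (it begins with GTG and ends with TGA, although the gene lies on the minus strand), that translation table 11 assigns codons to amino acids as the standard code does, and that an initiating GTG is read as Met. `COMMON_STARTS` is a hypothetical helper that lists only a few common bacterial start codons, not the full table 11 set.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of alternative start codons, each read as Met
# when it occupies the first codon position.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if i == 0 and codon in COMMON_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
fragment = "GTGTATCAGC ATCTCGACTC CCTGGTGGTG"
print(translate_cds(fragment))  # MYQHLDSLVV
```

The output matches the first ten residues of the protein sequence listed above. Applying the same function to the full gene sequence should reproduce the whole protein, provided the two sequence lines are joined first.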
|
| |