Gene Information |
Locus tag | Francci3_1821 |
Symbol | |
ID | 3906212 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2156428 |
End bp | 2159652 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637879159 |
Product | lantibiotic dehydratase-like |
Protein accession | YP_480926 |
Protein GI | 86740526 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.525811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACCA TTCGCGACCC AGGCGGGTCC GCTGCACCAC GCTCGGCGAT CCCGCGAGCC ACGCCGTCAG AGGTCCGCGA CTCCGACTTG GCGAGCGGCC CGCAGGTCGC CGGGCTCGGC GCTCGGGCCA GCGACGCGCT GTACAGGGTG GCTGGCCCGG GTGTCGTTCG GGTGGCCGAA CCGATGCCAG GTCTGGCCGC GCTGCCTTGG CCTGACCTTG GGGACGGGGG CGAGGAGACC GAGCGGTGGT GCCAGTGGCT GCGCGAGGCT TGGGCGTGTG CGCCGTTCGC CGCAGCCGTG CAGGCCGCGA GCCCAGTGCT CGCGCGCCGG ATCGCCCAGG TGTGCGCGGG CGGCGATCTA CCCGCACGGC AGGCACGTAG CGCCGTGCTG TCGCTCATGC GGTACCGGTT ACGGCTGGTG AGCCGAGCTA CACCGTTCGG CCTGTTCGCC GGGGTCGGCC CGGCCCGGCC CGGTACCGCG CTCACCGTGT ATGAAGACCC GACGCACCAT CGCATGCTGG CACGGGTGGA CACGGCGTGG TTGTTCGATG TGCTTGACCA GCTCGAAGGC GTTCCGGGTG TTCTCGACGT CCTGCCCGTC CGCGCGAACG ATCTCGCCTT CGTTCGTGAT GGTCGGCTGG TACTGCCGTT CGAGGGCCCG CGAGCCGGGA GCGACGCCGA GGTGTCCACA GCCGAGACCT CGGTCCGATA CACACAGGCT GTCGCGGCGG TGATGAAGGC TGCGGCCGGA CCCGTTCTCG TCGGGACGTT GGCCGCCCAG CTCGGCACCG AGTTCCCGGC CGCGCCTGCG CGGGTGCTTC GAGGCATGCT CTCGACACTG GTCGAGCGGC GGTTCCTGCT GTCCGGGCTT CGGCCACCGT CCACCGAACT CGACCCGCTC GGCGCGGTCC TCGCGACGCT GCGGGCGATC GGTGCCGACC CGGTACCCGA GGCGGCCGAT CTCGTGGACG CCCTGCGCCG CCGCTCTCGC GGGCTCGCCG ACATCACGGA ACCGTGGACG TCACAGCCGG CGACCCCAGA CACCGCAGCC CCGACCTCAA GCCCCTCCGG GCGCCGGCCT CCCATCGCCC TCGACATGCG GGCCGGCGTC GGGCTGGAAC TGCCTCGGCT GCTCCTCCGA GAGGTAGAAG CAGCCGTCGA CCTGCTGGTC CGGCTCGCGC CGGCACCCCG GGGCAACGCC GCCTGGCGCG ACTATCATGA TCGATTTCTT GAGCGCTTTG GTGCCGGCGC GTTCGTGCCG GTCAACTCCC TGATCAATTG TGAAACCGGG CTCGGCCTGC CCGCCGGATA CCGGGGAACC ACACTCGGCC CGGCAACGGC CGGCCCGCTC TCGGACCGCG ACGCGCTGCT GCTTGCCCTG GCGCAGACCG CGGCGGCCCG CCGAGACCGC GAAGTCACCC TTGATGCCGA GTCGCTCGCG CTGCTCCGCA CGAGCGACGC GGTGGGCTGG ACCTGCCAGC CCCACACGGA GCTGCGGTTT CGGCTGCACG CAGCAGACCG CCGCGCGGTG GAACGCGGCG CGTTCGACCT CGCTGTCGAG GGAGTCTCAC GGGCCGCCGG GACAACTCTG GGCCGCTTCC TCGACCTCGC CAACGAGGCC GACCGCGAGG AGATGACCTC GGCACTTCGC GCGCTGCCGA CCCGCCAGCC GGGGGCACTT CTCGTCCAGT TGTCGGGTGG CACGCTGTCG GCCACCGCTG CGAACGTCTC CCGCGCGCCG CGCATCCTGG ACCATCTCAT CGCGCTCGGC GAGTACCAGC CGCCGGACCG CGGGATGATC 
CCCGTGACGG ACCTCGCCGT CACCGCAGAC GGTTCGGGTC TGTGGCTGGT CTCGTTGTCC CTGGGCCGAC CGGTGGAGCC GGTCGCGTTC CACGCCGTCG AACTCACCCG TCACGGACAC CCCCTGCTCC GGTTCCTCAG CGAGATCGGC ACTTCCCGCG CCGCCCCCTG CGCGCCGTTC TCCTGGGGCG CGGCCCGACG GCTGCCGTTC CTGCCACGCC TGCGCCACGG GCGGACGATC CTCGCCCCGG CCCGCTGGCT CCTGGCCCCG GCTGACCTGC CCGGACCGGA CGCCACCTTC GCCCGGTGGA GCGACGACCT CGCGGCCTGG CGGGACCAAT GGGGCGTTCC CGACCAGGTC TTCCTCGGCA GCGACGACCG CCGCCTCCTG CTCGATCTGA CCGAGCCCGC CCACCTCCAC CTGCTGCGCG CAGACTTGGG CCGCGCCACG CGGGCAACCC TGCGGGAAGC ACCGCCGCCT GACGCAGCCG GATGGATCGG CGGGCGGACC CACGAGATCG TCCTCCCGCT CGCCGCACCT CCGACCCCGA CGGCGGAGGT ACCGCGCCCA CGGCGCCCGG CGCGGATCGC GACCTCGGCG GACGCCCATC TACCCGGAGA CGGCGCCTGG CTGTACGCCA AGCTCTACGC CCAGCCCGAC CGTCAGGTCA CCATCCTCAC CGAACGCCTC GCCAGCCTGT GGGAACACTG GGACACCCCG CCGCTGTGGT GGTTCCAGCG ATACCAGGAC CCAGCCCCAC ATCTGCGGCT ACGAATCCGC CTCATCGACC CTGACGGGTT CGGCGACGCG GCCCGGAGAG TGGGCCGCTG GGCGACCGCA CTGCGCCAAG CCGGCCTACT GGACCAGCTC CAGTTCGACA CCTACCTTCC AGAGACCGGC CGCTTCGGCG GCGCCTCGAC GCTCGCCGCC ACGGAGACGC TGTTCGCCGC GGACTCCACC GCCGCACTCG CCCAGCTCGC AGCCTCCGCC CGCGGTGCCA CCCACCCCCA TGCGCTGGTC GCCGTGAGCC TGCTCGACCT CGCCGCGGGC TGCCTACCCG GAGACGACGC CGCCCGCTGG CTCGTCGAGC AGCTACCACG CACGCAGGGA CCGCCGATCG ACCGCGCGCA GCACGACGCC GCCGTTCACC TCGCTGACCC GCGCGAAAGC CAGGCGACGA TGCACACCCT GCCGGGCGGG AAGGAGATCC TCACCGCCTG GGCGCGCCGC CGGGCGAGGC TCGCCGACTA TCAGACGGCG CTCACCTCGG CCGGCGACGG ACTCACGACC CGCGGCCTCT TGCCGACGCT GATGCACCTC CACCAGTTCC GGATGACGGG GCCGTCCGTC CAGGCAGAGC GCGACTGCGC CCGCCTCACC CGCGCCGTTG CGCTCAGCGT TCTCCGCCGC CGGGAGATGG CATGA
|
Protein sequence | MPTIRDPGGS AAPRSAIPRA TPSEVRDSDL ASGPQVAGLG ARASDALYRV AGPGVVRVAE PMPGLAALPW PDLGDGGEET ERWCQWLREA WACAPFAAAV QAASPVLARR IAQVCAGGDL PARQARSAVL SLMRYRLRLV SRATPFGLFA GVGPARPGTA LTVYEDPTHH RMLARVDTAW LFDVLDQLEG VPGVLDVLPV RANDLAFVRD GRLVLPFEGP RAGSDAEVST AETSVRYTQA VAAVMKAAAG PVLVGTLAAQ LGTEFPAAPA RVLRGMLSTL VERRFLLSGL RPPSTELDPL GAVLATLRAI GADPVPEAAD LVDALRRRSR GLADITEPWT SQPATPDTAA PTSSPSGRRP PIALDMRAGV GLELPRLLLR EVEAAVDLLV RLAPAPRGNA AWRDYHDRFL ERFGAGAFVP VNSLINCETG LGLPAGYRGT TLGPATAGPL SDRDALLLAL AQTAAARRDR EVTLDAESLA LLRTSDAVGW TCQPHTELRF RLHAADRRAV ERGAFDLAVE GVSRAAGTTL GRFLDLANEA DREEMTSALR ALPTRQPGAL LVQLSGGTLS ATAANVSRAP RILDHLIALG EYQPPDRGMI PVTDLAVTAD GSGLWLVSLS LGRPVEPVAF HAVELTRHGH PLLRFLSEIG TSRAAPCAPF SWGAARRLPF LPRLRHGRTI LAPARWLLAP ADLPGPDATF ARWSDDLAAW RDQWGVPDQV FLGSDDRRLL LDLTEPAHLH LLRADLGRAT RATLREAPPP DAAGWIGGRT HEIVLPLAAP PTPTAEVPRP RRPARIATSA DAHLPGDGAW LYAKLYAQPD RQVTILTERL ASLWEHWDTP PLWWFQRYQD PAPHLRLRIR LIDPDGFGDA ARRVGRWATA LRQAGLLDQL QFDTYLPETG RFGGASTLAA TETLFAADST AALAQLAASA RGATHPHALV AVSLLDLAAG CLPGDDAARW LVEQLPRTQG PPIDRAQHDA AVHLADPRES QATMHTLPGG KEILTAWARR RARLADYQTA LTSAGDGLTT RGLLPTLMHL HQFRMTGPSV QAERDCARLT RAVALSVLRR REMA
|