Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3315 |
Symbol | |
ID | 3904101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3927165 |
End bp | 3930248 |
Gene Length | 3084 bp |
Protein Length | 1027 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637880640 |
Product | lantibiotic dehydratase-like |
Protein accession | YP_482401 |
Protein GI | 86742001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.301651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTGA GATCTCCAGC GATGTACCAG TGGGCCGGTG CAGCACTACT ACGTGCGAGC ACCGATCCGG GCGGATTGGA CTTGCCAGCG GACCTGGACC TGTTCGGTGC CGACGCCGCG GAAGAAGGGT CGGCGTGGCT GTCGGCGATG TGGCGGCGCG AGGAAATTCG CGCGGCGATC GCTCAGGCGA GTCCCGCGCT GATTCAGCAG GTTGACACCG TTCTGACCTC CAGTGGTCAT GACGTGCGAG TGGTTCGCCG GACCGTGCTT TCCGTAGCGT CCTATCTGCT TCGGTGGCAG CGCCGTCCCA CTCCCTTTGG GCTGTTCGCC GGAGTCGCGC TGGCCCGGAT CGATGCTGGG GCGAAGGTGC GGTGGGGTCG CGATCACCGG GTCGAGGCCC GGGTTGACGC GGGCTGGCTC GGTGATGTCC TCGCGCGCCT GCAACGGTGT CCGACGCTGC GGGAGCGGCT GTCGCTGGTC GCCAATGGCG CCGGGTTGGT GCGCGGTGAC CGCTTCGGAG CGCCCGCGCC GACGCCGGAT GGCATAGCGG ACGAGTTGGC GCCGATCGAG GTGTCCGTGC GCCACAGTCG ACCGGTCTGC GCCGCGCTGG AGGCCACGCG GAAACCGGTC ACGTTCAGCG AGCTACGAAC ACTGCTCATG GAGCGCTTCC CCAGTGCCCC CGCGCAGCGG ATCGACGAGA TGCTCACGGG TCTGCTCGAC CAAGGAATCC TGCTGAGCAA TCTGTCAGCG CCGATGACCT GCCTGGATGC ACTCGGTCAT GCGTGCGCTC AGCTGGAGGC CGTTGACGCC CACAGCATTC CGGAGGTTAG CGATCTTGTC CGTTCGATGT TCGAGATCCA CAAGGAGGTG TCGGCCACCA GCCAGGTTCT CGGGTCAAGG TCGGCCGTGA CCGAGCAGAT GCACGCGCTG AGCGAGGCCG CCGAGGTACC CATGATCGTC GACACGATCC TGGAGTGCGA CGTTCACATA CCGGACCAGG TCGCCCAGGA AGCCCGCAAC GCCGTCCAGG TCCTCTATCG ACTCTCACCG TATCCGTTGG GTTATCCCGC CTGGCGGGAC TACCACTCCC GGTTCCGGAC CCGCTACGGG ACGGGCGCCT TCGTGCCGGT CATGGACCTG ATCTCCGACA GTGGCCTGGG AGTTCCGGCC GACTATCTGG GCTCGGCGCG CAGGCGTGCC GCTCGGCAGG TGAGTGAACG TGACGAGAAA CTACTGGCGC TGATCCAACG GGCCACGCTG TCCGGCGGCG GCGAAATCGT CCTGACTGAT CAGATGATCG AGGAGCTTGC GGTCAGCGAT CCGGCCGACG TGCACCTGCC TGCTCGGGTC GAGGTGGCCG TGGAGATCCG CTCCATGTCC GTTGAGGCGC TGGCCCGCGG CCGGTTCACG GTGGCGGTGA CCGGCACGCC ACGGCCCGGC AGCAGCATGG CTGGCCGCTA CGCCCACCTG CTGCCAGCGG ACGGCCGCGA CCTGATCGCG GGCACCTTCG CTGCGGCCGG CACCGACGCG ATCCCCGCGC AGCTCTCCTT CGCCCCGCGT AAGCGGCGCA ACGAGAACGT CGCGCGCACG CAGCAGCTCC TGACACATGT GATCCCCGTG GCCGAATACC GCGACGGCGA CGAACGCCTG ATCCCCCTGA CGGACCTCGC GGTCAGCGTG GACGACCGCC GCTTCTACCT CGCCCAGATC TCCACCGGCC GGTACGTCGA ACCGCGGGTC GCCCACGCCC TGGAGGCCGG CGTGCACACC CCGCCGCTCG CGAGGTTCCT CGCTGAGATC ACCACCGCCC GAGCCGCCGT GTACAAGGCA TTCCACTTCG GCGCGGCGGC ACAGCTTCCC TACCTTCCAC GCGTTCGATA CCGGCGCACC GTGCTGTCTC CGGCACGGTG GCTGCTAGCG GCCGGTGAAC TTCCCGGCCG CGGCGCCTCG ACGGCCGAGT GGGACGCCGC GCTGGAAGAC TGGTGCAGCC GGTGGTGGGT TCCCGGCCAT GTCGCGATGG TGGAGCACGA CCGGCGGCAG CCGGTAGACC TCGGCCACCC GCTTCACCGT CTCCTGCTGC GCACCCGGCT GGAACGCGCT GACCGCCTGG AACTGCGCGA GACGTCGACC CTGGAAGACG TGGCCTGGCT GGGGCGTGCC CACGAGGTGC TGATCCCCAT GGTCTTGGAC CCGCAGCCCG CCACAGATCC CGGGCCAGGC ATCAGCACAC GGCGAGTCGT GGCCGTCGAC GCCGGGCATC TCCCCGGCGA GTCCACGGTC GTGTCCGCGC ACCTGTACGG GCATCCGGCG CGCGTCGAGG AACTCCTGAC GCAACACCTT CCCCACATGA TCGACGCCTT CGGCGTCCAC AGGCCGCGCT GGTGGTTTCG GCGGAACCGC GAAATGCGCA GACCAGAGAT CGACCAGTAC CTCGCCGTAT ACCTCTGGCT ATCGGAGCCC TCCGCATACG GCCCTGCCGC CGCATGCCTT GCCCGGTGGG CCGACGATCT GCGCCGACAA CACCTGCTCG CGCACGTCTC GCTCACCACC TATGACCCCC AGTCGGGACG TTACGGACAC AGCCCAGCCC TGGACCACGT CCAGGACGTC TTCGCCGCCG ACTCGGCCTG CGCCATCGCC CAGATCAGCG CATCCATCCG CGCAGGCGTG CATCCCCAGG CCCTGGCCGC TGCCAGCCTG GTCGACCTGG CAGTGAGCTA CGCCGGGTCC CCACAAGACG GGCTGGACTG GCTGATCCGC GAACTCCGCC AAGAACACGG AAGGCTGGAC CCCGCGCTAC GGCAACAGAC ACTCGAACTA GCCGACCCGC ACGGCAGTTG GACGCGGCTG CAATCCCTGC CCGGCGGACG CGATGTCCTG GCTGCCTGGG GCACCCGCGC CAGTGCGCTG GCGGCGTACC GAGATGCCCT CGCTGACCAA CGCGACCCGA TGCCGGTCCT GCGATCGCTC CTGCACCTGC ACCACAATCG CGCTGTCGGT GTCGACCCGG CTGTCGAACG AGCCACCGGC CGGCTCGCAC GGGCCTGCGC GCTGCGCCAC ACCGCCCACC GCACGGAGAC ATGA
|
Protein sequence | MAVRSPAMYQ WAGAALLRAS TDPGGLDLPA DLDLFGADAA EEGSAWLSAM WRREEIRAAI AQASPALIQQ VDTVLTSSGH DVRVVRRTVL SVASYLLRWQ RRPTPFGLFA GVALARIDAG AKVRWGRDHR VEARVDAGWL GDVLARLQRC PTLRERLSLV ANGAGLVRGD RFGAPAPTPD GIADELAPIE VSVRHSRPVC AALEATRKPV TFSELRTLLM ERFPSAPAQR IDEMLTGLLD QGILLSNLSA PMTCLDALGH ACAQLEAVDA HSIPEVSDLV RSMFEIHKEV SATSQVLGSR SAVTEQMHAL SEAAEVPMIV DTILECDVHI PDQVAQEARN AVQVLYRLSP YPLGYPAWRD YHSRFRTRYG TGAFVPVMDL ISDSGLGVPA DYLGSARRRA ARQVSERDEK LLALIQRATL SGGGEIVLTD QMIEELAVSD PADVHLPARV EVAVEIRSMS VEALARGRFT VAVTGTPRPG SSMAGRYAHL LPADGRDLIA GTFAAAGTDA IPAQLSFAPR KRRNENVART QQLLTHVIPV AEYRDGDERL IPLTDLAVSV DDRRFYLAQI STGRYVEPRV AHALEAGVHT PPLARFLAEI TTARAAVYKA FHFGAAAQLP YLPRVRYRRT VLSPARWLLA AGELPGRGAS TAEWDAALED WCSRWWVPGH VAMVEHDRRQ PVDLGHPLHR LLLRTRLERA DRLELRETST LEDVAWLGRA HEVLIPMVLD PQPATDPGPG ISTRRVVAVD AGHLPGESTV VSAHLYGHPA RVEELLTQHL PHMIDAFGVH RPRWWFRRNR EMRRPEIDQY LAVYLWLSEP SAYGPAAACL ARWADDLRRQ HLLAHVSLTT YDPQSGRYGH SPALDHVQDV FAADSACAIA QISASIRAGV HPQALAAASL VDLAVSYAGS PQDGLDWLIR ELRQEHGRLD PALRQQTLEL ADPHGSWTRL QSLPGGRDVL AAWGTRASAL AAYRDALADQ RDPMPVLRSL LHLHHNRAVG VDPAVERATG RLARACALRH TAHRTET
|
| |