Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2406 |
Symbol | |
ID | 3906389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2787547 |
End bp | 2790240 |
Gene Length | 2694 bp |
Protein Length | 897 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637879736 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_481502 |
Protein GI | 86741102 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0350264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAAG GCCAGGTCAC CCAGGCGGTG CCGGTACCGC CGCCGCTGCG TCCCGACGAC CGGTTCGGAA ACGTCGTCGA CCGGATCCGC ACCGTCGCCA TGGTCTGCCA CGACGTCGTG GCGGTCCGCG ACGAACACCG TGCCGTTACC TTCGCCCAGC TCGTCGCCTG GGTGGACGTC GTCGCCGACC GCATCCTTCG GCAGCCCGCC GCGGCTGACC CGGACACGCC CGTCGCGGTG CTGTTGCCCC ACGGCGCGAG CGGGATCGCC GCCGTCCTGG GCGTCATCGC GAGCGGCCGG CCCTGCGTGC CCCTCGACAG GATGCATCCC ACCGACCGTC TCGCTCAGGT CGTGGGACTG GCGGGGGCCT CGGTGTGCGT CACCGGACCG ACTGGCTCCG CCGACCAGCG GACTGCCGCC GCGCTGCCGG GGATCGTGGA GACTATCGAC GTCGGCGACG AGCGCGTCGC CGACTGGTCC CCGGCGCTGG CTGACCGGGT GGCGTCGACC GCGCCGCGCC GGGCCGATAC CGATCCAGCC GTGCTCATCT TCACCTCCGG GTCGACCGGA GTACCGAAGG GTGTCGTGTG GCACCACCGC GCCCTGCTGG GCATCCACTA CGCCATCCAG GTCCAGGACG TGATGCGGCT GGTTCCGGGC GACCGCCTGC CGCTCTTCCT GCCCTACTCC TTCATCTCGG GGATGAACCG GACGGTCGGT GGCCTGGTCT TCGGCACCAC CCTCGAGATG TACGACCCTC GGGTCCGCGG CGTCCGAGAC CTGGCCGACT GGCTGCGCGC CACCCGGCCG GCCGGCATCG TGGCCACGCC GGCGCTGATC CGGATCGTGT TCGGCTGCCT TGAGCCGGAC GAGGTGCTCG ACGACCTCCG ATTCGTCATG TCGGTCGGCG AGGCGATCTA CGCCCGCGAC GTCGAGTTGG CCCGCCACCA CCTACCGCCG GCCGCGGCGT TCCTGGTCTC CTACGGAGCC TCGGAGCTCG GCACGGCGAC CTGTGCCCCG ATCTGGTCGG ACGACGACCT GCCCGACGGC GTGATGCCGG CCGGGCGGCC GGTGGTTGAC GTCGCGGTGA GGGTGGTCTC GCCGGATGGC ATCGAGATGC CGCCCGGTGA GACGGGGGAG ATCGTCGTGA TGGGCCACTT CATCACGGGG GGCTACTGGC GGGCGCCGGC GGCGAGCGCA TCGCGGTTCG GCATCGGGCC GGACGGCACA CCCACCTACC GCACCGGAGA CCTCGGGCGC ATCGACGCCA GCGGCCAGCT CAGGGTTGTC GGCCGCAACG ACGCGGCCGT GAAGATCCGT GGGTACCTCG TCGAGCCGAT CGAAATCGAG TCCGCCCTGC TGGCGAGCTC GGATGTGCTA GAAGCCGTCG TCGTGGCCGA CAGGACCACG CAGCGGGCCC GCCTGGTCGC CTACGTCGTC CCCGTGACCG GCGCCCGGGT CTCACCGGCC TCGATCAGGC GCCTGCTACG GGCCAAGCTG CCGTCCTACA TGGTGCCGGC CACCGTCATG CTGGTCACCG CGCTACCGAG GACCGACCGC AGCAAGGTCG ACCGACTGAA CCTGCCCCCC GCCGGTCCGA AACCCGGCCA GGACCCGCCC CGCGACCAGT GGGAAGAGGC CGTCGCCGGT GTGTGGGCCG CCGCCCTGCA CCTCGACGAC GTCGGTATCC ACGACGACTT CGTCGAACTC GGCGGCGATT CCCTCATCGC CGAAGAGCTC CTCACCAGAG TCGCCGACGA ACTCGGTGTC AAACTCCCCA CCTCCACCGT CGCCGACGCC CCCACCGTCG CCGAGTTCAC CGCCCGCCTG CGCAATGCCG GCACCGACGT CCTACGCCAC CCCACCGTCG TCCCGCTGCG CACCACCGGC AGCGGCGGGC CCCTCTTCTG TTTCTGCGGC GCCGGCGGCC TCGCTGTCGG CATGCTTGGC TTTGCCCGCC ACTTCGACGG CGAACGCCCC GTCTACGGCG TCCAAGCCCA CGGCCTCGAA TACCGCGGTC TCCCCGACTG GTCCATCTAC GCCGCCGCCC GCCGTCACGC CCGCACCCTG CGCCTACTCC AACCCGCCGG CCCCTACTAC CTCGCCGGCC ACTCCTTCGG CGGCCTCGTC GCCCTCGAAA CCGCGAGACT CCTCACCGAG GCCGGCGAGC ACGTCGAACT GCTCGTCCTC ATCGACAGCT TCCTGCCCGA CACGTCCTCC GGCGTCTTCG CGGGAGGAGT GCAGCCCTCT CAGCTTCCAG TTCCGGGGGG CCCGACCAGG AGCTCGAATC CGGCCCCAGC CTTGGACCGT GCGAGGGGAT CGGCTATGGG CCCGGCGCGC GCGGGGGCTG CCGCACGGGT CCTGCTCGGC TGGGCCCGGC AGGCGGCCCA ACTCCCGTTG GCTGGCGTCG TTCAGTTCAA GGGCATGAAC CAGTACGACG TCTTCTACAA CCAGTCGCGG GTGCTCACGA GGTTCTACCG GCCCAAGCCC TGGAACGGGC GGGCGCTGGT CTACCTGGCG GCCGCCTCGC CGCCGCACCG TACGGAGGCC TGGCGCCCGC TGCTGACCGG CGAGACGACC TACCGCACCG TGGGCGGCGA CCACGACACC GTGCTGCGCG AACCCGTCGT AAGCGAGATC GCGGCCGACA TCCGGGCGGT CCTCGCTGGC TGTACCAGCT CCCGCAAGGC GTGA
|
Protein sequence | MRKGQVTQAV PVPPPLRPDD RFGNVVDRIR TVAMVCHDVV AVRDEHRAVT FAQLVAWVDV VADRILRQPA AADPDTPVAV LLPHGASGIA AVLGVIASGR PCVPLDRMHP TDRLAQVVGL AGASVCVTGP TGSADQRTAA ALPGIVETID VGDERVADWS PALADRVAST APRRADTDPA VLIFTSGSTG VPKGVVWHHR ALLGIHYAIQ VQDVMRLVPG DRLPLFLPYS FISGMNRTVG GLVFGTTLEM YDPRVRGVRD LADWLRATRP AGIVATPALI RIVFGCLEPD EVLDDLRFVM SVGEAIYARD VELARHHLPP AAAFLVSYGA SELGTATCAP IWSDDDLPDG VMPAGRPVVD VAVRVVSPDG IEMPPGETGE IVVMGHFITG GYWRAPAASA SRFGIGPDGT PTYRTGDLGR IDASGQLRVV GRNDAAVKIR GYLVEPIEIE SALLASSDVL EAVVVADRTT QRARLVAYVV PVTGARVSPA SIRRLLRAKL PSYMVPATVM LVTALPRTDR SKVDRLNLPP AGPKPGQDPP RDQWEEAVAG VWAAALHLDD VGIHDDFVEL GGDSLIAEEL LTRVADELGV KLPTSTVADA PTVAEFTARL RNAGTDVLRH PTVVPLRTTG SGGPLFCFCG AGGLAVGMLG FARHFDGERP VYGVQAHGLE YRGLPDWSIY AAARRHARTL RLLQPAGPYY LAGHSFGGLV ALETARLLTE AGEHVELLVL IDSFLPDTSS GVFAGGVQPS QLPVPGGPTR SSNPAPALDR ARGSAMGPAR AGAAARVLLG WARQAAQLPL AGVVQFKGMN QYDVFYNQSR VLTRFYRPKP WNGRALVYLA AASPPHRTEA WRPLLTGETT YRTVGGDHDT VLREPVVSEI AADIRAVLAG CTSSRKA
|
| |