Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3057 |
Symbol | |
ID | 5671436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3595869 |
End bp | 3597494 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641241955 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001507375 |
Protein GI | 158314867 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCC AACCGCTGAC CCTTGACGGC GTCGTACCCT ATCCGCCAAA GTCCGTGGCC CAGTACCTCA CAAAGGGATA CTGGCACGAT CAGACGCTTC CGGAGTTGCT TTTTGCCGCG GCGGAACGCT ATCCAGACAA GCTCGCGGTC ATCGACCGCG ACACCCGGCT GAGCTATAGG CAACTCACGG ATGAAGTCCT GCGTCTCGCG GCCGGTCTGC AGGATCTGGG GCTGGGCCGT GGCGACCGAG TGGTCGTGCA TCTGCCCAAT ACCTACGAGT ACATCGCGTT CGTCTTCGCC CTGTGGGAGC TCGGCGTGAT TCCCGTGGTC GCGCCCATCG CGCACCGTCG CGCGGAGATC GAACACTTCA TCGAGATTGC CGAGGCGCGT ACCTACATCA CCGTCGCATC GGACAGCGGC ACCGATCTCG CTGCGTTGGC TGCCGACCTG AAGGGTCGTT GGGCCCATCT GGAGCACACA GTGATCCTCG ATCGCGGCGG CGGCGGCGCC GAGTACGACG CGCTGCTTTC GAAAGGATCG CTCGAGCACG TGCGGCGATG CAGCCCGCAG GACGTCGCGC TGCTGCAGAT GTCGGGCGGC ACTACGGGCG TTCCGAAGCT GATGCCGCAT ACTCATCACA CGTACGGTTA TGCGTTGCGC AGAAGCGTCG GGGAACGGGG CATCACCGAG CGGACGGTGC ATCTGCTGGT CATGCCGATC TGTCACAGTA TGTCGACGCG TTCGCCCGGC TTCCTCGGCG CGTTCAGCGT GGGCGCAACG ATCGTGATCG CGCCGAACGG CAGCCCCGAT GCGGCGTTCC CGCTGATCGA GAAGCACCGC GCGAACCGCG TGACGTTGGT GCCCCCGATC CTGCTGGCCT GGCTGAACTC CTCGCTGCGC GACGCCTACG ATCTCTCCAG CATGCAGGTG ATCATGTGCG GGGGTGCGAA GCTGAGCGAG GAGGTGGCGC GGCGGGTCGA GCCGGAGCTC GGCATGGAGC TGTCTCAGAG CTTCGGGATG GGCGAAGGCC TGGTCGTCAG CAACCCTCCG GACGTCGATC GGGAGACCAG CGTGCGATAC CAGGGCCGGC CGGCCTCGGA GGCCGACGAG ATCCGGGTCA TAGACGACGA GGGAAACGAC GTGCCGCCCG GGGCGCCCGG TCACCTGCTG ACGCGGGGGC CGTCGGTAAT CCGCGGCTAC TATCGCAACC CCGAGCAGAA TGCCCTGGCG TTCACCAGCG ATGGCTTCTA CCGCACCGGT GACATAGTCG AGCGGGACGA GCGTGGGTTC ATGAGGGTCG TGGGCCGGTC GAAGGACCAG ATCAACCGTG GCGGCGAGAA GATCGCGCCG GAGGAGCTGG AGAACGCGTT CCTCGCCCAC GGCGGCGTGC ACAACGCCTC CGTGATCGGC ATCGATGACG AGGTGCTCGG CGAGCGCATC AAGGCGTACC TGATCCCTCG GTCGCCGGAG GACGTGGCCG ATCTCACCTT GAGCAAGCTG CGTCGGTTCC TGCAGGAGTG GGGCCTCGCG ACATTCAAGC TTCCCGATTT CGTGGAGGTG GTCGACAAGT TCCCGTACAC CGCGGTCGGC AAGGTCAGCA AGCGACTGCA GCGCGAGCAG AACTAG
|
Protein sequence | MSIQPLTLDG VVPYPPKSVA QYLTKGYWHD QTLPELLFAA AERYPDKLAV IDRDTRLSYR QLTDEVLRLA AGLQDLGLGR GDRVVVHLPN TYEYIAFVFA LWELGVIPVV APIAHRRAEI EHFIEIAEAR TYITVASDSG TDLAALAADL KGRWAHLEHT VILDRGGGGA EYDALLSKGS LEHVRRCSPQ DVALLQMSGG TTGVPKLMPH THHTYGYALR RSVGERGITE RTVHLLVMPI CHSMSTRSPG FLGAFSVGAT IVIAPNGSPD AAFPLIEKHR ANRVTLVPPI LLAWLNSSLR DAYDLSSMQV IMCGGAKLSE EVARRVEPEL GMELSQSFGM GEGLVVSNPP DVDRETSVRY QGRPASEADE IRVIDDEGND VPPGAPGHLL TRGPSVIRGY YRNPEQNALA FTSDGFYRTG DIVERDERGF MRVVGRSKDQ INRGGEKIAP EELENAFLAH GGVHNASVIG IDDEVLGERI KAYLIPRSPE DVADLTLSKL RRFLQEWGLA TFKLPDFVEV VDKFPYTAVG KVSKRLQREQ N
|
| |