Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5497 |
Symbol | |
ID | 5673828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6653778 |
End bp | 6656711 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244352 |
Product | acyltransferase 3 |
Protein accession | YP_001509758 |
Protein GI | 158317250 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1835] Predicted acyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.114019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGCTCC CGGTACGCGA ACGGACACGG CACCGTCCGT GCCAGGCCCC GTCCGGCTCG ATCCGGCGGC CTGCCGCCAC ACCTGCGCAC CTACGACATC GGAAGGGGAC CGGGCTGAGC CGCCGAGACA CCGACACCTA CCCGCTGGCT CGCCCCCGCC CCGGTCAGCC GGAGGACGAC CACGCCGCCG AGCGCAACCC GCGGCGCCCC GTCCGGGACC GGCGCAGAGC CGCGGACCGA CGCGCGGACA CGGACAGTGG CACGGACCAC GACGAGGTCA CAGACCATTT GGACAGCCTG TATCTCGACC GGTCCCGGCT CGACGCCGCC TACCAGGACG ACGCCTACCC GGACCCCGAG GGCCGCCTGC GCCGGGCCGG CGCGTACAGC GGCGACGAGA CCGAGCACAT CGAGCGCCGG GCCGCCGCCT CACGCGACCC ACGTGAGGCG GCACGCGACG CCGACGATCA CACCCGCGTC ATCGCCCGCC CCCGCGTCCC AGCCCGTGCC GGGCGAGCCG CCCGCACGGG GCGCCCCGCG GGCGACCGGA GCCCGCGGGG CGGCCACGAG CTCGTCGGGA GCGGCCCGCC GCGCCCGCAC GGCGCCGAGC CGGCCGAGGC CCGCCGGTCC GGTTCCCGCC AGCCGGTGGC CGACCCGTCC AGGCGACGCG AGCCGGGCGG GCCGCGCACG CCGCGACAGC CGGGCACGTC ACGGCAGACA GGCACGTCAC GGCAGCCGGC CGGCGGGCGA GGCGCGGGCG AGGCGGACGA CCTCCTCGAA GCGGACGACC TCCTGGAGGC GGACAGGCCC GGTGCCCGCG CCCGCGGCGC CGCCGACCCG CATGCGACCG GCCCGCTCGA GACCGGGCAG CCGGACGACG GATCGCCGGA GACCGGGCCG CTCGAGACCG GACCACTCAC GGCACCGGTA CTGGCACCAT TGGGCCACTC TCCCGCGCTG GACGGCCTGC GCGCCCTCGC CGTCCTCGGG GTGATGGCCT ACCACGCGGG CCTGAGCTGG ATGCCGGGCG GACTGCTCGG CGTCGACGCC TTCTTCGTCC TCTCCGGATT CCTGATCACC GGCCTGCTGG TGGCCGAGTA CCGGACCACC CGGCGCATCG ACCTGAAGTC GTTCTGGATC CGCCGCACCC GACGTCTGAT GCCGGCACTG CTGACGATGA TGCTGGGCGT GGCGGCCTAC GCCCGGTTCG TCGCCCAGCC CAGCGAGGTC GAGACACTGC GCGTGGACGC GCTGTCGACC CTCGGGTACG TCGCCAACTG GCGGTTCGCG CTGTCGGACC AGAGCTATTT CGACCACTTC TCCGCCCCGT CCCCGCTGCT GCACACCTGG TCGCTGGCGG TGGAGGAACA GTTCTACGTC CTGTGGCCGC TCGCCGTCTA CCTGCTGATG TGGCACAGCG GGCGCACCAC GGCGCGCTGG CGCAACCAGC GCCACAAGGC GCAGACCCTG ATCCTGACCG TCGCCGTGCT GGGCGCGGAG GCGTCCGCGC TGCTCGGGCT GCTGCTGGTC GTGCTGGGCG CGGACGCCTC CCGCATCTAC TACGGCACGG ACACCCGCGC GCAGGCGCTG CTGGCCGGCG CCGCACTGGC TGTGTGGCGG ATCCAGCGGC GGACACCGCT CACCGAGCGG ACGAAGAAGG GCCTGTCCGT CGCCGGCTGC CTCGCCGGGG TGGCGATGCT CACGGTCTGG GCGACCGTCG ACGGCGAGAG CCGGGTGCTC TACTCCGGCG GCTTCCTGGG CGTGGCGGTG ATCGTGATGC TGCTGATCGC CTCGATCGTC GAGGTGCCCC GCGGGCCGGC CGCGCGGGTG CTGTCGCTCC CGCCACTACC CACCATCGGC CGGGTCTCCT ACGGCCTCTA CCTGTGGCAC TGGCCGGTCT TCCTGACCGT CACCGCGGCC CGCACGGGGC TGAGCGGCGC GGCGCTGCTG GGCGTGCGCG CGGCCGTCAC GGCGGCGATC ACCATCGCCT CGTTCCACCT GGTCGAAAAC CCGATCAGGC GACGGAAGAT CCGTTTCCCC ATGCCGCGGG TCACCGTCCC GGCCTCGATC TGCGCGGTCG TCGCCCTGGT GATGGTCGCC ACGATCGCGG ACCCGTCCGC CGGCCGCGGC GCGGCGGACC TGGAGCAGCT GGCCGCGCGG GCCGCCACCG CGGCACCGCC GACGGCCCGG GCCGAGCCGG TGACGGGCCG CCCGCTGCGG GCCGTCCTGG CCGGGGACTC GCTCGCGCTC ACCCTCGGCT TCAGCGAGTT CTCCACGGCC GCCGCCCGCG AGGGCCTGGA GATCCACGAC GCCTCCCAGC TCGGGTGCGG GGTGACCCGC TCGCCGCGCC GCCTCCTCAT GGGCGAGGTG AACCTGCCAC CTCCGGGGTG CGACCAGTGG CCCACCCGCC TCAGCCGGAA AGTCGACGAG CTCCGTCCCG ACCTGGCCAT GCTGCTGGTC GGGCGCTGGG AGGTAACCGA CCAGGACCTC GAGGGCCGAC GGACCCACAT CGGTGATCCG GCGTTCGACG CCTATCTCGG GCGCGAGCTC GACCTGGCGA TCACCACCCT CTCGGCCCGC GGAGCGAAGG TGGTGCTGCT GACCACCCCG ATGTTCGAGG AGGGTGAGGC GGCGGACGGC AGCATCTACC CGGAAACCCG GGCCGACCGG GTGATCCAGT TCAACAAGCT GCTTCGCCAG GCCGCCGCGC GGCACTCCGC CGTCACCACC GTCATCGATC TCGCCGCGGC ACTCAGCCCG GGCAACGAGT ACCGCGGTGA GATCGACGGC GTTCGCGTGC GCGACGACGA CGGTGTTCAC ATCAGCAACG GCGGTGGAGC GCGGGCCGGC CAGATGGTGC TACCCGAGCT TCTCAAGCTG GCCCGCCCGG CCACGGCGCG TGGTGCCGAC GGGCCGGCCC CGGCACCCGA GTGA
|
Protein sequence | MLLPVRERTR HRPCQAPSGS IRRPAATPAH LRHRKGTGLS RRDTDTYPLA RPRPGQPEDD HAAERNPRRP VRDRRRAADR RADTDSGTDH DEVTDHLDSL YLDRSRLDAA YQDDAYPDPE GRLRRAGAYS GDETEHIERR AAASRDPREA ARDADDHTRV IARPRVPARA GRAARTGRPA GDRSPRGGHE LVGSGPPRPH GAEPAEARRS GSRQPVADPS RRREPGGPRT PRQPGTSRQT GTSRQPAGGR GAGEADDLLE ADDLLEADRP GARARGAADP HATGPLETGQ PDDGSPETGP LETGPLTAPV LAPLGHSPAL DGLRALAVLG VMAYHAGLSW MPGGLLGVDA FFVLSGFLIT GLLVAEYRTT RRIDLKSFWI RRTRRLMPAL LTMMLGVAAY ARFVAQPSEV ETLRVDALST LGYVANWRFA LSDQSYFDHF SAPSPLLHTW SLAVEEQFYV LWPLAVYLLM WHSGRTTARW RNQRHKAQTL ILTVAVLGAE ASALLGLLLV VLGADASRIY YGTDTRAQAL LAGAALAVWR IQRRTPLTER TKKGLSVAGC LAGVAMLTVW ATVDGESRVL YSGGFLGVAV IVMLLIASIV EVPRGPAARV LSLPPLPTIG RVSYGLYLWH WPVFLTVTAA RTGLSGAALL GVRAAVTAAI TIASFHLVEN PIRRRKIRFP MPRVTVPASI CAVVALVMVA TIADPSAGRG AADLEQLAAR AATAAPPTAR AEPVTGRPLR AVLAGDSLAL TLGFSEFSTA AAREGLEIHD ASQLGCGVTR SPRRLLMGEV NLPPPGCDQW PTRLSRKVDE LRPDLAMLLV GRWEVTDQDL EGRRTHIGDP AFDAYLGREL DLAITTLSAR GAKVVLLTTP MFEEGEAADG SIYPETRADR VIQFNKLLRQ AAARHSAVTT VIDLAAALSP GNEYRGEIDG VRVRDDDGVH ISNGGGARAG QMVLPELLKL ARPATARGAD GPAPAPE
|
| |