Gene Franean1_1665 details


Gene Information

Locus tag: Franean1_1665
Symbol:
ID: 5670067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1990044
End bp: 1992857
Gene Length: 2814 bp
Protein Length: 937 aa
Translation table: 11
GC content: 72%
IMG OID: 641240583
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001506009
Protein GI: 158313501
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
        [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
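
The coordinate and length fields above are consistent with one another, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1990044
end_bp = 1992857

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 2814 bp
print(protein_length, "aa")  # 937 aa
```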


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.175798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.265588
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGGAA CGCATCCGTT GGCCGTGCTG GCGGCGAATC TCGCGGAACC CGACCCGACA 
ATCGTGGGGC GGTTCTCTCG GATCGTCACA GCGCGTGCCA ACGACATTGC GGTACGCGAT
GAGAAAGCTG CACTGAGCTA TGCCGAGCTG GACTCGCGTT CATCCGCGCT GGCTCGTTCG
CTTGTCGTCG AGGGGGACGG CGGCAACATC GGGATCCTGC TGGGGCAGGG AGCGCCCGCG
ATCGTCGCGA TGCTGGGCGC GCTGAAGGCC GGGCGGCCGT TCGTGCCGCT CGACCCGATG
CTTCCCGCGG CGCGGCTGGG GCAGATCCTG CGGCTGGCGG GGGTCGCCAC CTGCGTGACC
GACAGCGCGC ACACCGAGCT GCTCGCCGCC GCCCGGCTGG AAGCGGCCGA CACCGGTCCC
GCGCCTGGTC CCGAGCCCGG AACGGTGCCG GGCTCGGAGC ACACCCTGAT CATTGACGAC
GGGCCGCCGG CCGGTACCGC CGAGATCGAC GACGACCTGC TGCCGGGCCG TCGGGCGCTG
CCCACCGACC CAGCCTTCCT GGTGTTCAGC TCCGGCTCCA CCGGCGTCCC GAAGGGCGTC
GTCTGGCGGA ACCGGACCGT GATGAAGGAC CTGGACGCGG GCATCGAGCG GGTCGGGATG
AACGCCGCCG ACCAGATCGC CCTCGTGCTG CCGACCGCGT TCGCCGCCGG CATCACCGTC
ATGTTCTGGG GGCTGCTCTG CGGCGCCACG CTGCACCCGT TCGACCCGCG GGCCCGCGGG
ATCGGCGCGA TGCCTGCCTG GCTCGTCGAC CGCGGCATCA CCACGCTGCA CCTGACCCCG
TCGCTGATGC GCGCGCTGGC CGGCGCCGCC GAGCCCGGGC TGATCCTGGG CGACCTGCGC
GCGGTCACCA GCTCCGGTGA GGCCGTGTAC GGCCGGGATG TCGCCGCTCT GCGCGAGCTG
CTGCCTACAA CGTGCACGTT CTACAACTGG TCCGGCTCGA CCGAGACCGC CTCGCTCGCC
TTCTTCCCGG TCGGATCAGG CGACGAGATT CCGGCCGGTC CACTGCCGGC GGGATGGGCC
GTCGACGGCA AGGACATCGA GATCGTCGAC GAGCACGGCA AACCCGTACC GGATGGTGCC
ACCGGCGAAA TATCAGTAAC GTCAAGGTAC CTTTCCGGCG GCTACTGGAA CGCCCCCGAG
ATGACGGCGC AGCGATTTCG ACCGGTTCGC CTGAGTGTCG ATTCTCCGGC GGACGGCCTG
CGGGATTCCG GTGCGCTGGT TTCCGGTGAC GAAGCCCTGG CGGGCCCGGC GAGCCCGGCG
GACTCCGCGG GCCCGGACCA CGACCTGCCC GTCTCCGGCG TCACCGTCAC CGGTGAGATC
GTCCTCGGTG ACCTCGGGTC CCACGAGGTC GTGTACCGCG GCGGGGATCT GGCCCGGCGA
CGCCCGGACG GGTGCATCGA ACTACTCGGC CGCGCCGACG CCGCGGTGAA GATCCGTGGT
TACCTGGTCG AGCCCGCCGA GATCGAGACC GTCCTGCTCA CCTCGCCGGA CGTCCTCGAG
GCCGTCGTCG TCGCCGAGCG CGTCGAGGGC GAGCCTCCGC GGCTGGTCGC CTACGTCGTG
TGCGCCACCG CGGTGAAGTC GGCGACCGTG GCGATCCGCG GCCTGCTGCG GGAGAAGCTG
CCCGCCTACA TGGTTCCCTC CTCGGTGATC CTGCTCAACG AGCTGCCCCG CAACGAGCGC
GGCAAGATCG ACCGGCCCGC GCTGCCGTCG CCGCCGCCCC GGCCTCCCGC CCGCCGTCCC
ACCGGGCGCA GCCACTGGGA GTTCGCGCTG TGCGACCTGT TCGCGCAGAT CCTGAAGGTC
GATGAGGTCG GCGTCGACGA CGACTTCCTC GAGCTCGGCG GGGACTCGCT GCTCGCCCAG
CAGCTCCTCA GCGAGATCGC CACCCGTCTC GGCGTGACCC TGCCGACCTC GGTGCTGGTG
GAGGCGCCGA CCGTCGCCGC CCTCGCGGCC CGGGTGGCAG GTTCCCAACG GGGCATCCCG
ACCCATCCGA CCTGCGTGCC GCTCAACGTC GACGGCTCGC GTCCGCCGCT GTTCTGCGTC
GCCGGCGCGG GCGGGCTCGC CATCAACTTC CTCGGGCTCT CCCGCCTGCT CGGCTCCGAC
CAGCGGGTCT ACGGCCTGCA GGCGCAGGGA ATGGAAAGCC GGGCGCTGCC CGACTGGAGC
GTCGAGCGCC ACGCCCGGCG CCACCTGGCC GTGCTCCGCG TGATCCAGCC GACCGGCCCT
TACTACCTCG CCGGCTTCTC GTTCGGCGGG CTGGTCGCCC TGGAGATGGC GCACATGCTC
GCCGCCGCCG GCGAGGAGGT GGGGATGCTG CTGCTGCTCG ACACCACGCT GCCGCGCTCG
GCTCAGTCCG CCGGCGGCGC GCGTCCCAGC GGCGACCCCA GTGGCGGGGG ACGCATCACC
CGGCTGCTGC CGGACCGGCT CCCGCTGCCG AACGCGTCCA AGCTACGCAA GGCGGCACGC
CTGCCGCTGA CCGGGCTCGT CCGCTACCCC GGGCTCGTCC AGTTCGACGT GTTCTTCGAC
CAGGCCCGGT TCATCACCCA GTCGTACCGG GTCCGGCCGT ACGCCGGGCG GACCGTCCTC
TACCTCGCCG AGGACAACCC CAAGTCCGGT CGGGACGAGT GGCCCACGCA CCTGACCGGT
GACTCCACCA TCGTCACCGT CCCCGGCGAG CACCACACGA TGCTCAACGA GCCGAACGTC
TCGGTGCTCG CCGCGGACAT GCGCGCCCGC CTCGGAGAGT CGATGCAGCT CTAG
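
The GC content field of the record can be recomputed from a sequence laid out in 10-base blocks like the one above. The sketch below is one way to do this, not the database's own method. The helper name `gc_content` is hypothetical, and it assumes the input contains only base letters and whitespace.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to group bases into blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste the full sequence to
# compare the result against the GC content field of the record.
first_row = "ATGGTCGGAA CGCATCCGTT GGCCGTGCTG GCGGCGAATC TCGCGGAACC CGACCCGACA"
print(f"{gc_content(first_row):.1f}%")
```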
 
Protein sequence
MVGTHPLAVL AANLAEPDPT IVGRFSRIVT ARANDIAVRD EKAALSYAEL DSRSSALARS 
LVVEGDGGNI GILLGQGAPA IVAMLGALKA GRPFVPLDPM LPAARLGQIL RLAGVATCVT
DSAHTELLAA ARLEAADTGP APGPEPGTVP GSEHTLIIDD GPPAGTAEID DDLLPGRRAL
PTDPAFLVFS SGSTGVPKGV VWRNRTVMKD LDAGIERVGM NAADQIALVL PTAFAAGITV
MFWGLLCGAT LHPFDPRARG IGAMPAWLVD RGITTLHLTP SLMRALAGAA EPGLILGDLR
AVTSSGEAVY GRDVAALREL LPTTCTFYNW SGSTETASLA FFPVGSGDEI PAGPLPAGWA
VDGKDIEIVD EHGKPVPDGA TGEISVTSRY LSGGYWNAPE MTAQRFRPVR LSVDSPADGL
RDSGALVSGD EALAGPASPA DSAGPDHDLP VSGVTVTGEI VLGDLGSHEV VYRGGDLARR
RPDGCIELLG RADAAVKIRG YLVEPAEIET VLLTSPDVLE AVVVAERVEG EPPRLVAYVV
CATAVKSATV AIRGLLREKL PAYMVPSSVI LLNELPRNER GKIDRPALPS PPPRPPARRP
TGRSHWEFAL CDLFAQILKV DEVGVDDDFL ELGGDSLLAQ QLLSEIATRL GVTLPTSVLV
EAPTVAALAA RVAGSQRGIP THPTCVPLNV DGSRPPLFCV AGAGGLAINF LGLSRLLGSD
QRVYGLQAQG MESRALPDWS VERHARRHLA VLRVIQPTGP YYLAGFSFGG LVALEMAHML
AAAGEEVGML LLLDTTLPRS AQSAGGARPS GDPSGGGRIT RLLPDRLPLP NASKLRKAAR
LPLTGLVRYP GLVQFDVFFD QARFITQSYR VRPYAGRTVL YLAEDNPKSG RDEWPTHLTG
DSTIVTVPGE HHTMLNEPNV SVLAADMRAR LGESMQL
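
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. It does not handle alternative start codons, which is harmless here because this gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        # Raises KeyError on characters other than A, C, G, T.
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGTCGGAA CGCATCCGTT GGCCGTGCTG"))  # MVGTHPLAVL
```

The printed residues match the first block of the protein sequence above.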