Gene Franean1_2984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2984 
Symbol 
ID5671368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3509965 
End bp3511854 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content67% 
IMG OID641241888 
ProductAMP-binding domain protein 
Protein accessionYP_001507308 
Protein GI158314800 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGATG ACCTGCTCTG GCCAGCCTAT AACGGGCCAG CAGACTTGGC CGCGGTCGAG 
GCTGTGCCAT TTGAGGCACG CGGCCTTCCT GACTCGACCT ACACACTGCT CAAATATGCG
GCGGCGCAGT GGCCGGATCA CACGGCGCTC ATTGTCCTGC CCGAGGCCGC CCGCTGGCGG
GAGCCACTGC ACCGCAGCTT CATCGAACTC CTGGCCGATG TCCACCGTTA CGCGAACCTG
CTGTACAGTC TCGGCGTACG GCGCGGTGAC GCCGTCGCCC TGATGTCAGC CAACTGCGCC
GAACTGGTCG GTGCCACCCT CGCCGCGCAG CTCGCCGGGA TCGCGGCGCC GCTCAATGGC
AACTTGTCCT CGCCGCATCT CACCGAACTT CTCCGGCGCT CGGGTGCCCG AGTGCTGATC
ACCGCCGGGC CCGACCTCGC CCCGACCACC TGGATTACCG CACAGGCCCT TGCCGCGGAC
GGGATGCTCG ACGCCGTCCT TGCGCTCCGG CCTACAGCGG CGGTGGGCGC ACTCAAAGCC
CTGCCCGCCA TCGAGGGTGT GCGCATCGGC TACCTCAGCG AACTTGCTAC CGGCATGGAA
CCGTCCGCTT TCGACGGTGA GCCGCCCGGC TCGACGGACT TGGCCGCGCT GTTCCACACT
GGCGGCACCA CCGGCGCGCC GAAACTGGCC GCTCAGACGC ATGCCAATGA GATTGCCAAC
GCGTGGATGC TTGCCGCCGA CTCCCAGTTA GACCAGGACT GCGTGGTCTT CGCTGGGCTT
CCACTGTTCC ACGTCAACGC GCTCGTGGTC ACGGTACTTG CTCCCCTGTT CAAGGGGCAG
ACCGTGATGT GGGCTGGCCC GCTCGGCTAT CGCGACATCG AGCTGTACGG CGAGTTCTGG
AAGATTGTCG AGCACTACCG GATCGGGTGC ATGAGCGCGG TGCCCACGGT GTACGCCGTG
CTGGCGCAGT GCCCGGTCGA CGCCGACATC AGCAGCCTGC GGTTCGTGGC AACGGGCGCC
TCGCCGCTTC CCTCCGCGGT TCGAGACGAC TTCCAAGCAC ACACCGGGAT AGCCCTGGTT
GAGGGATACG GTTTGACCGA AGCGACTTGC GCGAGCGCAC GCAGCTTCGC GGATGGGCCG
GGGCCGGGCT CAGTAGGGCA GCGCCTGCCC TACCAGCGGG TGAAGGTGGT GAAGGTCGGC
CTGGACGGAG CGTGGGAAGA CCTACCCAAG GGCGAGATCG GCGTTCTGGC TATCAGCGGT
CCGACCGTCT TCCCCGGCTA CGTCACCGGC CACAACGAAC ACGGCCACCT CTTGGACGGG
CTCGGTAGGC TCAGCGACGG ATGGCTGGAC ACCGGAGACC TCGCGCGAGT TGACGAGGAC
GGCCTTATCT ACCTTGCGGG AAGAGCCAAG GACCTCATCA TCCGCGGCGG CCACAACATC
GACCCGACGA TCATCGAAGA TGCCCTGCTG GCTCACCCGC ACGTCACCGC TGCCGGAGCG
GTCGGGCGCC CCGACGTCCA TTCCGGCGAG GTTCCTGTCG CCTACGTGAC ACTGGTGCCC
GGTGCCGGGG TGACCGAGCA CGAGCTGCGG GATTGGGCAT GTAGGCAAGT ACTCGAGCGC
GCCGCCCAGC CGAAGGCCGT GATCATCCTC GAGGCGCTTC CCATAACCGA TGTCGGCAAA
CCGTACAAGC TCCCGCTGCG GGCAGATGCC ACCCGCAGAG AACTCCTGGC CGCGCTCAAC
GAGGTCGCCG GCGTTCGCGA CGTCGAGGCC ACCATCGAAG ACGGATCGAT CGTGGCGACC
GTCAAGGTCT CCCCATCTGC CGGCGAGGCA GCCGTCAAGG CGATCCTTGG CCGCTACGCG
ATCCGATGGC ACATGGTCAC GACATCATGA
 
Protein sequence
MSDDLLWPAY NGPADLAAVE AVPFEARGLP DSTYTLLKYA AAQWPDHTAL IVLPEAARWR 
EPLHRSFIEL LADVHRYANL LYSLGVRRGD AVALMSANCA ELVGATLAAQ LAGIAAPLNG
NLSSPHLTEL LRRSGARVLI TAGPDLAPTT WITAQALAAD GMLDAVLALR PTAAVGALKA
LPAIEGVRIG YLSELATGME PSAFDGEPPG STDLAALFHT GGTTGAPKLA AQTHANEIAN
AWMLAADSQL DQDCVVFAGL PLFHVNALVV TVLAPLFKGQ TVMWAGPLGY RDIELYGEFW
KIVEHYRIGC MSAVPTVYAV LAQCPVDADI SSLRFVATGA SPLPSAVRDD FQAHTGIALV
EGYGLTEATC ASARSFADGP GPGSVGQRLP YQRVKVVKVG LDGAWEDLPK GEIGVLAISG
PTVFPGYVTG HNEHGHLLDG LGRLSDGWLD TGDLARVDED GLIYLAGRAK DLIIRGGHNI
DPTIIEDALL AHPHVTAAGA VGRPDVHSGE VPVAYVTLVP GAGVTEHELR DWACRQVLER
AAQPKAVIIL EALPITDVGK PYKLPLRADA TRRELLAALN EVAGVRDVEA TIEDGSIVAT
VKVSPSAGEA AVKAILGRYA IRWHMVTTS