Gene Franean1_5942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5942 
Symbol 
ID5674263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7240173 
End bp7245209 
Gene Length5037 bp 
Protein Length1678 aa 
Translation table11 
GC content76% 
IMG OID641244790 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001510192 
Protein GI158317684 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I
[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3433] Aryl carrier domain 
TIGRFAM ID[TIGR01733] amino acid adenylation domain
[TIGR03494] salicylate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGCGC AAGGGAGTTA CCGCAGCAGG TCGGTCCCGC TCGACCAGGC GCCGCAGGCC 
GCCGCCGCGG GTCTCGCGCG CGGGCTGGCC GCGCCGTTCG TGCTCTACGA GCGCGACGGA
GAGTGGTCCT GCGGCTCGGG GGTGCTCGCC GAGGTCGTGC TGAGCGCCAC CGAGGTGAGG
CACCGGACGG GCAGCGGCGA CTGGCGGTCC GAGCCGACCG GAGCCACCCC GCTGCGCCAG
GTCGCGAATG TCCTCGGCGG CCTCGAGGCG ACCGGGTGGC GGGCCTACGG CTGGGCGGCG
TTCGAGCTCA GCCTGCTCCT GCAGGGCCAG CCGGTGCCGG CCGGCGACGA TCCGCTGCTG
CACGTGATCG TGCCACGGCG GGAGGTGCGG CTGATCGACG GGGTCGCCGT CCTGCGCGCC
GTCGACACCG CCGATCTGGC GGAGCTGGCC AAGCTGCTCG CCGCCGTGCC GAGCGGCCCG
CCGGCGGCCA GCGCCCCGCG CGCCGTCGTC GACCTCGAGC ACGGGGTCGA CGAGTACCGC
CGGATCGTCG CGGCGGCCGT CGCCGACATC CGCGCCGAAC GGCTGCGCAA GGTGATCCTC
TCCCGCGTCG TGCCCGTGGC GGGCGACGTC GACCTGGTGG CCACCTACGA GGTCGGCCGG
GCCGGGAACA ACCCGGCGCG GTCGTTCATC GTGGACACGG GCCACCTGCG GGCCACCGGG
TTCAGCCCGG AGACCGTCGT CGAGGTGTCC GCGGACGGCC TGGTGTCGAC CCAGCCGCTG
GCCGGCACCC GGGCGCTGAC CGGCGATCCC GCCGTCGACC GCGATCTGCG CGAGGTGCTG
CTCTCCGATC CCAAGGAGAT CTACGAGCAC GCTGTCTCCG TACAGGCCTG CCAGGACGAG
CTGCTCCAGG TCTGCCGGCC CGGGTCGGTG GTCGTGGACC AGTTCATGAA CGTGCTGCCC
CGCGGCAGCG TCCAGCACCT TGCCTCGCGG GTGGCCGGCC GGCTCGCCGC CGGCCGCGAC
GCCTGGGACG CACTGGGCGC GGTCTTCCCC TCCATCACCG CCTCCGGCGT GCCCAAGCCG
GTCGCGTGCG CGGTGATCGG CGACTACGAG GGGGAGCTCC GCGGCCTGTA CAGCGGCGCG
GTGCTGATCG CTGACTCCGA CGGCGGGCTC GACGCGGCGC TGGTGCTGCG CACGGTGTTC
CAGCAGGACG GCCGGACCTG GCTGCGCGCC GGGGCCGGCA TCGTCGGCCA CTCCGACCCC
GACCGCGAGG CGGAGGAGAC CCGGGAGAAG CTGCGCAGCG TCAGCCGTTT CCTGGTCGCG
GCGCCCGCCG CGCCGGCCGC CCCGGCTGTT CCCGCCGCGC GGGCGGGCAC CCCCGGGTAC
CAGCTCGAGG ATGTCCGCCG CATCGTGGCC GAGCTGATCG ACGAGGATCC GTCGGCGATC
GGCAACGAGG CGAACCTGTT CGAGCTGGGC CTGGAGTCCA TCGCCCTGAT GAAGGCCGTC
GGCCGCTGGC GCCGGGCCGG GATGGCGGTC TCGTTCGCCG AGCTCGCGGA GAACCCGACC
GTCGACGGCT GGTGCAAGCT CCTGTCGATG CGGGCGCCCA CCGAGCCCGC CGGCGCGGAC
GACGGCGCGC AGGCCCGCGC GGAGGCCGGT GAGTTCCCGC TCGCGCTCAT GCAGCATGCC
TACTGGGTCG GCCGGGACGG CGGGCAGCCG CTCGGCGAGG TCGCGGCGCA CCTCTACACC
GAGTTCGACG GCGCGGGAGT CGACGTCGAC CGGCTGCGCA CGGCTATCGA GGCGCTCGTC
GCGCGGCACG ACATGCTGCG CGTGCGGATC ACGGACAACG GGAGCCAGGT CGTCGAGGCG
ACCTCGGGCT GGCGCGGTCT CACCGTGCAC GACCTGCGCG AGCTCGGCCA GGCCGAGACC
CAGGCCCGGC TCGCCGCGGT GCGGGACCGG ATGTCGCACC AGATGCTGGA CATCGAGCAC
GGCGAGGTGT TCGCCACCGC GCTGAGCCTG CTGCCGCAGG GCCGGACCAG GCTCCACCTG
GACGTCGACA TGGTCGCGGC CGACGCCGTC AGCTACCGGG TGCTGCTCAA CGACCTCGCG
CGGTTCTACG ACCGTCCCGG CGAGGAGCAC CCGCCGCTGG GCTACACCTA TCGCGAGTAC
CGCGCCGCGC GGGTCCCGGC CAGGCGCGCC GCCGCGCGCG CGGCCGCCGA GTGGTGGCAG
GGCCGGCTGC CGGGCCTGCC CGGCGCGCCC GGCCTGCCCC GCCTGGCCTC GCCCGACGAC
GCGGGCACCG CCTCGAGCGC CGGCGCCGCG AGCACGGCGG GCGCTACCGG CTCCACCGGC
GAGCCGCCGC GGGTGACGCG GCGTCACTTC GTCCTGGGGC CGTCGGCGCG CCAGGCGTTG
CAGCGCGCCG CCCACAGCCG GGGCGTCACC CCGGCGATGG CGGTGGCGAC CGCGTTCGCC
GAGGTGCTCG CCGGGTGGAG CACCGAGTCC CGGTTCGTCC TCAACGTGCC GATGTTCGAC
CGCGACCAGG TGCACGCGGA CGTCAACCAG GTGGTCGGGG ACTTCACGAG CTCGGTGCTG
CTCGAGGTCG ACCTGGCCGA GCCGCAGCCG TTCGCGACCC GGGTGCGCCA GGTCCAGGCC
CGGCTGCACG CCGACGCGGC GCACGCGGAC TACTCCGGTG TGGAGGTGCT GCGCGACCTG
ACCCGCCGCA CCGGCGAGCA GGTGCTGGCG CCCGTGGTGT TCACCAGCGC GCTCGGGCTC
GGGGAGCTGT TCGGCCCCGG GGTCCGCCAG CACTTCGGCG ACCCGGTGTG GATCATCTCG
CAGGGCCCCC AGGTGCTCCT GGACGCGCAG GTCACCGAGC TGGACGGCGG CCTGCTCGTC
AACTGGGACG TCCGCGACGG CGAGTTCGCC CCGGGCGTGG TCGACGCGAT GTTCGGCGCC
TTCGAACGGC TGGTGCGCGG CCTCGCCGAC ACCGCGGGCA CCTGGGACAC GGCGGTCGAC
GGCCTGGTGC CGGAACCGGC CCGTGCCATC CGCGCCGCGG CCAACGACAC CGCCGGGCCG
GTGCCGACCC GGCTGCTGCA CGAGGGGTTC TTCGAGAACG CCGTGCTGGC CCCCGACGCC
CCCGCGCTGC TGTGGGACAC CGCCGGCGGC CCGGGTTCGC TGGCCTACGG CGAGCTGCGG
CGCCGGGCGC TGGGGCTGGC CGGCGCGCTG GCCGGCCACG GCGTGCGCCG CGGCGACCTG
GTCGGAGTCA GCCTGCCCAA GGGGCCGTCC CAGGTGGTCG CCGTCCTCGG CGTCCTGGCG
GCGGGCGCGA CGTACGTGCC GGTCGGCATC GAGCAGCCCG CCGCCCGGGT GGAGCGGATC
GCGGCCGCCG CCGGGTTCGG GGTGCTCATC ACCGAGTCGC ACCGCGACGG CGTGCCCGCC
GGGGTCGTCC AGCTGGCGCC GGACCAGCCC GCCGAGCCGG CGCCGGTACC GGATCTCGCC
GCCCCCGGCG AGCTCGGGGG GCTGGACCGG CCGGCCTACG TGCTCTTCAC CTCCGGATCG
ACCGGTCAGC CCAAGGGCGT GGAGGTCGGG CACCGGGCGG CGATGAACAC GATCGCCGAC
CTGATCGACC GGCTGGGCCT GGGCACCGAC GACCGGACGC TCGCCGTGTC GGCGCTGGAC
TTCGACCTCT CGGTGTTCGA CATCTTCGCC CCGCTGTCGG CCGGCGGCGC CGTGGCGCTC
GTCGACGAGG ACTCCCGCCG GGAGGCCAGC CGGTGGGCGG AGCTGATCCG CGACCACCGG
GTGACCGTCC TCAACTGCGT CCCCACGGTG CTGGACCTCG TCCTCGCCGC CGGGGTGGCG
CTCGGGGACA GCCTGCGGGC GGTCCTGCTC GGCGGCGACA AGGTGGGCGT GGACCTGCCG
GGCCGGCTCG CCGCGGCGGT GCCGGGCTGC CGGTTCCTGG GCCTGGGCGG CACGACTGAG
ACGGCCATCC ACTCGACGAT CTGCGAGGTG GAGGGGGCGT CCCCGCTGCC GCCGCAGTGG
CGCTTGGTCC CCTACGGGAC GCCGCTGCGC AACGTCCGGC TGCGGGTGGT CGACCCGCTC
GGCCGAGACT GCCCCGACCA CGTGGCCGGG GAGCTCTGGA TCGGCGGTGA CGGCGTCGCC
CGCGGCTACC TGGGCGATCC CGAGCGCACC GCGGACCGCT TCGTCGAGCA CACCGGCATC
CGGTGGTACC GGACGGGTGA CATCGCCCGG TACCTGCCGG ACGGCACGGT GGACTTCCTC
GGCCGGCGCG ACGACCAGGT GAAGATCCGC GGCTTCCGGG TCGAGCTGGG CGAGGTGGAG
GCAGCCCTGA CCACGCTGCC GGAGGTCCGG GCGGGGGTGG CGGTGCTCGT GCGCGGCGCG
TCCGGGCGGT CCGCCGTGCT CGGCGGGGGC GTCGTGCTCG GCGCCGGCGT CGTGCCGGCC
ACCCCCGCCA CGGGCGAGGC CGACGACGGC GGGGGCGGCG GCGAGAGCTC CGGCGACGGC
GCGGGCATCG CCGGCGCCGT GCGCGAGGGG CTGCGCCGCG CGCTGCCGCC GCACATGGTC
CCGGACCTGG TGGTCGCGCT CGACAGCCTG CCGCTCACCG CCAACGGGAA GATCGACCGC
CGGGCGGTGA CGGCCGCCGT CGAGCGGGCG GTGGCCGGCC GGGCGGCCGA CCACGCACCG
CCGCGCACCG ACCTGGAGCG GGTCGTGCAC AACGTCTGGC GCGAGGTGCT CGGGGTGGCC
GAGTTCGGGA TCACCGACGA GTTCTTCGCG CTGGGTGGTG ACTCCGTCCT CGCCACCGCG
CTCGTCACGC GGCTGCGCGA CGAGCTGGAC ACCGCCGCCG TCACCGTGCG CTCGGTGTTC
GGGGCGCCGA CCGTCGCCGC GCTCGCCGAG CGGATCCGCG CCGCCGACAC CGTCCCCGGC
CGGGCGGAAC GGGTCGCCGC GATCGCACTG GAGATCGCGG CGATGTCGGA CGACGAGGTC
GCGGCCGAGC TGGTCGACCC GGACACGCTC CCCGCCGACT CCGGCGGTGC GTCGTGA
 
Protein sequence
MRAQGSYRSR SVPLDQAPQA AAAGLARGLA APFVLYERDG EWSCGSGVLA EVVLSATEVR 
HRTGSGDWRS EPTGATPLRQ VANVLGGLEA TGWRAYGWAA FELSLLLQGQ PVPAGDDPLL
HVIVPRREVR LIDGVAVLRA VDTADLAELA KLLAAVPSGP PAASAPRAVV DLEHGVDEYR
RIVAAAVADI RAERLRKVIL SRVVPVAGDV DLVATYEVGR AGNNPARSFI VDTGHLRATG
FSPETVVEVS ADGLVSTQPL AGTRALTGDP AVDRDLREVL LSDPKEIYEH AVSVQACQDE
LLQVCRPGSV VVDQFMNVLP RGSVQHLASR VAGRLAAGRD AWDALGAVFP SITASGVPKP
VACAVIGDYE GELRGLYSGA VLIADSDGGL DAALVLRTVF QQDGRTWLRA GAGIVGHSDP
DREAEETREK LRSVSRFLVA APAAPAAPAV PAARAGTPGY QLEDVRRIVA ELIDEDPSAI
GNEANLFELG LESIALMKAV GRWRRAGMAV SFAELAENPT VDGWCKLLSM RAPTEPAGAD
DGAQARAEAG EFPLALMQHA YWVGRDGGQP LGEVAAHLYT EFDGAGVDVD RLRTAIEALV
ARHDMLRVRI TDNGSQVVEA TSGWRGLTVH DLRELGQAET QARLAAVRDR MSHQMLDIEH
GEVFATALSL LPQGRTRLHL DVDMVAADAV SYRVLLNDLA RFYDRPGEEH PPLGYTYREY
RAARVPARRA AARAAAEWWQ GRLPGLPGAP GLPRLASPDD AGTASSAGAA STAGATGSTG
EPPRVTRRHF VLGPSARQAL QRAAHSRGVT PAMAVATAFA EVLAGWSTES RFVLNVPMFD
RDQVHADVNQ VVGDFTSSVL LEVDLAEPQP FATRVRQVQA RLHADAAHAD YSGVEVLRDL
TRRTGEQVLA PVVFTSALGL GELFGPGVRQ HFGDPVWIIS QGPQVLLDAQ VTELDGGLLV
NWDVRDGEFA PGVVDAMFGA FERLVRGLAD TAGTWDTAVD GLVPEPARAI RAAANDTAGP
VPTRLLHEGF FENAVLAPDA PALLWDTAGG PGSLAYGELR RRALGLAGAL AGHGVRRGDL
VGVSLPKGPS QVVAVLGVLA AGATYVPVGI EQPAARVERI AAAAGFGVLI TESHRDGVPA
GVVQLAPDQP AEPAPVPDLA APGELGGLDR PAYVLFTSGS TGQPKGVEVG HRAAMNTIAD
LIDRLGLGTD DRTLAVSALD FDLSVFDIFA PLSAGGAVAL VDEDSRREAS RWAELIRDHR
VTVLNCVPTV LDLVLAAGVA LGDSLRAVLL GGDKVGVDLP GRLAAAVPGC RFLGLGGTTE
TAIHSTICEV EGASPLPPQW RLVPYGTPLR NVRLRVVDPL GRDCPDHVAG ELWIGGDGVA
RGYLGDPERT ADRFVEHTGI RWYRTGDIAR YLPDGTVDFL GRRDDQVKIR GFRVELGEVE
AALTTLPEVR AGVAVLVRGA SGRSAVLGGG VVLGAGVVPA TPATGEADDG GGGGESSGDG
AGIAGAVREG LRRALPPHMV PDLVVALDSL PLTANGKIDR RAVTAAVERA VAGRAADHAP
PRTDLERVVH NVWREVLGVA EFGITDEFFA LGGDSVLATA LVTRLRDELD TAAVTVRSVF
GAPTVAALAE RIRAADTVPG RAERVAAIAL EIAAMSDDEV AAELVDPDTL PADSGGAS