Gene Franean1_4529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4529 
Symbol 
ID5672878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5402982 
End bp5404571 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content72% 
IMG OID641243394 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001508810 
Protein GI158316302 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCC CGATCCCATC GTGGGGACGC GATGTCACTG TCGAACAGGT AGGCGGCGTT 
CCGTTCCGAA TGTACGAGCC GCGGGCGCGG AGGCTGGAGT CCCTGCTGGA CCATGCCGAC
CGCTGGGCGG GCCGGGCCCA CGCCGTGCAG GGCGACGTGC GGCTCGACTT CGGCGACCTC
GTGCCCGCGG TCGGTCACAA GGCGGCCCAG CTGAGGCAGC ACGGGGTGGG CCGTGGGGAC
CGGGTCGCGC TGCTGGGCTG GAACAGCCCG GACTGGGTGG TGAACCTCTG GGCCACCTGG
TGGCTCGGGG CCGTCCCGGT CCTGGTCAAC GCCTGGTGGA GCACCCGAGA GATGGAGCAC
GCCTTCACGG CGCTCACACC GGTCGCCGTG CTCGCCGACC GCCGGCTGGA GCACAAGGTG
CCAGCCGGGA GTCCCGTCGC GCCGTGGCTG ATGGCCGACG CGGTCGACGG TGCGCCGCCT
GAGCGTCCCG ATAGCGACGA CGAGAACGAG CCCGCCCTGA TTCTGTTCAC CTCCGGCAGC
ACCGGATTCC CCAAGGCGGT CGTGCTCTCG CACCGCTCGA TCATCTCCGG CCTGCACTCA
CTGCTGAGAA TCACCAAGCG GCTTCCGCAG GAACTCGAGG GCGCGGCGCC CAGCGTCGCC
CTGCACACGG GGCCGCTGTT CCACATCGGC GGCGTACAGA CCCTCGTGCG AGGGGTCGTG
GTCGGCGAGA CCCTGGTGTT CCCGGAAGGC AAGTTCGACG CAGATGCGGC GATGGACCTC
ATCTCCGCGC ATGGTGTCAC CCGCTGGAGC GCGGTCCCCA CCATGGTCAG CCGGCTGCTC
GACGCCCAGG CACAGCGGCC GGTCGACCTG CACAGCCTGC GGTCACTGAC ACTGGGCGGC
GCCCCCGCCC ACCCCAGTCT GTACCAGCGG ATCCGCGACG AGCTGCCGTC CGTCCAGGCC
CGCGTGGCCA CGGGTTACGG CCTCACCGAG AACGGTGGGC AGGCCACCGC GGCCAGCGGC
CGCGACACCC GTGACCGGCC CGGCTGCTGC GGCCTGCCGC TACCGACCGT CGAGATCTCC
TTCGGTGACC GGACGCCGGG CGGGGACGGC GAGGTCCTGC TCCGCGCCGC GTCGCAGATG
CTCGGCTACT ACGGGGAGGC GTCCGGCCCC ATCGACGCCG AGGGGTGGCT GCACACCGGT
GACCTGGGCT ACCTCGACGA GGACGGCTAT CTCTGGGTGA CCGGTCGCAG CAAGGATCTC
ATTCTGCGTG GGGGCGAGAA CATCGCGCCG CTGTCGGTCG AGCGCGCGCT GGTCGGCGTG
CCCGGCGTTC TCGACGCGGC GGTGGTCGGC CTGCCGCACG TCGACCTGGG GGAGGAGGTC
GCGGCGGTCG TGGTGGTCGA CGAGGCCACC GCGGGCCGGC CGGATCTCGC GGAGTACGTC
ATCGAGGTCC TGCGCTCGGA CCTGGCGTCC TTCGCGATCC CGACCCGGTG GCGCTTCCAG
ACCGAGGAAC TGCCGGTGCT CGGGTCGGAG AAGATCGACA AGCACGCGCT CGCCGCGGAG
TTCGCCGCCG AGACCGCCGC TTCCCGGTGA
 
Protein sequence
MSGPIPSWGR DVTVEQVGGV PFRMYEPRAR RLESLLDHAD RWAGRAHAVQ GDVRLDFGDL 
VPAVGHKAAQ LRQHGVGRGD RVALLGWNSP DWVVNLWATW WLGAVPVLVN AWWSTREMEH
AFTALTPVAV LADRRLEHKV PAGSPVAPWL MADAVDGAPP ERPDSDDENE PALILFTSGS
TGFPKAVVLS HRSIISGLHS LLRITKRLPQ ELEGAAPSVA LHTGPLFHIG GVQTLVRGVV
VGETLVFPEG KFDADAAMDL ISAHGVTRWS AVPTMVSRLL DAQAQRPVDL HSLRSLTLGG
APAHPSLYQR IRDELPSVQA RVATGYGLTE NGGQATAASG RDTRDRPGCC GLPLPTVEIS
FGDRTPGGDG EVLLRAASQM LGYYGEASGP IDAEGWLHTG DLGYLDEDGY LWVTGRSKDL
ILRGGENIAP LSVERALVGV PGVLDAAVVG LPHVDLGEEV AAVVVVDEAT AGRPDLAEYV
IEVLRSDLAS FAIPTRWRFQ TEELPVLGSE KIDKHALAAE FAAETAASR