Gene Franean1_5429 details


Gene Information

Locus tag: Franean1_5429
Symbol:
ID: 5673760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6567871
End bp: 6569448
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 73%
IMG OID: 641244284
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001509690
Protein GI: 158317182
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
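
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are inclusive and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 6567871
end_bp = 6569448

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1578 bp) and Protein Length (525 aa).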


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.308842
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCGA CCGGCGACGA GATCCGACCG ACCATCCCTG ACCTGTTGGG GCGCTGCGCT 
CGGGAGTTCG GCTCCGCCGA CTACATCGTC TCCCTCACCG ACCGGCTGAC CTACGCCGAG
GCGGAGGAGC AGTCCGCCCG GGTGGCCCGG TGGCTGTTGC ACGAGGGCGT CGGTAAGGGC
ACCCGGGTGG GCCTGTTCTT CCCCAGCGGC GTCGAGTGGG CCCTCTGGTG GCTGGCCGTG
AGCCGGATCG GGGCCGTGGC CGTTCCGCTC AGCACCCTGT ACCCGCCGGC GGAGATCGCC
AAGGTCGTGC GGCTGGCCGA TGTGCAGCTC CTGGTGGCAC CGACGACCGT GCTGCGGATC
GACGTCGCGC AGCGGTTCGA GGCGGCGTTC CCCGAGCTGG CCGGGCAGCT GGCCGGGCAG
CCGGCCGGCC AGCTCGAGCT GGCGGGCGCG CCGTACCTGC GGCGGATCGT GCTAACCGGC
CAGACGGACC GGGGCTGGGC CACCCGGTGG GATCCGCGGG ACCCGCCGCT GGTGCGGGCC
GAGCTGCTCG CCGCGGTGCA GACCGAGGTC ACCCCGGCCG ACCTGGCGAT CATGGTTCAC
ACCTCCGGTT CCACCGCCGA CCCGAAGGGC GTGCTGCACA CGCACGGCAC GCTGGTGCGC
CAGACCTCCA CCTGGCCGGC GGCGATCCGC GGGCTCACCG GCGTCGACCA CGCGCCGCGC
ATCCTGTGTG CCATGCCGTT CTTCTGGATC GGCGGGATCC TGGCCGCGAC CGGAGCTCTG
CACGCACCCG TCGCGGTCCT GGTGCTGGCG CGGCTGGAAG CCGGGCCGGC CCTCGACCTC
GCCGAACGGG AACGGGCGAA CGGCGTCGTC GGATGGCCCG CGTTCACCCA GCAGCTGCGG
CTGCACCCGT CCTTCCCCAG CCGGGACCTG CGCAGCGCCC CCGCGCTGCG GGAGGGGCCG
GTGGACCTCG CGATGGCGGG CGTCCCGGAC GGCCATCCGA TCCACCGCAG CCTGACCGAG
TCCGGCGGCA GCTTCGCGTT CACCGAGACC GCGATCGTCG ACGCCGCCGG CGAGCGCGTC
CCGGACGGGA CCGTCGGTGA GCTGCTCATC CGTGGCATCG GCTCGATGGC CGGCTACAAC
AAGCGTGAGC GCGCGGAGGT CTTCGACGCG GACGGCTGGT ACCACACCAG CGACCGCGTC
TACCGCAGGA CGGGCGACCC GCGGCTGTTC TACGTCGGCC GGGACAGCGA GCTCGTCAAG
GTCGCCGGTT CGAACGTGGC ACCGCGCGAG GTCGAGGCCG TCATCGAGGA GTTCCCCGAG
GTCGCGCACT GTGTCGTGAC CGGTGTCGAG CATCCGACCC GCGGCGAGGA GGTGTGCGCG
GTCATCGTTC CGGCCGGCAC GACCGGCACG GACGTCGACG TGGACGGTCT GGCCGCGCGC
ACCCGTACGC TCCTGTCCAG CTACAAGGTT CCGACCCGGT GGATCGTCGC CGCGGACGAC
GAGGTGCCGG CCCTGCCGAG CGGCAAGCCG GACCGCCGCG GCCTGCGCAC ACTGATCGAG
GACGGCCGAC TGAAGTAG
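
The gene sequence is printed in space-separated blocks of ten bases, so whitespace has to be removed before any computation on it. As a rough sketch, GC content could be computed as below. The helper `gc_fraction` is hypothetical, and it is an assumption that the record's GC content figure was derived in the same way.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two blocks of the gene sequence, used here only as a short example.
example = "ATGACGTCGA CCGGCGACGA"
print(f"{gc_fraction(example):.2f}")
```

Passing the full gene sequence to the same function gives a value that can be compared with the listed GC content.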
 
Protein sequence
MTSTGDEIRP TIPDLLGRCA REFGSADYIV SLTDRLTYAE AEEQSARVAR WLLHEGVGKG 
TRVGLFFPSG VEWALWWLAV SRIGAVAVPL STLYPPAEIA KVVRLADVQL LVAPTTVLRI
DVAQRFEAAF PELAGQLAGQ PAGQLELAGA PYLRRIVLTG QTDRGWATRW DPRDPPLVRA
ELLAAVQTEV TPADLAIMVH TSGSTADPKG VLHTHGTLVR QTSTWPAAIR GLTGVDHAPR
ILCAMPFFWI GGILAATGAL HAPVAVLVLA RLEAGPALDL AERERANGVV GWPAFTQQLR
LHPSFPSRDL RSAPALREGP VDLAMAGVPD GHPIHRSLTE SGGSFAFTET AIVDAAGERV
PDGTVGELLI RGIGSMAGYN KRERAEVFDA DGWYHTSDRV YRRTGDPRLF YVGRDSELVK
VAGSNVAPRE VEAVIEEFPE VAHCVVTGVE HPTRGEEVCA VIVPAGTTGT DVDVDGLAAR
TRTLLSSYKV PTRWIVAADD EVPALPSGKP DRRGLRTLIE DGRLK