Gene Franean1_3146 details

Gene Information

Locus tag: Franean1_3146
Symbol:
ID: 5671523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3701255
End bp: 3702856
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 64%
IMG OID: 641242041
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001507461
Protein GI: 158314953
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
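
The start, end, and length fields above are consistent with each other if the coordinates are read as inclusive (an assumption here; the record does not state the convention). A minimal Python sketch of that check, using only values copied from this record:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3701255
end_bp = 3702856

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# One codon per residue; the final codon is a stop codon and is not counted.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1602 bp
print(protein_length, "aa")  # 533 aa
```

Both results match the Gene Length and Protein Length fields listed above.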


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.683024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGCTGT TGCCACGGTA CGGAACCAAT GGTACAAAAC AAACTATGGC GATGGACCAG 
GTGTCACACA CGAGGCTTGC GGCGGCGCAC TGGCCGGCAG CTCCCTCCCC GCACAGCCTC
GACGTTACGG TCGGGGATGC GCTACGCGAG GCGGCTCGTC GTCGGCCAGA CCGCATCGCG
CTGGTCGATG GAACCGAAGA TCGCGAAACG CGCCGGCAGT GGACCTACGC GGAGCTGCTC
GACACCTCTC TTCGCTGGGC TCGTGCTCTT CGCCGAGAGT TCGATCCCGG CGACCGCGTC
GCGGTGTGGG CCACGAACTG CCCGGAGTGG ATCCTCTTTC AGTTCGGTAC GGCCCTCGCT
GGCCTGACGT TGGTGACGGT CAACCCGGCC TATCGGTCCA GCGAGCTCGG GTTCGTGCTG
CGTCAGTCTC GGGCACAGGG GATCTTGGTC CAACGTGAGC TGCGAGGACG CGACTTACCG
GGTGTCGTGC ACGAGATTGC GGACCAGCTT CCGGAATTGC GTTGGGTGAT GCCCCTTGAC
CAATGGGTTG CGTATGTCGA GGAGGCTGAC CCAGGTGATG TCGAGACCGA TCTTCCGCCG
GTTCGCCCCG AGGACCCAGT GCAGATCCAG TACACCTCCG GGACCACGGG CTTTCCCAAG
GGCGCATATC TGGCGCACCA GGGGATGGCG CTCAATGCGC GCCTGTACGC GGAGGCCATC
GGAGCTTCGG AACGCGACAC CTGGGTGAAT CCGTTGCCAC TCTTTCACAC CGCCGGTTGT
GGGCTGGCGA CGCTGGGCAT CCTCCAGACG GGCGGCTGTC ACGTGCTGCC CCAGGGCTTC
GAGACGGATC TGATGTTCGA CCTGATCGAT ACTTACAAGG CAACGGTCAC GCTCGGCGTG
CCGACCATGT TCATTCGCAT GCTCGAGAAA CTGCCAACTG GCTCAATGCT CCTCGATTCA
TTACGCATCG TGACCACGGG CGGCGCTCCG GTGCCGGTCG AATTGGTCCG CCGGCTCGAG
AAGGAGTTCG GTGTCATGGT GGCAATCGGC TTCGGTCAGA CAGAGTCCTC GCCATACATC
ACCCACACCC GCCCGGGCCA GGATCTTCCT CACTGGGCCG AAACGGTGGG GCGTCCGCTC
CCGCGTGTCG AGGTTAAGAT CTCTCGTCCG GATGGGTCCG TGGCCGACGT GGACGAGGGC
GGCGAGATCT GCACTCGAGG TGTCTGCGTC ATGAAGGGAT ACTTCGAGAA CCCCGAGGCC
ACGTCCCAAA CCATCGACCA GAATGGCTGG CTGCACACGG GCGACGTCGG CACCATGGAC
TCGCACGGCT ACGTTCGCGT GCTGGGCCGC TTCAAGGACC TCATCATCCG GGGTGGGGAG
AACATCTACC CCCGTGACGT CGAGGCAGCG CTGTCCGAGC ACCCGGACGT CACCGACGTC
GCGGTCGTGG GCTTGCCGGA CGGCGAATGG GGCGAGATCG TCGGTGCGTT TGTGCAGACG
TCGAAGCCGC TCACCGCTGA CGGCCTCCAG GCCTTCCTAC GCGGCAAGCT GGCGAGCTAC
AAGATTCCGC AGGTGTGGAG GTTCCCGAAG GAGTTTCCGT AG
 
Protein sequence
MTLLPRYGTN GTKQTMAMDQ VSHTRLAAAH WPAAPSPHSL DVTVGDALRE AARRRPDRIA 
LVDGTEDRET RRQWTYAELL DTSLRWARAL RREFDPGDRV AVWATNCPEW ILFQFGTALA
GLTLVTVNPA YRSSELGFVL RQSRAQGILV QRELRGRDLP GVVHEIADQL PELRWVMPLD
QWVAYVEEAD PGDVETDLPP VRPEDPVQIQ YTSGTTGFPK GAYLAHQGMA LNARLYAEAI
GASERDTWVN PLPLFHTAGC GLATLGILQT GGCHVLPQGF ETDLMFDLID TYKATVTLGV
PTMFIRMLEK LPTGSMLLDS LRIVTTGGAP VPVELVRRLE KEFGVMVAIG FGQTESSPYI
THTRPGQDLP HWAETVGRPL PRVEVKISRP DGSVADVDEG GEICTRGVCV MKGYFENPEA
TSQTIDQNGW LHTGDVGTMD SHGYVRVLGR FKDLIIRGGE NIYPRDVEAA LSEHPDVTDV
AVVGLPDGEW GEIVGAFVQT SKPLTADGLQ AFLRGKLASY KIPQVWRFPK EFP
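
The gene and protein sequences above can be related by translating the nucleotide sequence with translation table 11, which uses the standard codon assignments and additionally permits alternative start codons such as GTG. The Python sketch below is illustrative only: the helper names (clean, gc_fraction, translate) are hypothetical, and it assumes the input begins at the annotated start codon, which is read as methionine. It is shown on the first row of the gene sequence; a complete coding sequence can be passed in the same way.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Assumption: the sequence starts at the initiator codon, so the first
    residue is reported as M even when the codon is GTG or TTG.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First row of the gene sequence from this record.
first_row = "GTGACGCTGT TGCCACGGTA CGGAACCAAT GGTACAAAAC AAACTATGGC GATGGACCAG"
peptide = translate(first_row)
print(peptide)  # MTLLPRYGTNGTKQTMAMDQ
```

The printed peptide agrees with the first 20 residues of the protein sequence in this record. Calling gc_fraction on the full gene sequence gives a value that can be compared with the GC content field.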