Gene Franean1_4313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4313 
Symbol 
ID5672668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5152450 
End bp5154045 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content72% 
IMG OID641243186 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001508603 
Protein GI158316095 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACGA TCCCACCTGA GCTGGTCAAA CGGTACGAGA ACGAGGGGTG GTGGACACCC 
GAGACGCTGG GCGACCTGCT CGCCGCCGGC CTCGACGCGA CCCCCGGCGC CGAGTTCCGC
GTGCACTCGG CGGTCCGGCC GTGGTCCGGG ACGTTCCGTG ATGTCGAGCA GGTCGCCCGC
CGGCTCGCCG CCGGCCTGCG CGCCCGCGGT GTCGGCCCCG GTGACGTCGT CGTCTTCCAG
CTGCCGAACT GGATGGAGGC GGCCGCCGTC TTCTGGGCGT CGTCCTTCCT CGGGGCGGTT
GTGGTCCCCG TCGTGCACTT CTACGGGCGC AAAGAGCTCG GCCACATCCT GGAGCGGACC
TCACCCAAGG TCTTCGTCAC CGCCGAGGGC TTCGGGCGGA TGGAGTACCA GCCCGACCTG
AGCCGGGACG TCCCCGTCGT CGGCGTGGTC GGCCGCGACT TCGACGAGCT GCTCGCCGAC
GCTCCGCTGC CCGGCGTGAT CGCCACCGAT CCGGCCGCCC AGGCGGTGAT CGCGTTCACC
TCGGGAACCA CCCGTGACCC CAAGGGGGTC ATCCACACCC ATCAGACCCT CGGCTTCGAG
ACGCGCCAGC TCGCCGGCCT CTACCCGCCG GACCGCGGCC GCCAGCTCAC CGCCGCGCCG
GTGGGGCACT TCATCGGCAT GCTGAACGCG TTCCTGATCC CCGTCCTGGA CGGCCGGCCG
ATCAACCTGG CCGACGTCTG GGACCCGGCC CGGGCCCTGG AACTGATGGT GAGCGACGGG
CTGACTGTCG GCGGCGGCGC CACCTACTTC GTGACGAGCC TGCTGGACCA CCCGAGCTTT
TCGCCCGAGC AGCACCTGCC GGCGATGAAG TACGCCGGGC TGGGCGGGTC GTCCGTGCCG
GCGGCGGTCA CCACCCGGCT GGACGAGCTC GGCATCACCG TCTTCCGGTC CTACGGCAGC
ACCGAGCATC CGTCGATCAC CGGCTCGCGC CACACCGCGC CGGCGGAGAA GCGGCTGTTC
ACCGACGGGG ACCCGCTGCC CGGGGTGGAG ATCCGCCTCG ACGCCGACGG CGAGATCCTC
AGCCGCGGCC CGGACCTCTG CCTCGGCTAC CTCGACGAGG CCCTGACCGA GCAGGTCTTC
GACGCCGACG GCTGGTACCG CACGGGCGAT GTCGGTGTCC TCGACGCCGA CGGCTACCTT
ACGATCGTCG ACCGCAAGGC GGACTTCATC ATCCGCGGCG GGGAGAACAT CAGCGCGCTC
GAGGTCGAGG AGGTGCTGCT CACCATGCCC GAGGTCGCCG AGGTCGCGGT GGTCGCGGCC
CCCGACGCCA GGCTCGGCGA GCACGCCGCC GCGATCCTGC GGCTGCAGCC CGGGAGCGAG
CTGCCCACCC TGGAGGAGGT GCAGGCGCAC TTCGCGCGGG CCGGTCTGGC CCGGCAGAAG
TGGCCCGAGG AGCTTCGCGC GATCGAGGAC TTCCCCCGGA CGCCCAGCGG TAAAATCCAG
AAGGCCGTGC TCCGCCGGGA GCTGCGCGCC GTCCCGCAGG GGGGCCGGCA GGGGGACCCG
TCGGGACACA TTGCGGCTGG GCAGGGAAGA GAATAG
 
Protein sequence
MRTIPPELVK RYENEGWWTP ETLGDLLAAG LDATPGAEFR VHSAVRPWSG TFRDVEQVAR 
RLAAGLRARG VGPGDVVVFQ LPNWMEAAAV FWASSFLGAV VVPVVHFYGR KELGHILERT
SPKVFVTAEG FGRMEYQPDL SRDVPVVGVV GRDFDELLAD APLPGVIATD PAAQAVIAFT
SGTTRDPKGV IHTHQTLGFE TRQLAGLYPP DRGRQLTAAP VGHFIGMLNA FLIPVLDGRP
INLADVWDPA RALELMVSDG LTVGGGATYF VTSLLDHPSF SPEQHLPAMK YAGLGGSSVP
AAVTTRLDEL GITVFRSYGS TEHPSITGSR HTAPAEKRLF TDGDPLPGVE IRLDADGEIL
SRGPDLCLGY LDEALTEQVF DADGWYRTGD VGVLDADGYL TIVDRKADFI IRGGENISAL
EVEEVLLTMP EVAEVAVVAA PDARLGEHAA AILRLQPGSE LPTLEEVQAH FARAGLARQK
WPEELRAIED FPRTPSGKIQ KAVLRRELRA VPQGGRQGDP SGHIAAGQGR E