Gene Franean1_4933 details


Gene Information

Locus tag: Franean1_4933
Symbol:
ID: 5673272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5922091
End bp: 5923590
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 72%
IMG OID: 641243787
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001509203
Protein GI: 158316695
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
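
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5922091
end_bp = 5923590
gene_length_bp = 1500
protein_length_aa = 499


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # compare with Gene Length
print(expected_protein_length(gene_length_bp))  # compare with Protein Length
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.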


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0506811
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.147252
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCTCG GGATGCTGTT GGAGATGGCC GCGGACGGCC TGGCCGACCG GGTCGCGCTC 
GGTCCGCGCG TCGGCGGGCT GAGCTTCGCC GAACTCGCGC ACCAGGCGCG CCGGGTCGGT
GCCGCGCTGC AGACCCTGCC TGGAGACCGG GTCGGGCTGA TCGACCTGAA CTCGCCCGCC
GTCCCGCTGA CCCTGTTCGG GTCGGCGATC GCGGGCAAAC CGTTCGTGCC GATCAACTAC
CGGCTCGCCG ACGAGCAGCT GCGCGCGATC GTCACCCGGA CCGCGCCGGC CACCGTCGTC
GTCGGGGCCG GCGTCGCCGA GCGGCTGGGC GACATCGACG GCATCCACCT GGTCACCCGG
GACGAGCTGC TCGCCATCGC CGCGGACACC GAGGCCAAGG AGGCCGACGG CTGGGGCGGC
GACCCCGAGG ACATCGCCGT GCTGCTGTTC ACCAGCGGCA CCACCGGCGA GCCCAAGGCC
GCTGTGCTGC GCCACCGCAA CCTGACCGAG TACGTCATCT CCACGGTCGA GTTCGCCGGT
TCCGCCGAGG ACGAGGTCGC GATCGTCAGC GTCCCGCCAT ACCACATCGC CGGGGTGTCG
GCGTCCTGCT CGTCGACCTA CTCGGGACGT CGGGTCGTCC AGCTCGAGAG CTTCGAGCCC
CGGGCCTGGG TCGACCTCGT CCGGGCGGAG TCGGTCACGC ACGCGATGGT GGTGCCGACC
ATGCTCGGCC GCATCCTCGA CGTCATCGAG GCCGACGGCC AGGGGCTGCC GTCCCTGCGG
TCGATCTCCT ACGGCGGCGG GCCCATGCCA CTGCCGGTCA TCGAGCGCGC GGTCACCGCA
CTCCCGCACG TGGGGTTCGT GAACGCCTAC GGGCTCACCG AGACCTCGAG CACGATCGCC
GTCCTCGGCC CGGACGACCA CCAGGCCGCC ATCAGCAGCG ATGACCCGGA GGTACGGGCG
CGGCTGGGGT CGGTCGGCAA GCCGCTGCCG AGCCTGGAGG TCACGATCCG CGACCCGGGC
GGCCAGGAGG TCCCCACCGG CGAGCACGGT GAGATCTGGG TCCGCGGCGG GCAGGTCTCC
GGCGAGTACC TGGGCATCGG GCGGATCGAG AACGACGGCT GGTTCCCGAC CCGCGACGAG
GGTCACCTCG ACTCGGGCGG CTACCTGTAC GTCCACGGCC GGCTCGACGA CGTCATCGTG
CGCGGCGGGG AGAACATGTC CCCGGGCGAG ATCGAGGCTG TGCTCATCAC CCATCCCGCC
GTCGAGGAGG CCGCCGTCGT CGGCATCCCG CACCGGGACT GGGGGGAGCA GGTGGTGGCC
GCGGTCGTCA CCTCCGGCGA GGTCACCGAG GACGAGCTGC GCGGCCACGT GCGAGCCCAG
CTCCGGTCCA GCCGCACCCC GGAGCACATC CAGTTCCGCT CGGAGCTGCC GTTCAACGAG
AACGGCAAGC TGCTCCGCCG GGTGCTGCGC ACCGAGCTCG AGCAGGCGTT CTCCTCCTGA
 
Protein sequence
MHLGMLLEMA ADGLADRVAL GPRVGGLSFA ELAHQARRVG AALQTLPGDR VGLIDLNSPA 
VPLTLFGSAI AGKPFVPINY RLADEQLRAI VTRTAPATVV VGAGVAERLG DIDGIHLVTR
DELLAIAADT EAKEADGWGG DPEDIAVLLF TSGTTGEPKA AVLRHRNLTE YVISTVEFAG
SAEDEVAIVS VPPYHIAGVS ASCSSTYSGR RVVQLESFEP RAWVDLVRAE SVTHAMVVPT
MLGRILDVIE ADGQGLPSLR SISYGGGPMP LPVIERAVTA LPHVGFVNAY GLTETSSTIA
VLGPDDHQAA ISSDDPEVRA RLGSVGKPLP SLEVTIRDPG GQEVPTGEHG EIWVRGGQVS
GEYLGIGRIE NDGWFPTRDE GHLDSGGYLY VHGRLDDVIV RGGENMSPGE IEAVLITHPA
VEEAAVVGIP HRDWGEQVVA AVVTSGEVTE DELRGHVRAQ LRSSRTPEHI QFRSELPFNE
NGKLLRRVLR TELEQAFSS
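
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline used to produce the record: it strips the 10-character grouping, computes the GC fraction, and translates with the standard codon assignments. Translation table 11 uses the same codon assignments as the standard code and differs only in permitting alternative start codons, which this sketch does not handle (the gene here begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
import itertools

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame 1 up to, but not including, the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCACCTCG GGATGCTGTT GGAGATGGCC GCGGACGGCC TGGCCGACCG GGTCGCGCTC"
print(translate(first_line))  # MHLGMLLEMAADGLADRVAL
```

The printed residues match the first two 10-residue groups of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.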