Gene Franean1_4558 details


Gene Information

Locus tag: Franean1_4558
Symbol:
ID: 5672905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5437257
End bp: 5438792
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 73%
IMG OID: 641243421
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001508837
Protein GI: 158316329
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
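
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; neither is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 5437257
END_BP = 5438792


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded, assuming the final codon is a stop codon (an assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1536 511
```

Under those assumptions the computed values agree with the reported gene length of 1536 bp and protein length of 511 aa.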


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGC ATGACTCGGC GTCGTCCGTC GGGGACAGCG CCCGCAGCCT GATCGCCCGG 
CGGATCTTCG GCGTGCTGGC GCTCGACCCC GCCGCGACCG CGCTGACCTT CGGTGACCGC
ACCTTCCCCT GGTCCTACTA CGCCGACGCC ATCACCGATC TGGACGCGCT GCTGGCCGAG
TACCCGCGGG CCCGGCAGAT CGGGATCGTG CTGCGCAACC GCCCCGGCCA GCTGTGCTCC
GTGATCGCCA CCATCGCCAC CGGCCGGACG GTCGTGACGT TGAGCCCGCA CCTCGGGGAC
ACCGGCCTGG CCGAGGACAT CGTCGATCTG GCACCCGACG TGGTCGTCGC CGACGAGGAG
GACTGGGCCC GCGCCGCGAT GGTCGAGGCC ACGACGGCTG TCGGCGCGAT CGCGCTGCGC
ACCGGCCCCG GCCGGGCGTT CGTCCGGCAC CCGATGCCGG CCCCGCCGTC CCCCGCGTAC
AAGCCCGCCG CCGACGTCGC CGTCCTCATG ATGACCAGCG GCACCACCGG CCGGCCCAAG
CGCGTCGAGC TCACCTACCA GCGGATGGCC GCGGCGTTCC GCGCCGCGGG AACCCCCGTC
GACGAGGGCC GGGAGCTTCG CCTGCACCGG CGGACCGCCA TCCTCTGGGC TTCGCTCGCC
CACATCAGCG GCCTCTACTT CGCGATCGCC CACGCGATGG AGGGCAGGAG CATCGCCCTG
CTCGAGAAGT TCGAGGTCCA GGCCTGGGCC GAGCTCGTCC GCCGCCACCG GCCGGGCTAT
GTCCGCCTCG CCCCGACCGC GATGCGCATG GTGCTCAACG CCGACCTGCC TCGGGACGTC
TTCGAGAACG TCTTCGCCGT CGGTTCGGGG ACCGCGCCCC TGCCCGCCGA ACTCGCCGAC
GCCTTCGAAG ACCGCTACGG GGTCCCGGTC CTCGGCACCT ACGGCGCCAC CGAGTTCGCC
GGCGCGATCG CCGGCTGGAC CATCGACGAC AAGCGGGAAT GGGGCACTCG CAAGCGAGGC
AGCGTCGGGC GCGCGTACGA CGGCATCGAC CTGCGGGTGG TGGATCGCGA CAGCGCGACG
GTTCTCGCGC CGGGCGCGGT CGGGCTGCTC GAGGCGCGCG GGGGACAGCT GTCCGACGAC
GGTGGTGCCT GGATCCGCAC CACCGACCTG GCCTCGATCG ACGACGACGG CTTCCTGTTC
ATCCACGGCC GGGCCGACGA CGCGATCAGC CGCGGCGGCT TCAAGATCCC GCCGAGCGTG
ATCGAGGAGG CCCTGGCCCA GCACCCGGCG GTCGACGAGG CCTCGGCCGT GGGACTCGCC
GACCCGCGGC TGGGCGAGGT CCCGGTGGTC GCGGTCACAC TGAGCGCGCC CGCGACGGAG
GCGGAGCTGA TGGAGTTCCT CTCGGCCCGG TTGACGCGCT ACCAGCGGCC GGTCGACCTC
GCGATCGTCG ACGCGCTGCC GCGTACCCCG TCGCTGAAGG TGAGCCGCGC CCTCGTCCGG
GAGCAGATCT TCGCCCGCCG GCCCACCGCG ACCTGA
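
A GC content figure such as the 73% reported for this gene is the share of G and C bases in the sequence. The following is a minimal sketch of that calculation, not the method the source database uses; the helper names are hypothetical. It strips the spaces and line breaks that separate the 10-base blocks in the listing before counting.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence listed above.
first_block = "ATGAGCACGC"
print(gc_content(first_block))
```

Passing the full gene sequence as a single string (blocks and line breaks included) gives the value for the whole gene, which can then be compared with the reported figure.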
 
Protein sequence
MSTHDSASSV GDSARSLIAR RIFGVLALDP AATALTFGDR TFPWSYYADA ITDLDALLAE 
YPRARQIGIV LRNRPGQLCS VIATIATGRT VVTLSPHLGD TGLAEDIVDL APDVVVADEE
DWARAAMVEA TTAVGAIALR TGPGRAFVRH PMPAPPSPAY KPAADVAVLM MTSGTTGRPK
RVELTYQRMA AAFRAAGTPV DEGRELRLHR RTAILWASLA HISGLYFAIA HAMEGRSIAL
LEKFEVQAWA ELVRRHRPGY VRLAPTAMRM VLNADLPRDV FENVFAVGSG TAPLPAELAD
AFEDRYGVPV LGTYGATEFA GAIAGWTIDD KREWGTRKRG SVGRAYDGID LRVVDRDSAT
VLAPGAVGLL EARGGQLSDD GGAWIRTTDL ASIDDDGFLF IHGRADDAIS RGGFKIPPSV
IEEALAQHPA VDEASAVGLA DPRLGEVPVV AVTLSAPATE AELMEFLSAR LTRYQRPVDL
AIVDALPRTP SLKVSRALVR EQIFARRPTA T
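
The protein sequence follows from the gene sequence by codon translation. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code as far as I am aware. It does not handle the alternative start codons that table 11 permits; this gene starts with ATG, so that case does not arise here. The names are illustrative, and the code is a simplified stand-in for a full translation tool.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace between sequence blocks is ignored. Codons containing
    characters other than A, C, G, T raise KeyError in this sketch.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGAGCACGC ATGACTCGGC GTCGTCCGTC"))  # MSTHDSASSV
```

The output matches the first ten residues of the protein sequence in this record.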