Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4558 |
Symbol | |
ID | 5672905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5437257 |
End bp | 5438792 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243421 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001508837 |
Protein GI | 158316329 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGC ATGACTCGGC GTCGTCCGTC GGGGACAGCG CCCGCAGCCT GATCGCCCGG CGGATCTTCG GCGTGCTGGC GCTCGACCCC GCCGCGACCG CGCTGACCTT CGGTGACCGC ACCTTCCCCT GGTCCTACTA CGCCGACGCC ATCACCGATC TGGACGCGCT GCTGGCCGAG TACCCGCGGG CCCGGCAGAT CGGGATCGTG CTGCGCAACC GCCCCGGCCA GCTGTGCTCC GTGATCGCCA CCATCGCCAC CGGCCGGACG GTCGTGACGT TGAGCCCGCA CCTCGGGGAC ACCGGCCTGG CCGAGGACAT CGTCGATCTG GCACCCGACG TGGTCGTCGC CGACGAGGAG GACTGGGCCC GCGCCGCGAT GGTCGAGGCC ACGACGGCTG TCGGCGCGAT CGCGCTGCGC ACCGGCCCCG GCCGGGCGTT CGTCCGGCAC CCGATGCCGG CCCCGCCGTC CCCCGCGTAC AAGCCCGCCG CCGACGTCGC CGTCCTCATG ATGACCAGCG GCACCACCGG CCGGCCCAAG CGCGTCGAGC TCACCTACCA GCGGATGGCC GCGGCGTTCC GCGCCGCGGG AACCCCCGTC GACGAGGGCC GGGAGCTTCG CCTGCACCGG CGGACCGCCA TCCTCTGGGC TTCGCTCGCC CACATCAGCG GCCTCTACTT CGCGATCGCC CACGCGATGG AGGGCAGGAG CATCGCCCTG CTCGAGAAGT TCGAGGTCCA GGCCTGGGCC GAGCTCGTCC GCCGCCACCG GCCGGGCTAT GTCCGCCTCG CCCCGACCGC GATGCGCATG GTGCTCAACG CCGACCTGCC TCGGGACGTC TTCGAGAACG TCTTCGCCGT CGGTTCGGGG ACCGCGCCCC TGCCCGCCGA ACTCGCCGAC GCCTTCGAAG ACCGCTACGG GGTCCCGGTC CTCGGCACCT ACGGCGCCAC CGAGTTCGCC GGCGCGATCG CCGGCTGGAC CATCGACGAC AAGCGGGAAT GGGGCACTCG CAAGCGAGGC AGCGTCGGGC GCGCGTACGA CGGCATCGAC CTGCGGGTGG TGGATCGCGA CAGCGCGACG GTTCTCGCGC CGGGCGCGGT CGGGCTGCTC GAGGCGCGCG GGGGACAGCT GTCCGACGAC GGTGGTGCCT GGATCCGCAC CACCGACCTG GCCTCGATCG ACGACGACGG CTTCCTGTTC ATCCACGGCC GGGCCGACGA CGCGATCAGC CGCGGCGGCT TCAAGATCCC GCCGAGCGTG ATCGAGGAGG CCCTGGCCCA GCACCCGGCG GTCGACGAGG CCTCGGCCGT GGGACTCGCC GACCCGCGGC TGGGCGAGGT CCCGGTGGTC GCGGTCACAC TGAGCGCGCC CGCGACGGAG GCGGAGCTGA TGGAGTTCCT CTCGGCCCGG TTGACGCGCT ACCAGCGGCC GGTCGACCTC GCGATCGTCG ACGCGCTGCC GCGTACCCCG TCGCTGAAGG TGAGCCGCGC CCTCGTCCGG GAGCAGATCT TCGCCCGCCG GCCCACCGCG ACCTGA
|
Protein sequence | MSTHDSASSV GDSARSLIAR RIFGVLALDP AATALTFGDR TFPWSYYADA ITDLDALLAE YPRARQIGIV LRNRPGQLCS VIATIATGRT VVTLSPHLGD TGLAEDIVDL APDVVVADEE DWARAAMVEA TTAVGAIALR TGPGRAFVRH PMPAPPSPAY KPAADVAVLM MTSGTTGRPK RVELTYQRMA AAFRAAGTPV DEGRELRLHR RTAILWASLA HISGLYFAIA HAMEGRSIAL LEKFEVQAWA ELVRRHRPGY VRLAPTAMRM VLNADLPRDV FENVFAVGSG TAPLPAELAD AFEDRYGVPV LGTYGATEFA GAIAGWTIDD KREWGTRKRG SVGRAYDGID LRVVDRDSAT VLAPGAVGLL EARGGQLSDD GGAWIRTTDL ASIDDDGFLF IHGRADDAIS RGGFKIPPSV IEEALAQHPA VDEASAVGLA DPRLGEVPVV AVTLSAPATE AELMEFLSAR LTRYQRPVDL AIVDALPRTP SLKVSRALVR EQIFARRPTA T
|
| |