Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1903 |
Symbol | |
ID | 5670304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2282727 |
End bp | 2284526 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240824 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001506246 |
Protein GI | 158313738 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC TGCTTCCCGA CGTCCTGCGT GGGCGGGCCG CGCTCGAACC GGAGCGGACG GCGTACGTCT TCGTGAACGA GCGTGCGGAG GAGACCGGCC GCCTGACCTA CGGCGGGTTG CACGCCCGCG CGCTGGCCGT GGCCGGTGAG CTGTCCCGGG CCTGCGAGCC CGGTGACCGG GCGCTGCTGG TGTTCCCGCA GTGCCTGGAC TTCGTCGTCG CGTACCTCGG CTGTCTGTAC GCGCGGGTGG TCGCGGTGCC CGTCCACCCG CCGCACCGGG ACGGTGTCCA GGACTCCACC CGCCGGATCG TGGACGACTG CGAGCCGGCC GCGGTTCTCA CGCTCGAGGC GATGGCCCGC GAGCTCCGGG CCGCCCTGAC CTCGCCGCGC GGCGCGATGT CCTGGATCCC CGTCGACCGG ATCCCCGCCG GCCCGGCTCC CACGGGCATC CACACCGTGG ACGTCCACGC CGCGGGCCTC GACCCCACGG ACATCGAGCC CATGGGCCTC GATCCCGCCG ACATCGCGTT CCTGCAGTAC ACGTCGGGCT CGACGTCGGA CCCCAAGGGG GTCATGGTCT CCCACGGGAA CCTCGCCGCC AACCAGGAGA TGATCCGGCG GGCCTTCGGG CACGACCGCG ACTCGACGGT GGTCGGCTGG GCTCCGTTCT TCCACGACCA GGGCCTGATC GGCAACGTGC TGCAGCCGCT GTACGCCGGG GCCACCTGCG TCCTGATGTC CCCGACCGCC TTCATCCGGT GGCCGATGCT GTGGCTGTCG CTGATCTCCC AATACCGCGC GCACACCAGC GGCGGCCCGA ACTTCGCCTT CGAGGCCTGC GTGGCCCGCG CCGCCCGGGG CGGGGTGCCC GATCTCGATC TCAGCTGCTG GAAGGTCGCC TTCAACGGCG CGGAGCCCGT CCGCCCCGAC ACCCTGCGAC GCTTCGCCGA GACGTTCGCG CCGCACGGGT TCGACGAGCG GGCGCTCTAC CCGTGCTACG GCCTCGCCGA GGCGACGCTG CTGGTGACCG GCAGCGAGAA GGGCCGCGGG GCGCGGGTGA TCGAGGCGGC CACCGAGGAC CTGCGTGCGG GCCGCTACAC GCCGGTGCCC GCGGGCCGCG GCAGCACCCT GGTCGGCTCC GGGGTCCTCC CCCGGGACGG CGCGCTGCGG ATCGTGGACC CGCGGACGGG ACGCTCTCTC CCGCCGGACC GGATCGGGGA GATCTGGGTC GCCGGGGACC ATGTCGCGCG GGGCTACTGG AACCGCCCGT GGGAGAGCAC CGAGACCTTC CGCGCCGAGC ACGCCGACGA GCCCGGCCGC GGCTACCTGC GCACGGGGGA CCTCGGCCTG GTGGTCGACG ACGAGCTGTT CGTCGTGGGA CGGCTGAAGG ACCTCGTGAT CATCCGGGGC CGGAACTACT ACCCCCAGGA CCTCGAGCAC ACCGCGCAGT CCGCGCATCC CGCGCTGCGG CCGGGCGGGT GTGCCGCGTT CTCCGTTCCG GGTGCCGACC GGGAGAGGCT CGTCATCGTC CAGGAGGTCA GGGGCGAGTT CCGGCGGCGG GCCGACCCGG GTGAGGTCGC CGGGGCCATC CGGGCCGCGG TGGTGCGTGA GCACCAGGTC TCCGTCGGGG ATCTCGTGCT GACGCTGCCG GGCCGGCTCC AGAAGACGAC CAGCGGAAAG ATCATGCGAG CCGCGGCCCG ACGCCGCTAT CTGCGAGACG CCTTCGACCG CTGGACGCCG CCGACCGGCC CGTCCACCGG CCCGTTTCCC AGCACCGACA ACCCGCATGA GAAGACCTGA
|
Protein sequence | MTTLLPDVLR GRAALEPERT AYVFVNERAE ETGRLTYGGL HARALAVAGE LSRACEPGDR ALLVFPQCLD FVVAYLGCLY ARVVAVPVHP PHRDGVQDST RRIVDDCEPA AVLTLEAMAR ELRAALTSPR GAMSWIPVDR IPAGPAPTGI HTVDVHAAGL DPTDIEPMGL DPADIAFLQY TSGSTSDPKG VMVSHGNLAA NQEMIRRAFG HDRDSTVVGW APFFHDQGLI GNVLQPLYAG ATCVLMSPTA FIRWPMLWLS LISQYRAHTS GGPNFAFEAC VARAARGGVP DLDLSCWKVA FNGAEPVRPD TLRRFAETFA PHGFDERALY PCYGLAEATL LVTGSEKGRG ARVIEAATED LRAGRYTPVP AGRGSTLVGS GVLPRDGALR IVDPRTGRSL PPDRIGEIWV AGDHVARGYW NRPWESTETF RAEHADEPGR GYLRTGDLGL VVDDELFVVG RLKDLVIIRG RNYYPQDLEH TAQSAHPALR PGGCAAFSVP GADRERLVIV QEVRGEFRRR ADPGEVAGAI RAAVVREHQV SVGDLVLTLP GRLQKTTSGK IMRAAARRRY LRDAFDRWTP PTGPSTGPFP STDNPHEKT
|
| |