Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3965 |
Symbol | |
ID | 5672326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4747678 |
End bp | 4749519 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242844 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001508261 |
Protein GI | 158315753 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.666614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.454806 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGACG CTCCGGCCGA GGTGGCGATC CCGGCGCCCC GCGCCGTCCC CGTTTCCTCA CCCGTCGATC TCGTACCTGC CGATCTCGTA CCTGCCGATT CCGTGGCCGC CGGCTCGACG GCCGCCGCCG CGCTCGCGGC GGGCACCCTG CACGCCGGCC TGGCCACGGT GGCGGCCCGC CATCCGGGGT TGCCTCTCGA CTTCCCCTCC GCCGGAGCGT CGCTGACGCT GGGCGAGCTC GTCGCCCGCG CGGACGTCCT GGCGGCAGCC CTGACCGGTG CCGGGGTCGT CGCCCGTGAC CGGGTCGGAG TGCTCAGCGA CAACGCGCCC GACTTTCTGG TGGCCCTGGC CGGAGTGAGC CGGGCGGGAG CCGCCGCGTG CCCCCTGCCG CTGCCCGCCT CCACCCGTGA CCTGCCCGGC TACGCCGCGC GGCTGGCGCG CGCGGTCGCC GTCGCCGACA TCCGGCTCGT GCTGGTCGGC GGCCGGACGG CCCGGATGGC CGACCGCTTC GCCGGGGCCT TCGACGGCGT CCGCCTCGTC CGGGTCGCCG ACCTCACCAC ACCGGCCGCC GCCGGCACTG TCCCGGCCAC CGGCACCGCG CCAGCCGCCG CTGGCGGTGC GGCCCCGGCC GGCCCGGCGG TGGAGGTGTC ACCCGACGAG GCCGCCCTGG TCCAGTTCAC CTCCGGTAGC ACCGCCGCCC CGAAGGGCGT CGTACTGACG CACCGCAACA TCCTGGCCGG GCTGGCCGCG ATCATCGGCG GCGTCGCGCT GACCGAGGTC GACCACGGCG GCATCTGGCT GCCGCTCTTC CACGACATGG GCCTGTTCGG CACGCTCGCC GGCATCTTCA CCGGCATGCC GATGACCGTC TGGTCACCGG CCGCCTTCGT GAAGGACCCG GCCGGCTGGC TGACAGACTT CCTCGGCCGC GGCGGCAGCA TCGCCCCCAT GCCGAACTTC GCCTACGACC ACCTGGCGGA GGCCGTCCCC GCGCCGCGGG AGGCCGGCCT GGACCTGAGC GGCTGGCGGG TCGCCTTCAA CGGCGCCGAG CCGGTCGAGC CCGCCTCGGT CGAGCGCTTC CTCACCACCT TCACCCCGGC CGGCTTCGCG CCGGCGGCGA TGATGCCCGT CTACGGGATG GCCGAGGCCA CGCTGGCGGT GACCTTCCCG CCACCGGGCC GCGCCCCCGT GCACCGCTGG GTCGACCGCG ACCTGCTCGC CCGCGACGGC GTCGCGCGCG ACGTGCCCGC CGGCTCGCCG TCCGCCCGCG GGCTGGCCGG GGTGGGGCGA CCGGTGCGCG CCATGCGGGT GCGGATCGGC GGCCGCGACG GCACCGGCGT GCTCGGTGAC GACCAGGTCG GCGAGATCCA GATCAGCGGC GACGCGGTGA CGGGCGGCTA CCTGACCGAC ACCGGCGCGC AGCCGTCCGG CGCGTTCACC GCGGACGGCT GGCTGCGCAC CGGCGACCTT GGCCTGCTGC GCGACGGCGA GCTCTTCGTC ACCGGCCGGG ACAAGGAGAT GGTGATCGTC CGCGGGGTGA ACTACTACCC CCACGACGCC GAGGAGGCCG CCCGGGACGT CCCCGGCGTC CACCGCCGCC GCTGCGTCGC CTACGCGGAC CGCTCACCCG GGGGCGCCGA GACGATGGCA GTCCTCGCCG AGACCCGGCT GGTCGACGAC ACCGAGCGCG CGGCGCTGGC CGCCGCGATC CGGGTGGCGG TGACCGCCGC GCTGGGGCTG GCCGAGATCG CCGTCGCCCT CGTCGGGCCC GATGCCCTGC CGCGGACGTC CAGCGGGAAG TTCCAGCGCC TCGCCGCGCG CGAGGCATGC GTACCGACAT GA
|
Protein sequence | MHDAPAEVAI PAPRAVPVSS PVDLVPADLV PADSVAAGST AAAALAAGTL HAGLATVAAR HPGLPLDFPS AGASLTLGEL VARADVLAAA LTGAGVVARD RVGVLSDNAP DFLVALAGVS RAGAAACPLP LPASTRDLPG YAARLARAVA VADIRLVLVG GRTARMADRF AGAFDGVRLV RVADLTTPAA AGTVPATGTA PAAAGGAAPA GPAVEVSPDE AALVQFTSGS TAAPKGVVLT HRNILAGLAA IIGGVALTEV DHGGIWLPLF HDMGLFGTLA GIFTGMPMTV WSPAAFVKDP AGWLTDFLGR GGSIAPMPNF AYDHLAEAVP APREAGLDLS GWRVAFNGAE PVEPASVERF LTTFTPAGFA PAAMMPVYGM AEATLAVTFP PPGRAPVHRW VDRDLLARDG VARDVPAGSP SARGLAGVGR PVRAMRVRIG GRDGTGVLGD DQVGEIQISG DAVTGGYLTD TGAQPSGAFT ADGWLRTGDL GLLRDGELFV TGRDKEMVIV RGVNYYPHDA EEAARDVPGV HRRRCVAYAD RSPGGAETMA VLAETRLVDD TERAALAAAI RVAVTAALGL AEIAVALVGP DALPRTSSGK FQRLAAREAC VPT
|
| |