Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2941 |
Symbol | |
ID | 5671327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3458621 |
End bp | 3460942 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641241847 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001507267 |
Protein GI | 158314759 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.227779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTCA GAGCCCACCG GCAGATAAGC GCTCCTGTCG ATGTCGTCTG GGATGTGGTC ACCGACCACG AGGGGATGAG TACCTGGTTC CCCGGTGTCG CCGTCGCCCT GGAACGTCCG GGCGTGCAGG GCGGCATCGG TGCCGTCCGG ATCGTCCGGA TGACGGGCCT CAGCATCCGG GAGCAGGTGA CGGACATCGA GCCCGCCCGC CGGCTCGCCT ACCGGTGCAT CTCCGGGCAC CCCTTGCGGG ACTACCGCGG AGAGATCTTT CTCGCGCCGT CCGGGACCGG TACGAGCCTG ACCTGGGTGC TCGACACCTC GACCCGTTTC CGTTTGGTCG CGTGGCTGCT GGTAAACCAG GCGCGTCTTT TTGCCTGGTC TCTCGCCAGG GCCGCGGAGC GTTCATCCGC TCTCATCCGG TCCGCCGACA CCGATCGGAC GAAGGAGCCC GCTGTGCAGC ACGTGCAGTC GACGGCGTCG ACGCAGCAGA CGCCCGGCGC AGATTCCAGC CCCGGACCAG CTTCGCTGGC CGAGGCATTC CAGCGCAATG CCAGTCGCGA CCCACAGGCG TTAGCCTTGA GCACCCCGGA CGGGTCAGCG ACCCTCACCT GGGGGCAATA CGCCGAGCAG GTGCGCGACA TCGCGGCTGC CCTGCATGCC CACGGCGTCC GGCGCGGCGA CAGCGTCGCC CTGATGATGC TCAACCGCCC CGAATTCTAT CCCATCGACA CGGCCGCTAT CCATCTCGGC GCGATCCCGT TCTCGATTTA CAACACGTCG TCCGCCGAGC AGATCCGGTG GCTGTTCGCC AGCGCGAAAC CGAGCATGGT CTTCTGCGAC AGCAGCCACG CCGCTGCGGT GCTGGAGGCA GTCGACGGCG GCACCGCTGT CAAAGCTGTT GTGTGCGTGG ACGGCGACGT CGAGGGGGCG ACGACCTCGG TGGAATTTCG GGGCGTCCGC AGCGACGACT TCGACTTCGA GAGCACCTGG CGTTCGGTGA CACCGGACGA CGTTCTCACG TTGATCTACA CATCCGGCAC GACGGGGGAG CCGAAAGGCG TCCAGATCAC GCACGGCAAC ATGCTTGCGC AGCTCGCCGC GACCAACACC TTCCTGGAAG CGGGCCCGGG CGACAGAGTC ATCTCCTTCC TGCCCTCGGC GCACATCGCG GACCGGTGGG CTGCGCACTA TCTCCAGCTG GTCTGCGGGA CGACCGTCTA CCCGCTGGTC GACCGCACTC AGCTGCTCCC GACAATGCTC CGCGTTCGTC CCACTCTGTT CGGCGCCGTG CCCCAGGTGT GGCAGAAGAT CCGTGCTGGG GTGCTGGCGA TGATCGACGC AGAAGCCGAC GAGGAGCGGC AGGCGGGCAT CCAGCAGACC CTGGCCGTCG GAGCCCGGTA CGCGCGAAGC CGCAGCGATG GCACCCTCAC GGCCGAGCTC GAGAGCCTTT TCGCAACGGC CGACACGCAG GTGCTCAGCC ATCTGCGTTC CAGGCTCGGC CTGGACCAGG CGCGGATCGT GATGTCTGGC GCGGCGGCGG TACCCGTGGA GATCGTCGAG TTCTTCAACT CCATCGGGGT TCCGCTCATC GACGGGTGGG GGATGTCCGA GCTCTCCTGC ATGGGCGCGT TCATGCCCAA CCACGCGCCG CGCCTGGGAT CGGTGGGCAT GGCTCTTCCC GGTGTCCAGG TCCGCCTGGG CGAGGACGGC GAACTGCTCG TGCGCGGACC GATCGTGATG AAGGGCTACC TCGGCCGACC CGAGCTGACT GCTGAGCTCA TTGACGACGA GGGCTGGCTG TACACGGGCG ACGTCGCCCG CATCGACGAC GAAGGATACA TTTATATTAT TGATCGAAAG AAAGAAATCA TTGTCAACTC CAGCGGAAAG AACATCTCGC CAGCGGGCAT CGAGGGTCAT CTGAAAGCGG CGAGTCCCCT TATCGGCCAA GCTGTCGTGA TCGGTGAGGC GCGGCCCTTT TTGACCGCGC TGATCGTGCT CGACGCGGAT GCCGCGGGCC AGTACGCGGC CTCCCGCGCA CTCCCGGCGG ACGCGAGCTC GCTCGCCGCG GACGAGGGAG TCGTCGCCGC GCTCTCCGCC GCCGTGACCG AGGCGAACAC CCACGTCTCG CAGGTCGAGC ACGTCCGGAA GTTCGCTGTC CTCCCCCAGT TCTGGGAGCC GGGAAGCGAG CTGCTTACGC ACACGATGAA GCTGCGCCGC AGGCCGATCG GCGCCCGCTA CGCCGACGTC ATCGAGGCGC TCTACCGGCT GCCGCGCGAC GAGTCGGTCC TGCGTATCGC GGACGCGGCG TCCGCCTCCT AG
|
Protein sequence | MRVRAHRQIS APVDVVWDVV TDHEGMSTWF PGVAVALERP GVQGGIGAVR IVRMTGLSIR EQVTDIEPAR RLAYRCISGH PLRDYRGEIF LAPSGTGTSL TWVLDTSTRF RLVAWLLVNQ ARLFAWSLAR AAERSSALIR SADTDRTKEP AVQHVQSTAS TQQTPGADSS PGPASLAEAF QRNASRDPQA LALSTPDGSA TLTWGQYAEQ VRDIAAALHA HGVRRGDSVA LMMLNRPEFY PIDTAAIHLG AIPFSIYNTS SAEQIRWLFA SAKPSMVFCD SSHAAAVLEA VDGGTAVKAV VCVDGDVEGA TTSVEFRGVR SDDFDFESTW RSVTPDDVLT LIYTSGTTGE PKGVQITHGN MLAQLAATNT FLEAGPGDRV ISFLPSAHIA DRWAAHYLQL VCGTTVYPLV DRTQLLPTML RVRPTLFGAV PQVWQKIRAG VLAMIDAEAD EERQAGIQQT LAVGARYARS RSDGTLTAEL ESLFATADTQ VLSHLRSRLG LDQARIVMSG AAAVPVEIVE FFNSIGVPLI DGWGMSELSC MGAFMPNHAP RLGSVGMALP GVQVRLGEDG ELLVRGPIVM KGYLGRPELT AELIDDEGWL YTGDVARIDD EGYIYIIDRK KEIIVNSSGK NISPAGIEGH LKAASPLIGQ AVVIGEARPF LTALIVLDAD AAGQYAASRA LPADASSLAA DEGVVAALSA AVTEANTHVS QVEHVRKFAV LPQFWEPGSE LLTHTMKLRR RPIGARYADV IEALYRLPRD ESVLRIADAA SAS
|
| |