Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3708 |
Symbol | |
ID | 5672074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4390621 |
End bp | 4392276 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242591 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001508011 |
Protein GI | 158315503 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAACG TCGCGAGCCT GCTGACCGCC TCGGCGCGGC GGCACCCGGA CCGGTGCGCG GTGCGGTTCG CCGGACGCCG GACGTCCTAC GGCGAGCTGC GCGAGCAGGC GGCCAGGTTC GGCTCGGCGC TGCTCGGGCG CGGCCTGGAA CGCGGCGACC GGGTCGCGGT GCTGCTGCCC AACTGCCCGC AGTACCTGGC CGTGCTGTTC GGGGCCTGGC ACGCCGGCCT GGTCGCCGTG CCGATGAACG CCAAGCTGGC CGGCCCGGAG ATCCAGGTGA TCCTGGACGA CAGCGGCGCC CGGGCGTTCG TCCACGCCGG CGCGGGCACA GTCGCCGGCC TCGATCTCAC CGGGGTCCAG GAGGTGGTGG TCGACGTCCG CGGCGTCGGC GCCGGTGTCA GCGCCGACAC CGACACCGAC ACCGGAGACG GTGCCGTTGC CGGCAGGACT GCCGGAGGCG GCCCGAGCGG GTTCGACCGG CTGCTGGCCG AGGGCTCCGC CGAGCTGGTA CCGGTCGACG TCGCCGGGGA CGACCTCGCC TGGCTGTTCT ACACGTCGGG GACGACCGGC CGCCCCAAGG GGGCCGAGCT CAGCCACCGC AACCTCACCG TCACGACCTG GACGCTGCTC GCCGACGTGT GCGACTACCG GCCCTCGGAC CTCGCCCTGC ACGTCGCGCC GCTGTCCCAC GGCAGCGGCC TGTACTCCCT GGGCGCGATC GCCCGCGGCG CCGAGAACCT GATCCACGAC GGCGGCGGGT TCGACCCGGC CGAGGTACTC GAGCTCGTCG CGCGCGAACG GATCACCGTC ATCGCCTTCC TGGTGCCGAC GATGATCGTG AAGCTGCTCG GTGCCCCCGA GACGGACACG AGCTCGTTGC GCTGCGCCGT CTACGGCGGC GCGCCCATCC ACGTCGAGCA CTCCCGCGCG ATGATCGAGC GGTTCGGGCC GGTGTTCGTG CAGATCTACG GCCAGGGCGA GTCACCCATG ACCATCACCT ACCTCGACCA CGGCGCCTCA CCCGACACCC CCCTCGACTC GGCGGGTGTG GCACACCCGG GGGTCGAGGT GCAGATCATG GGCGCCGACG ACCGGCCGCT GCCCGCCGGC GAGGAAGGCG AGATCTGCGT CCGCGGGGAC GTGGTGATGC GGGGCTACTG GAACAACCCG GAGGCGACCA GCCGGGCGTT GCGGGGCGGC TGGCTGCACA CCGGTGACAT CGGCCGCCTC GACGAGCACG GCCGCCTGTT CCTCCTCGAC CGCAGCAGCG ACGTCATCAT CTCCGGCGGG TCCAACATCT ACCCGCGGGA GGTCGAGGAG GTGCTGATCC AGCATCCCGC CGTCGCCGAG GTCGTGGTCT TCGGCGTACC CGACGAGCTC TGGGGCGAGA ACGTCGTCGC CGCGGTGGTG CCCGCGGCCG CGCCGCCCCC GGCGAACGAC CTCATCGACT TCAGCCTCAC CCACATCGCC CGCTTCAAGA AGCCGAAGCA GATCATCTAC GTCGACGCGC TGCCCAAGAG CTCCTACGGC AAGGTCCTGC GCCGGGAGGC CCGCCGGCTC GCACTGGCGG CGGGCGAGAC AGCCGGCCAC GAGCACGTGA CCGGCCAGCC TGCCACCATC AAGTCCGTTG ACCGCGATCC AGGAGCCGCC GAATGA
|
Protein sequence | MVNVASLLTA SARRHPDRCA VRFAGRRTSY GELREQAARF GSALLGRGLE RGDRVAVLLP NCPQYLAVLF GAWHAGLVAV PMNAKLAGPE IQVILDDSGA RAFVHAGAGT VAGLDLTGVQ EVVVDVRGVG AGVSADTDTD TGDGAVAGRT AGGGPSGFDR LLAEGSAELV PVDVAGDDLA WLFYTSGTTG RPKGAELSHR NLTVTTWTLL ADVCDYRPSD LALHVAPLSH GSGLYSLGAI ARGAENLIHD GGGFDPAEVL ELVARERITV IAFLVPTMIV KLLGAPETDT SSLRCAVYGG APIHVEHSRA MIERFGPVFV QIYGQGESPM TITYLDHGAS PDTPLDSAGV AHPGVEVQIM GADDRPLPAG EEGEICVRGD VVMRGYWNNP EATSRALRGG WLHTGDIGRL DEHGRLFLLD RSSDVIISGG SNIYPREVEE VLIQHPAVAE VVVFGVPDEL WGENVVAAVV PAAAPPPAND LIDFSLTHIA RFKKPKQIIY VDALPKSSYG KVLRREARRL ALAAGETAGH EHVTGQPATI KSVDRDPGAA E
|
| |