Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1928 |
Symbol | |
ID | 5670329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2310397 |
End bp | 2311974 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240849 |
Product | anthranilate synthase component I |
Protein accession | YP_001506271 |
Protein GI | 158313763 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.929386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCG GCGAGATCAC ACCGAGCCGG GCGGAGTTCC ACGAGCTCGC CGCGCGCCAG CCGGTCGTCG CGGTGTCCCG CCGCCTGCTC GCCGACGGCG AGACACCGGT CGGGGTCTAC CGCAAGCTGG CCGGCGGGCC GGGGACGTTT CTGCTCGAGT CTGCCGAGCA CGGGGGCGTG TGGTCGCGTT ACTCCTTCGT CGGCGTCCGT GCCGCGGCCA CACTCACTGA ACGGGACGGG CAGGCCGCCT GGACGGACGG AACCCCGCCG CCCGGTGTCC CGCTCGACGG CGACCCGCTC GATGTCCTGC GTGCCGTGGA ACGCCAGCTC TGCTCGGCCC GGCCGAGTGG CACTCCGCCG CTGCTGGGCG GCCTGGTCGG GTACCTCTCG TACGACATCG TCCGGCGCAT CGAGCGGTTG CCCGCCCGGG CAACCGACGA TCTCGGGATG CCCGAGCTGC GGATGCTCCT GACGACCGAC CTGGCCGTCC TCGACCACAC CGACGGATCG TGCCAGCTCG TCGCGAACAT CTTCACCGGC GCCGGGGACC CGGCCGACCC GGCCGAAGCA GCGGACCTGG CTGGCCCAGG AGCGGCGGGC GGACCGTCCG CGCGGCGGGC GGAGCTGGAC GCCGCCTACG ACGATGCGGT GCACCGCATC GAGGTGATGA CGGCAGATCT CGGCAAGTGG AGTGAGCCGA CCGTGGCGAC CACAACCGGG GCGTCCACGG GCGTGCGCGA CTTCGCCTCC GCAACCCCGC CCGGCGGCTT CCACGCTGCC GTCGAGCGGT CGATCGAGGA GATCCGGGCG GGGGAGTGCT TCCAGATCGT GGTCTCCCAG CGGTTCGAGC GCCCCACCAC CGCTGACGCC CTCGACGTCT ACCGGGTCCT GCGGGCGTCG AACCCCAGCC CCTACATGTA CCTGCTGCGG TTCGCCGATC ATGATGTGGT CGGCTCGTCG CCGGAGGCGC ACGTCAAGGT CACCGGCCGC CGGGCGTTGC TGCACCCGAT CGCGGGCAGC CGGCCGAGGG GCGAGACCCC CGAGCGCGAT GCCGAACTGG CTGCCCAGCT CCTGGCCGAT CCGAAGGAAC GGTCCGAGCA CGTGATGCTG GTCGACCTGG TCCGCAATGA TCTCGGGCGG GTCTGCGTGC CCGGATCGGT GCGGGTGGTC GAGTTCGCGT CCGTCGAGCG GTTCTCGCAC ATCATGCACA TCGTCTCCAC CGTGATCGGT GAGGTGGCGC CCGAGCGCAG CGCGGTCGAC GTCCTCGCCG CGACCTTTCC CGCCGGGACG TTGTCGGGAG CGCCCAAGGT GCGGGCCATG GAGATCATCG ACGAGCTCGA GCCGACGAGG CGCGGCCTGT ACGGCGGGGT CGTGGGATAT CTCGATTTCG GCGGTGACCT CGACACCGCG ATCGCCATCC GCACAGCGGT CCTCCGTTCA GGAATGGCCT ACGTGCAGGC CGGCGCCGGG ATCGTGGCGG ACTCCGTTCC CGACACCGAG GATCTCGAGA GCCGGACGAA GGCCGCGGCG GTTCTCCGCG CGATCGAGGT GGCGGAGTCG CTCCGCCCGC CGGTATGA
|
Protein sequence | MTTGEITPSR AEFHELAARQ PVVAVSRRLL ADGETPVGVY RKLAGGPGTF LLESAEHGGV WSRYSFVGVR AAATLTERDG QAAWTDGTPP PGVPLDGDPL DVLRAVERQL CSARPSGTPP LLGGLVGYLS YDIVRRIERL PARATDDLGM PELRMLLTTD LAVLDHTDGS CQLVANIFTG AGDPADPAEA ADLAGPGAAG GPSARRAELD AAYDDAVHRI EVMTADLGKW SEPTVATTTG ASTGVRDFAS ATPPGGFHAA VERSIEEIRA GECFQIVVSQ RFERPTTADA LDVYRVLRAS NPSPYMYLLR FADHDVVGSS PEAHVKVTGR RALLHPIAGS RPRGETPERD AELAAQLLAD PKERSEHVML VDLVRNDLGR VCVPGSVRVV EFASVERFSH IMHIVSTVIG EVAPERSAVD VLAATFPAGT LSGAPKVRAM EIIDELEPTR RGLYGGVVGY LDFGGDLDTA IAIRTAVLRS GMAYVQAGAG IVADSVPDTE DLESRTKAAA VLRAIEVAES LRPPV
|
| |