Gene Information |
Locus tag | Franean1_6379 |
Symbol | |
ID | 5674695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7741057 |
End bp | 7742208 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641245228 |
Product | zeta toxin family protein |
Protein accession | YP_001510623 |
Protein GI | 158318115 |
COG category | |
COG ID | |
TIGRFAM ID | |
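The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API. Under those assumptions the span 7741057 to 7742208 gives 1152 bp, and 1152 / 3 - 1 gives 383 residues, which agrees with the record.

```python
# Values copied from the Gene Information record.
START_BP = 7741057
END_BP = 7742208
GENE_LENGTH_BP = 1152
PROTEIN_LENGTH_AA = 383


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

The strand field does not affect this check, because the length of a feature is the same on either strand.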
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGC CGAACCCGTA CTACCTTCCC GAAGAGCGTG CCCGGGAGAT CTTCACCAAC GAGATTGTCA GGGAGAAGTT CGCGGGTGTG GTCTCCCACC GCGACGACGG GCGACGACCG GTCGCGGTGA TCGTCATGGG CCAGCCTGGC GCCGGGAAGA CCCGAATCGC CGACGCCGTC AAGCAACAGC TCGACGAACG GGGCGGGGCC GCCCACGTCG TGGGCGACTT CTACAAGCCC TACCACCCCG ACTACGACGA GCTGGTCATC ACCGACCCGG AGCGCGCCTC ACCGCTGACC TCCCCGGACG CCCGCCGGTG GATCGACATG GCCACCGAGT ACGTGATCGA CCAGCGTGCG GACGTACTGC TGGAAAGCGC CGGTCGCGAC CGGGCCGACT TCGCCGACAT CGCCGAACGC CTCCACAACA ACGGCTACCG GGTGGAGGCG GCCGTGGTGG CCGTCGATGA GGCGCACAGC AGGCTCGGTA TCGTCGACCG CTACCACGAA CAGGTTGCGG ACACCGGATA CGGCCGGCTC ACCGCGCGGG AAACCCACGA CCTCTCCTAT GCCGGCGTGC TGGATTCGGC CGACTTCATC GACCGATCAG ACGCGGTCGA CGCGGTCGCC GTGGTCCGCC GCAACAACGA GATCCTGTAC GCCAACGAAC GCGATGCCAG CGGCGGGTGG CAGCAGGCGC CCGCCACCCG GGAGGCGATT GAGGCCGAGC GCGGCCGTAG CTGGAGCCCC GAGGAGACCG AACAGTTCGC TAACAAGATC GACAATCTGG CGAATGCGAT GGGCCCCCAG TGGCACACCG AGTTCGACGA CATCACTGAC CGGGCGCGGG CACGGGCCGA TCCGGCGGTG GCCCTGCCCA AGCGGAGTCC CACCGTCGAA ACGAGCACCA AAGCCGGCTC CCGTAGTCCG ATCCCTGCCT CTGGTACGGA CACCGCCGCG ACTCCCGATG CCGCCTCCCC CGCAGACCCC CAGATCACAC CGGCGGCGCC CTCACATACC ACCCAGTCCC CGGCACAAGC GTCGTTCTCT GGGTCTGTCA AGCCATCCCC GCCTCGCAGG GGCAGTGGCC CACCACCTGG AACCCCGCCA CCCGGTCCGC AGATCGGCCA GGGACCTGCC AAAGGCCGAT AG
|
Protein sequence | MTQPNPYYLP EERAREIFTN EIVREKFAGV VSHRDDGRRP VAVIVMGQPG AGKTRIADAV KQQLDERGGA AHVVGDFYKP YHPDYDELVI TDPERASPLT SPDARRWIDM ATEYVIDQRA DVLLESAGRD RADFADIAER LHNNGYRVEA AVVAVDEAHS RLGIVDRYHE QVADTGYGRL TARETHDLSY AGVLDSADFI DRSDAVDAVA VVRRNNEILY ANERDASGGW QQAPATREAI EAERGRSWSP EETEQFANKI DNLANAMGPQ WHTEFDDITD RARARADPAV ALPKRSPTVE TSTKAGSRSP IPASGTDTAA TPDAASPADP QITPAAPSHT TQSPAQASFS GSVKPSPPRR GSGPPPGTPP PGPQIGQGPA KGR
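The gene and protein sequences above are printed in blocks of 10 characters separated by spaces. The sketch below shows one way to strip that layout, compute GC content, and translate a coding sequence. It is an illustration under stated assumptions: the codon-to-residue assignments used here are those of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter for a gene that starts with ATG), and the helper names are hypothetical. Sequences containing characters other than A, C, G, and T will raise a KeyError in `translate`.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    prefix = "ATGACCCAGC CGAACCCGTA CTACCTTCCC"
    print(translate(prefix))
    print(round(gc_percent(prefix), 1))
```

Applied to the first three blocks of the gene sequence, `translate` returns the first ten residues of the listed protein sequence, which is a quick way to confirm that the two fields are in the same reading frame.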