Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7237 |
Symbol | |
ID | 5675538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8836043 |
End bp | 8837932 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641246074 |
Product | hypothetical protein |
Protein accession | YP_001511462 |
Protein GI | 158318954 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0824468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.337859 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGACG GTGCGTGGAG AGGTGACCCT CGGGATCCGG CTGGGCCCGG CTACCGGGAA CGGCCGCCCG GCGCACCGGA CGACCCGGGG CCGGACGGCT ACTGGGACGG CCCGGACCAG CATCAGCAGT ACCGGAACGA TCCCCGCGGC AACGGTTCCC ACCAGGGCGT GCCGCACTAC CAGGGCGGCC CGCCCGCGGG ATATCCCGAC GGCCCCCAGC CGGGCGGCGC CTACCAGCAG CCGGGATACC CGCCGGCCGC CGACCCCGGG CGCGCCTACG GCTCCGGCCC CGGTGGGGGT CCCTACCCGC CGGCAGGACC GGCGGGAGGC CAGTACTCGC CGCCGCCGCC CGGTCCCGGT GCCCGCCCGC CCGCGGGTGG TGGCGCCCCC TATCCTCCGG GCGGCCCGGG CCCCGCGACC GGCCCGTACC CGCCGCCCGG CCCCGCCACG GGCGGCTACC CGTCGGCGGA CGGCCCGAGC GGCCCGGCCA GCCCGGGCGC TCCGGGCGGT GGGGGTGCCC GCCGGGACTC GACCTCCGTC CGCGGTGCCC TGGTGCCGCG CGCCGGCGCC GGCCGTGCCC CGACGCGCCC GCCGGGGTCG ACCGGTCCGG CGCCGCGGCA GGCCCCCGAG CGGGGGAGCG CGGTAGATGC CGGCGCCGCC CGTCCGGACC TGTATCAGCC GGAGCCGTCC CGAGCGGCCG CGCGCCCGGG GACGGACGCG GGCGCGCGCC GGGGCGGTCC CGACCCGGCC TACCGGGACG CGGCGTACCC GCAGATGGCA CACCGCGACG TCGCGGTGCC CGACGCCGAC CCGCGCGACG CCGGGCGGCG GGACGTGCCC TACCGCGACC GGGAACGCGG ATACCGCGAC GGACCCGCGC GGGACACGCC GTATCGCGAC CCGGACGGCC CGGCTGACGA CGACGGCCGC GGCCCGATGC CCGGCACCGG CCCGCGCGCG CGGGCCCGGG CCGCCCGGCG CGCCGGCGGC CAGGACGGCA CCGGCCCGCA GGACCGGGCC AGATCCACGG GCGCCGGCAC CGAGGTGCTG GGAGCCGTCG GCGCGGCTTC CGCCGGGACC GGGCCGCGCC GGGCCGCGCC CCCGCGGCAG GGCGGCTTCG ACGAGGTCGA CTTCCCCGGC GAACCTGACT TCCCGGACGA TGGCGACGGC CTCGACGGTC CGGACGGCGG CGAGAGCGCC GGGCTCGGGC CCTTCCTGCG CCGCCTCGTG ATCGCCCTGG TCGTGCTCGG CGTGGCCCTC GCGGTGGGCG TCGGTGCCGG CGTCATCTGG GAGAAGGTGC GCCCGAGCGG CGATACGGCG ACGACGGCGA ACACGCCCCC GACGGCGACG CCCGGCACGG GCCCGTCGGC TTCCCCCGCG CCGTCCACCG GTGCCCCCGC GGGCGGCGGC CAGCCGCAGG CCGCGGTGCC CGCGGACTGG GTGGCCTTCA CGGACCCCGA CCAGAAGGCG ACGTTCTCCC ATCCGCCGAC CTGGAAGCAA CGACGGGACA ACACCGGTGT GTTCTTCGGG GAGCCGGGGG CGGGCGCGGT GGGCACACCC GCCGAGTACG GCCCGCAGAT GATCGGCGTC GCCCGGGTCG CGGGCGCGGA CGCCGCGACG GCGCTCAGCC AGGTCCAGAG CAGTGAGTTC GGCAGCGTCT CGGGCCTGAC TCAGGACCGC TCGGGCCCGG CGACGGACAC GTCCGGCGCG ACTGTGCAGG AACTGGCGGG CTCCTACGAC CGTGACGGCC AGCGCGTCTC GTACCTCATG CGCACGAGCG AGGCGCCCGG CGCGGTCTAC GTGCTCATCG CCAGGGTTCG GGCGGACGCC TCGGCGTCAC TGAACACGAT GATGGGCGCG CTGCGCGCCT CGTTCCAGCC GGCCGCCTGA
|
Protein sequence | MTDGAWRGDP RDPAGPGYRE RPPGAPDDPG PDGYWDGPDQ HQQYRNDPRG NGSHQGVPHY QGGPPAGYPD GPQPGGAYQQ PGYPPAADPG RAYGSGPGGG PYPPAGPAGG QYSPPPPGPG ARPPAGGGAP YPPGGPGPAT GPYPPPGPAT GGYPSADGPS GPASPGAPGG GGARRDSTSV RGALVPRAGA GRAPTRPPGS TGPAPRQAPE RGSAVDAGAA RPDLYQPEPS RAAARPGTDA GARRGGPDPA YRDAAYPQMA HRDVAVPDAD PRDAGRRDVP YRDRERGYRD GPARDTPYRD PDGPADDDGR GPMPGTGPRA RARAARRAGG QDGTGPQDRA RSTGAGTEVL GAVGAASAGT GPRRAAPPRQ GGFDEVDFPG EPDFPDDGDG LDGPDGGESA GLGPFLRRLV IALVVLGVAL AVGVGAGVIW EKVRPSGDTA TTANTPPTAT PGTGPSASPA PSTGAPAGGG QPQAAVPADW VAFTDPDQKA TFSHPPTWKQ RRDNTGVFFG EPGAGAVGTP AEYGPQMIGV ARVAGADAAT ALSQVQSSEF GSVSGLTQDR SGPATDTSGA TVQELAGSYD RDGQRVSYLM RTSEAPGAVY VLIARVRADA SASLNTMMGA LRASFQPAA
|
| |