Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6751 |
Symbol | |
ID | 5675064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8210850 |
End bp | 8212472 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245600 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001510991 |
Protein GI | 158318483 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.136114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGTGG CGCGTGGCGA GGCTGGCTGG GATTTCTTCA TCTCTTACAC CGCCGTGGAC ACAGCCTGGG CGGAGTGGAT CGCCTGGCAG TTAGAAGACG CTGGCTACCG GGTGCTGATC CAGGCGTGGG ACTCCGTGCC CGGGTCGAAC TGGGCGGTCC GCATACAACA GGGGACGACC GAGTCTGACC GCACCATCGC TGTGCTCTCG GCCTCCTACC TGCGGTCGGT CTACGGGCAA AACGAGTGGC ATGCCGCCCA CGCCGCCGAC CCCGGCGGCT TCGCCCGCAC GTTGCTCCCC ATCCGGGTGG AGGACTGCCC CCGCCCGGGA CTGCTCGGAC AGATCGTGTC GATCGATCTG TTCGGCCATC CCGCCGACGT CGCCCGCCAG CACCTCCTCG ACGCGATCAG CACAGCACGG GCGGGGCGCG CGAAACCCAC CGCCGCACCC GCCTTCCCCC CACGCCCGGC CCTACCCCCA CAGCAGCCCT CAGCAAGAAC GGCACCCCCC TTCCCCGGCC CGGACCCGAC GGCCTCCCTC GACCAGCCCG CCCTCCGCAC GCACGCCCGA TCCGATCGCC TCCTCCATGG GCCGCAGCGC CGCGTTTCTC TCGCGGTGGT GCTGCTCGTC ATCACCGGCA CCGTCTTCCT TGCCAGTTCC GCGCGGGACA GGAATCCGAG CGCGACCAGC GCCCACTCCG CTGCGCCACC AACGTCGGCT CCCACACCCA GCCTGTCGGG CTCACCCTTA CGCGACCACA CCGACTCGGT GCGGTCGGTG GCGTTCTCCC GGGACGGACG CACGCTAGCC AGCGCCAGCC AGGACGGCAC GGCGCGGCTG TGGGACATCG CCGAGCGGAC CTCCCAACCG TTGACCGGCC GCATCGCAGT GTGGTCGGTG GCGTTCTCCC CAGACAAGCA CACGCTGGCC AGCGCCAACG GCGACAGCAC GGTGCAGTTG TGGGACGTGG CCGAGGGGAC CCTCCCCCAC CCGGTGGCTT CCCTGCCCGG CCACAGCGAC GCGGTGGGAT CGGTGGCGTT CTCCCCGGAC GGACGCACGC TGGCCAGCGC CAGCGACGAC CACACAGTGC GACTGTGGGA CGTGGCCACG GGGACCACCA CCCACACGTT GACCGACCAC ACCGGCCCCG TGAACTCGGT GGCGTTCTCC CGGGACGGGC GCACGCTGGC CAGCGCCAGC GACGACCACA CGGTGCGACT GTGGGATGTG GCCGAGGGGA CCCTCCTCCG CACCTTGCCC GGCCACACCG AGCCAGTGAT GTCGGTGGCG TTCTCCCCGG ACAGACGCAC GCTGGCCAGC GCCAGCCAGG ACAACACCGT GCGGTTGTGG GATGTGGCCG CGCGGACCGC CCCCCGCCTG GTGGGCTCTC TGTCCGACCA CACCCACTGG GTGATGTCGG TGGCGTTCTC TCCCGACGGG CGCATCCTGG CCAGCGCCAG CCAGGACCGC ACAGTGCGGC TGTGGGACGT GGCCGCGCGG ACCACCACCC ACACGTTGAC CGGCCACACC GGCCCCGTGT TCTCGGTGGC GTTCTCCCTG GACGGGCGCA CTCTGGCCAG CGCCAGCGAC GACAACACGG TGCGACTGTG GGACATGAGC TGA
|
Protein sequence | MGVARGEAGW DFFISYTAVD TAWAEWIAWQ LEDAGYRVLI QAWDSVPGSN WAVRIQQGTT ESDRTIAVLS ASYLRSVYGQ NEWHAAHAAD PGGFARTLLP IRVEDCPRPG LLGQIVSIDL FGHPADVARQ HLLDAISTAR AGRAKPTAAP AFPPRPALPP QQPSARTAPP FPGPDPTASL DQPALRTHAR SDRLLHGPQR RVSLAVVLLV ITGTVFLASS ARDRNPSATS AHSAAPPTSA PTPSLSGSPL RDHTDSVRSV AFSRDGRTLA SASQDGTARL WDIAERTSQP LTGRIAVWSV AFSPDKHTLA SANGDSTVQL WDVAEGTLPH PVASLPGHSD AVGSVAFSPD GRTLASASDD HTVRLWDVAT GTTTHTLTDH TGPVNSVAFS RDGRTLASAS DDHTVRLWDV AEGTLLRTLP GHTEPVMSVA FSPDRRTLAS ASQDNTVRLW DVAARTAPRL VGSLSDHTHW VMSVAFSPDG RILASASQDR TVRLWDVAAR TTTHTLTGHT GPVFSVAFSL DGRTLASASD DNTVRLWDMS
|
| |