Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7000 |
Symbol | |
ID | 5675311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8530266 |
End bp | 8531444 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641245846 |
Product | hypothetical protein |
Protein accession | YP_001511237 |
Protein GI | 158318729 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.550341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAA AGCTGGTCTG TCCGTACTGC TACCAGCAGT TCGGGGAACG GGAGATCTGG TTCCGGTGCT CCGGGCGGCC AGGCCCCACC GGAAAGTCCT GCTCCAGTCA GCGGGATGAA CGCCTGGCGA AACGGATGGG ATTCACCGGC CAGCTTCCGC CGGCGTTCTC GGCGGACGGC CGGAAGCTGT CCGCCGTCCA TTCGGACTGT CGAGCCGAGA CGAACTACCG GTTGTGCCCG GAGTGCCACA GTCAACTTCC GGTGCACTTC GGAAAGATCG AGAATCGGCT CATCGCGCTG GTCGGTGCGA AGGAGAGCGG CAAGACCGTT TTCATGACCG TCCTGCTGCA CGAGCTGATG CACTCGGTCG GAGTCCGGTT CGACGCGTCG GTGCTGGGTG CGGATGACGA GACCCGGGAC AGCTTCCGCA AGCGGTACGA GGCCCCGCTC TACGACAACC ACCAGCTGGC CGCCCCGACA CAGCGCTCAA CGACACCGAT GTCACGCCGG CCGCTGGTGT TCACCTTCAC AGCGCGCGGG CGCGGCCTGG GCCGCCCGCG TCAGGAGCGG ACCGTGCTGT CCTTCTTCGA CACCGCCGGC GAGGATCTGA ACTCGGCGGA CAGCGTCGAA CAGAACGTGC GCTACCTCGC CAGCGCCGCC GGCATCATCC TGCTGCTCGA CCCACTGACG ATGCGCGGTG CCCGGGGCCA GGCGGACCCG GACGCCCCAC GCCCGCACGA ACAGGGCCTG GACAGCCCGG TGAGCGTCCT GGGCCGCATC ACCGAGCTGC TGCAGCGGGC GTTGGGGACG AAGCCCTCCC AGCTGATCGG CACCCCGATC GCCGTGGCGT TCTCGAAGAT GGACGCGCTC ACGCGCGGTC TGCCCGAGGA GAGCCCGCTG CGGCGGTCGC AGCCGGTCGG CTCGCGCTTC GACGCGGCGG ACAGCAGGGA CGTCCACGAC CATGTGCGCG CGCTGCTCGA CGAGTGGGAG GGGTCGTCCA TCGACCAGAC CCTGCGCCAC AACTACTCGC GCTACCGGTA TTTCGGGCTG TCGGCACTGG GGGCCGCCCC CACCGCCGAC CGGCGGGTGG CGACCGGGGT GGTCCAGCCC TACCGGGTGG CCGACCCGTT CCTCTGGCTG CTGAGCGAAT TCGGCGCCAT TCCCAGAACA AAAGGCTGA
|
Protein sequence | MSRKLVCPYC YQQFGEREIW FRCSGRPGPT GKSCSSQRDE RLAKRMGFTG QLPPAFSADG RKLSAVHSDC RAETNYRLCP ECHSQLPVHF GKIENRLIAL VGAKESGKTV FMTVLLHELM HSVGVRFDAS VLGADDETRD SFRKRYEAPL YDNHQLAAPT QRSTTPMSRR PLVFTFTARG RGLGRPRQER TVLSFFDTAG EDLNSADSVE QNVRYLASAA GIILLLDPLT MRGARGQADP DAPRPHEQGL DSPVSVLGRI TELLQRALGT KPSQLIGTPI AVAFSKMDAL TRGLPEESPL RRSQPVGSRF DAADSRDVHD HVRALLDEWE GSSIDQTLRH NYSRYRYFGL SALGAAPTAD RRVATGVVQP YRVADPFLWL LSEFGAIPRT KG
|
| |