Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5020 |
Symbol | |
ID | 5673358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6015026 |
End bp | 6016072 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243874 |
Product | hypothetical protein |
Protein accession | YP_001509289 |
Protein GI | 158316781 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.848752 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCGTGA GAGTGTTCGG GTGGGCGGCA GACGCCAGCG CCTGCGGGTA CTACCGGCTG GGGCTGCCGC TGGAGGCTCT GAAGGCCCGC GGCCATCGCA CCCTCGTCTC CACCGTCCTG CCGGCCGGGT GGCTGGACGC GGACATCATC GTCGGCCAGC GGGTGTGTAT GCCTGAGCCC TCGCAGACCT GGCAGCGCCT GGCCCGGGAC GGTCATCTGC TCGTCTACGA GATCGACGAC AACCTGTTCG GCCTGCACCC GACGAACCCA GGCCGCCGTC TGTTCGGCGA CTCGGCGGTG CAGCAGCGCG TCCGCGATAA CGCCGCTGTC GCCTCGCTGG TCACGGTGAC CACCGAGGCT CTCGCGCAGG TGATGCGCGA GATCAACCCC AACGTCGTGG TGTTGCCAAA CCGCATCCCG GGCTGGCTGC TCATCCGCGA CCGGCCCCGC CGGCAGCGGC TCACGGTCGG CTGGGCCGGT TCGGCCACCC ACTACGCCGA CATCGCGGAG ATCGCCTCCC CGCTGCGCCC GCACGTGTTC AACCGCTCGA AGTCGCCGCT GCGCGTGCTG GAGGTGGCGG CCCTCGGCAT CCCCGCGGTG GCCAGCGAGT ACGGCCCGTA CGAGGACTTT GTGCGCCCCG GCGAGACCGG GTTCCTGGTC CGCTGTGACC ACGAGTGGGT CACCTACCTG CGGGAGCTGG CCGGCGATCG GGAGCTGCGC GAGGTGCTGG TCGCCGACGT TCGCACCCTG GCCCTGCCGG CAGCCGACAT GGTGATCCTC GGCGACGTCC TAGAGCACAT GGCCAGCAGT GAGGCTGTCG ACCTGTGGGG CCGAGCCCGC GCTGCCGCCC GCCGCGGCGT GCTGGCCTCC CTGCCCGTGG TGCCCTACCC GCAGGGTCTG GCTGAGGGCA ACCCGTTCGA GGCGCACGTG GAGGAGTGGA GCGACGCCCG CGCGCAGGCC GTGCTGCCCG GGGTGGTCGC TCACGACGTC GACGGGGAGA TCGGCGTGTA CCTCGCCGGC CCGGCCGAGG AGCCGGCATG CGCCTAG
|
Protein sequence | MSVRVFGWAA DASACGYYRL GLPLEALKAR GHRTLVSTVL PAGWLDADII VGQRVCMPEP SQTWQRLARD GHLLVYEIDD NLFGLHPTNP GRRLFGDSAV QQRVRDNAAV ASLVTVTTEA LAQVMREINP NVVVLPNRIP GWLLIRDRPR RQRLTVGWAG SATHYADIAE IASPLRPHVF NRSKSPLRVL EVAALGIPAV ASEYGPYEDF VRPGETGFLV RCDHEWVTYL RELAGDRELR EVLVADVRTL ALPAADMVIL GDVLEHMASS EAVDLWGRAR AAARRGVLAS LPVVPYPQGL AEGNPFEAHV EEWSDARAQA VLPGVVAHDV DGEIGVYLAG PAEEPACA
|
| |