Gene Information |
Locus tag | Franean1_5109 |
Symbol | |
ID | 5673444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6117602 |
End bp | 6119296 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243960 |
Product | hypothetical protein |
Protein accession | YP_001509374 |
Protein GI | 158316866 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.344799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.106863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCGCGACT TCCTCGCCGC GGTACGCGGG CTGACGGTCC GCGGTCGCTC GTTCCTCGCC GCGGGCGGGG CGTGCGTGGC CTCGGCAGCG GTCATCGGCG AGCAGGACCT CCTCGCGGTC GGTGCCCTGC TTGTCGCGCT GCCGCTGTTC GCGGCCGGGT TCGTCGCGCG GACCCGCTAC CGGCTGGCCT GCACCCGCCG GTTGGAGCCG CCGCGGGTCA CCGCGGGCGA CACCGTCTCG GTACGGATCC GGCTGGACAA CGTCTCCCGA CTGCCGTCCT CGGTGCTGCT CGTCGAGGAC GCGACCCCCA ACCTCGGCCA CCGCGCGCGC TTCGTGGTGG ACCAGATCGA ACCGGGCGGT TCCCGTGACC TCTCCTACCC GCTGGGCGCC GGGGTCCGCG GCCGGTACCA GGTCGGCCCG CTCACGATCC GGCTGACCGA CCCGTTCGGC CTGTGCGAGC TGGAGCGCAG CTTCCGGGGG CGGGACGAGC TGATCGTCGC GCCCGCCCTG GAGCGCCTGC CGCTGACGCC ACTGGTCGGT TCGTCCTCGC TCAACAACGA GGTACGCCGC TCGTCGGCAC GGGCGGGTGA GGACGATTCC ACCACACGGC CATACCGCTC CGGCGACGAT CTGCGCAAGG TGCACTGGAA GACCACGGCC CGACTGGGTG AGCTGATGGT GCGCCGCGAC GAGCGGCCGC TGACCGGCGC CGCCGCCGTG CTGCTCGACA CCCGGCACGC GGCCTGGCCC GAGATGGACC GGGACGCCCC CTTCTCCTGG GCGGTCGGCG CGGCCGGCTC GATCGCGGTC AACCTCGCCC GCAGCGGCTA CGGCGTCCGG CTGATCGCCG ACACCGGTGT CGCGGCGACC GGTCCCGGCA ACGCCGTCGG CGCGCTGCTC GACGAACTGG CGGTCATCGC GCCGACCCCG TCGGCCACCC TCAGTCCGGC GCTGGCCAGT CTGCGCTCGG CCGAGCACTC CGGCATGGTC GTCGTCGTGC TCGGGCGCAC CGACCAGGCG ACGGCCTCGA TGATCGCGGG TGCCCGGCCG CGCAACGCCC CGGCCATCGC GGTGCTGGTG GATCTCGCCG GCTGGGGCAC CTCCCCGGCG GCCGGGGGCG ACCTGGAGGT CACCCGGCAC ACCCTGACCA GGCACGGCTG GACCGTCCTG GTCGCGGGTG CCGGGGCCCG CCTCGCCGAC ACCTGGCCGC AGATCTTCCG TCCGGGCGCG TCGGCCGGGC GCCGGTTCGC CGTCGGGAAC GCCGTCGGCG GCATGGCGCG GGTGTCGTAC GGGTCCGCGT CGGGCCCCGG TCCGGCCGCC CGCGCGGGGT CCGCCCCCCG CGCGGACCAC GACGCCTCCG CCCGCCGGGC CGGCGCGGCA CCGAACGGCG GCTCGCTCCA CGGCGACTCC CCCCACGACA GCCCGCCCAA CGGCCGCTCA TCCAACGGCG GTTCGACCAA CGGCGGTTCG GGCCACGGCG GCTCCCCCAA CGGCGGCTCG GGCGGCAGCG GCTCGCATGG TCGGGACGCC CGCCGCGGCG CGGCGCCCGC CGGGGGTGGG GCGGACCAGG CCCCCGCCGA CTCGACGCCA CCCCCGCCCG CGACCGGCCC GCTGTACCGG CCCGGCTCCC CCCACGAGCC CGCGGCCCCC GCCACCACGG GAGGCGGTCC CCCACCGAAT GGTCGCGGAT GGTGA
Protein sequence | MRDFLAAVRG LTVRGRSFLA AGGACVASAA VIGEQDLLAV GALLVALPLF AAGFVARTRY RLACTRRLEP PRVTAGDTVS VRIRLDNVSR LPSSVLLVED ATPNLGHRAR FVVDQIEPGG SRDLSYPLGA GVRGRYQVGP LTIRLTDPFG LCELERSFRG RDELIVAPAL ERLPLTPLVG SSSLNNEVRR SSARAGEDDS TTRPYRSGDD LRKVHWKTTA RLGELMVRRD ERPLTGAAAV LLDTRHAAWP EMDRDAPFSW AVGAAGSIAV NLARSGYGVR LIADTGVAAT GPGNAVGALL DELAVIAPTP SATLSPALAS LRSAEHSGMV VVVLGRTDQA TASMIAGARP RNAPAIAVLV DLAGWGTSPA AGGDLEVTRH TLTRHGWTVL VAGAGARLAD TWPQIFRPGA SAGRRFAVGN AVGGMARVSY GSASGPGPAA RAGSAPRADH DASARRAGAA PNGGSLHGDS PHDSPPNGRS SNGGSTNGGS GHGGSPNGGS GGSGSHGRDA RRGAAPAGGG ADQAPADSTP PPPATGPLYR PGSPHEPAAP ATTGGGPPPN GRGW
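The listed lengths follow from the coordinates in the Gene Information table, which can be checked with a short calculation. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length) and that the final codon is a stop codon, as the TGA at the end of the gene sequence suggests. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 6117602
END_BP = 6119296


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1695 564
```

The gene sequence starts with GTG rather than ATG. Translation table 11 permits GTG as an initiation codon, which is read as methionine at the start of a protein, consistent with the leading M in the protein sequence.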