Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7272 |
Symbol | |
ID | 5675573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8877861 |
End bp | 8881166 |
Gene Length | 3306 bp |
Protein Length | 1101 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641246109 |
Product | patatin |
Protein accession | YP_001511497 |
Protein GI | 158318989 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03607] patatin-related protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.341762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.239541 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGCA CGTTTAGTCT TGCGGCCATG ACGGAGGGCA TGGCGGCAGT CGCCGACGAG TCCGACGACG GGTCGACCCA GGAGGTCCGG CTCGCGGTCG TGATGACCGG CGGCGCGAGC CTGGCGGTGT GGATGGGCGG CGTCGCCCGG GAGATCAACC TCCTGACCGG CGCCTCCGCG CACCGTGACC GCGCCGCGGG TGACGGTCAC GTCGGCGTGG CCGGGACCGG AGACCAGGGC CTGTCTGCCG CCGACGCGGC GGCCGCGTCC CGCTGGGCGG CGCTGCTGGA GATCCTCGAC GTCGAGGTGT CGGTGGACGT CCTGGCCGGG GCCTCGGCGG GTGGGATCAA CGCGGCGCTG CTCGGCTACG CCAACGCCCA CGACGCCGAC CTGGGCCCGC TGCGCGATCT GTGGATCACG CTGGGATCAC TGCGCGAGCT GATGCGACGC CCGGGCGAGC GGAGCTTCCC GTCGCTGCTG CGCGGCGACG CCGTGATGCT GCCCGCCCTG CAGACCGCGC TGCGCAGCCT CGAGCCCTCG TCCCGAAGCG GTCACACCGC CGAGGCGAGG CGCGCCAGAT CCGCGCGCAC GGTTCGCCCG ACCTCGGTGT TCATCACGAC CACGATCCTG CGCGGCGAGT CGGCGCGCTG GTCGGACGAC CTCGGCGGCA TCGTTCGCGA CCTCGACCAC CGCGGCCTGT TCGTCTTCGG CGAGGACGAC CTGACCGACC CCGAGGCGGC CAACCGCCTG GCGCTTGCCG CCCGGGCCAG CGCGGCCTTC CCCGGTGCCT TCGAGCCGGC CTACATTCCC GTCCAAAGCT CGCCCGACGC GGACCATCCC GACATGGCCG ACTACGTCAA CGCGGCCAGC GGCTTCTACG GCTCCGACGG CGGCATCCTG GTGAACCGCC CGATCGGGCC GGCCCTGCAA GAGATCTTCG ACCGGCCGGC GCACGGGCGG CAGGTCCGCC GCGTGCTCGC CTACGTGGTG CCCGCGCCCA ACCCGCCGGA CGACCCCACG CCGAGCCGAG ACGACGGGTC GACGCCGACA GTCTCCGACA CGCTGCTCGG CGCCCTGGGC GCGGCGCTGA ACCAGTCGAT CGCGAGCGAC CTGCGGCGCA TCAGGGACCA CAACGAGGCG GTACGCGGCG TCCGCGCGAG CCGGCGCGGG CTGTTCCGCC TCGCCCCGGC CGGCGGCCCC CGCCTCGCCG ACGAGGCAAT CTACGAGGCC TACCGCCACC GCGAGGCCGA ACGCGCCGCG GACGTCGTCC TGAGCACGCT GGACCGGATC GTCACCGGCG CGAACGCCGG CGCGGACGCG GCGGCGCTGC TCGCGGACCC GGCCGGACGG ACGGACCTGC GGGCCGCGGC GGTCACGGCC GCGTTGGCGG AGTTCCCGGA CCGGCTGCCC GGCCCGGACG ACCTCGGTGA CCTTTGGCGG CTGGGCCGCC CCGCGGTGGA CGCCGCCAAG GGCATCCTGA TTTCGATGAT CAACGAGGCC TACGTCCTGT CGACCGACCC GGACGACCGG GTGGCGCTGG CGACGCTGGC CCGGGAGGTG CACGCGGCGG TCGTCGCGCC CCGCGAGCAG GCCGGAGCAC ATCTGCTCCC GCTCCCGTCG ACGCCGGCGA ACCGGGGCGG CTTCACCGTC GTCCGCCACA CCCTGCGCGA CCTGGCGGGG GCGTCGTCCG CCGCCGTGGT GGCCGAGGCG ACCCGCCGCT GGCTGCGCGG CGCCGGCGAC GAGGAGACCA GCCGGACCGA CCTCACGCTC GCCTGGAGCA CGCTGGGCTC GGTCACCGAG AACCTGCGCG GTCTTCTGAC GGGCGTCCTC GCCCGCTCAG CACCATCGCG GACGGCCACA ACCGACCTCG CGGACAGGCC CGGCCACGGG GACGGGGAGC ACGGGGGACG ACACGGGCGG GACGGGCAGG ACGAGGACCG GCCCTTCGGG CCCGCGGCGC TGATCGCACC AGGGCCGCGC ACCCACGCGA ACGGGCTGAG CCTCACCGCG CGGCGCCGGG TCGCCGCCCG CACCCTGCGG GACTTCGTCG GCTACCTCCC CACACGGCGT TCGCGGGCCG CGCTGGCCCT GCTCGACCTG CACCTGGTCA CCCACGCCCT GACCGGCGGG GACGTCGTCG ACCAGCCGGT GGAGCTGGTC CAGGTGAGCG CGGACATCCG CTGCGGTCTG GACCCGTCAC GGGCGAGCGC CGACCGGAAG CTGACCGGGC TCCAACTCGG CAACTTCGGC GCGTTCGCGA AGTCGTCCTG GCGGGCCAGC GACTGGATGT GGGGCCGGTT GGACGGCGCC GCCTGGCTGG CCCGCATCCT GCTCGACCCC CGGCGGCTGG TCCACTTACG CGATGCCGCC GCGCCGGGCG CGGAACCGCC CGACAGCACC GCGACCTGGC TCACCGAATT CGTGGAGCTG CTGGCGACGG TCGCCGCGGG CCCCGTCACC GCGGACGTCC TCGACGAGCT CGCCTGGCTC GACGACCCGG ACGCGGCCGT GCCGGCCGCC CTGCCCGAGA CCGCGGCCTG GGTCGCCACC GGGATCCAAC GCGAGATCGC CGCCGGCGAG CTGACCAGCG TCGCCGACGC CGTCCGGGCG GACATCGCCG GTGCCAGGGT CGGCTCCGCG CCCACCCGGG AGTTCCTTGA CGCGATGGCG CCCGGCCCGG CCCAGCTCGA TCCCGCCGAC ACCGCCCGGG TGCTGCGGGC GTGCCGCGTC TCGGACGAGC GGCTGACCGA TCGGGCCAAC AGCCCGATCC TCGCCATCAC GGTCGCCCAG GTGCTGGCGG TCCTCACCGG GTGGCTGGCG TCGCTGCGCG CGCTGCCGCG GCCGCTGCGT CCGGCGGTCG CGGCGGTGCG CGCCGTCGCT CGCGTCGCCT ACGCGCTGGT GGACGACGTG ACGCGCGGCC GGCGCCGGGC GACGATCGCC CTCGGCACGG TGCTGCTCGC CGCCGGGCTC GCGGGCGCGC TCGTCCTGTC CGGCCCGATG GGCGGCGTGG GGCTGCTGGT GGCCGTGACG GGGCTGCTGC TGATCAGCCT CACGGGCTGG CGGGTGCTGC CCGCGGGGCT GGCCGTCGTG GGCGTGGCGG GGCTGGCGGC GGTGGCCGCG GCGGGGGTGA TCCCGGTCGT GGGTGACCAT CTCTTCCCCT GGCTACACGA CGACGCGGTG CCCTACCTGG CCGATCATCC CTGGGCGTGG GCGGCCGTCT TCGGCCTGCT GATGCTCCCG CCGGTGTGGT CACTCGCCGA ACTCCTGCGC CCGCGGCGGC GGAGCCGACA CCGGCCGGCA AACTGA
|
Protein sequence | MSGTFSLAAM TEGMAAVADE SDDGSTQEVR LAVVMTGGAS LAVWMGGVAR EINLLTGASA HRDRAAGDGH VGVAGTGDQG LSAADAAAAS RWAALLEILD VEVSVDVLAG ASAGGINAAL LGYANAHDAD LGPLRDLWIT LGSLRELMRR PGERSFPSLL RGDAVMLPAL QTALRSLEPS SRSGHTAEAR RARSARTVRP TSVFITTTIL RGESARWSDD LGGIVRDLDH RGLFVFGEDD LTDPEAANRL ALAARASAAF PGAFEPAYIP VQSSPDADHP DMADYVNAAS GFYGSDGGIL VNRPIGPALQ EIFDRPAHGR QVRRVLAYVV PAPNPPDDPT PSRDDGSTPT VSDTLLGALG AALNQSIASD LRRIRDHNEA VRGVRASRRG LFRLAPAGGP RLADEAIYEA YRHREAERAA DVVLSTLDRI VTGANAGADA AALLADPAGR TDLRAAAVTA ALAEFPDRLP GPDDLGDLWR LGRPAVDAAK GILISMINEA YVLSTDPDDR VALATLAREV HAAVVAPREQ AGAHLLPLPS TPANRGGFTV VRHTLRDLAG ASSAAVVAEA TRRWLRGAGD EETSRTDLTL AWSTLGSVTE NLRGLLTGVL ARSAPSRTAT TDLADRPGHG DGEHGGRHGR DGQDEDRPFG PAALIAPGPR THANGLSLTA RRRVAARTLR DFVGYLPTRR SRAALALLDL HLVTHALTGG DVVDQPVELV QVSADIRCGL DPSRASADRK LTGLQLGNFG AFAKSSWRAS DWMWGRLDGA AWLARILLDP RRLVHLRDAA APGAEPPDST ATWLTEFVEL LATVAAGPVT ADVLDELAWL DDPDAAVPAA LPETAAWVAT GIQREIAAGE LTSVADAVRA DIAGARVGSA PTREFLDAMA PGPAQLDPAD TARVLRACRV SDERLTDRAN SPILAITVAQ VLAVLTGWLA SLRALPRPLR PAVAAVRAVA RVAYALVDDV TRGRRRATIA LGTVLLAAGL AGALVLSGPM GGVGLLVAVT GLLLISLTGW RVLPAGLAVV GVAGLAAVAA AGVIPVVGDH LFPWLHDDAV PYLADHPWAW AAVFGLLMLP PVWSLAELLR PRRRSRHRPA N
|
| |