Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5078 |
Symbol | |
ID | 5673413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6077590 |
End bp | 6080019 |
Gene Length | 2430 bp |
Protein Length | 809 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243929 |
Product | hypothetical protein |
Protein accession | YP_001509343 |
Protein GI | 158316835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.126059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0204601 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCTTCGG CGGCACACAG AGCTGACAAC GGCGCGAGCA AGCGTGCTCG TGCCGCGGGT AGTCGCTGCC ATCACCAGGA ATCCGACAAC AACGTCGACG CAACGCGATA CAACTCCGCA GAGCATTCTC TACGAAAGGA CACCCGATTA GCCATGGCGG GGGACGTACC GGCCGACGAG CGGATCCTCA TCACGCTGGA TGACCCCGAC CCCGAGGCCG ACGCGCGGCG CGGCCTGGTC GGTCTGTCGC GAAACGCCGC CGGTGGCGCT GAGGGTGAGC TCCCGGATGG TTCCGCTGGC GACGTCGGTG GCTCCGGCGG AACCGGTGGC GCCGGCCCGT CCGAGATCTC CGGCCCGAGC GGCCCGGGTG ACCCCGGTGC CCAGGACGTG ACCGAGGACC CCCGCCACCG TGAGCGCGCC ATCCACCGCT ACGGCCGCGT CCGGGTGGTG GGCCGGCCGC TGGCGACCGC GGCCGTGGCC ACCGCTCGCG GGGTCTCCGC CGGCACCACC CCGGTGGACG GGACCGTCGT GTCGACGCTG AGCCTGCCCG GCGGGCTGGA CCGCACCGAG GCGCTCGGGG TCGAGGCGTT CCGCCTGCGC GCGGGGGACG ACTACCGCCG GCGCAAGCGC TCGCGGCCGC GCGACGCCGC GCCCTGGGAC ATGAACCGGC CGTGCACCGA CATCGCCCCG CCGCGCGGCT CTGCGGGCCC AGGCGCCACC CCGCTCGCAC CCACCGTGCG CTCGCTGCTC AACGGGCTCG GATCGGACGC CTCGGGACCG GAGGCGACCG GGTCGCCCTC GGGCGCGGTC GGGACGGGCA CGGCCGGAGC GCCGTCCGCC GGAGCGCTCA GCAGTTACCT GGAGGGCTCC GTCGCGGTCG GGCTCATCCT CGTCGAGGGG CCGACGCCGG CCCTGCAGCT CTCCGCCGCG GAACGCACGA AGATCGTGGC CGAGGTGCAG AACGGCCTGT CCTGGTACGC GACGCAGAAC CCGGCGGCCG AGCTGACGTT CAGCTACGAC ATCCAGATCG TCCGGCTGCC CACCCCCGCC AACCCGTCCG CGAACGACCT CGAGGCCCTC TGGCGCGACC CCACGATGAG CCGGCTCGGC TACGCGGCGA ACTTCGACGG TGTCTACGAC TATGTCGAGG CGCTGCGCGC CCGGCTGCGT ACCCGGTGGG CCTACTGCGC GTTCGTCACG AAGTACCCGC TCGGCCACTT CGCCTACGCG TCGGTCGGCG GCCCGCGGAT GGTGCTCGCC GCCGACGCCG ACGGCTGGGG CCCGGACAAC ATCGACCGCG TGTTCGCGCA CGAGACCGGC CACATCTTCG GGGCGCCGGA CGAGTACGGC GGCGCCGGCT GCGACTGCGG CGGGAGCTGG GGTCGGTACG GGGTACCCAA CGGCAACTGC GACTCCTGCG CGCCGGCGCC GGTCGACTGC CTGATGCGCG CCAACACGTT CGCCCTGTGC CGGTACACGC CCGCGCACAT CGGCTGGGGC CACGGCGTGA GCGGCAACCC GGTGCTCCTC CAGGCGAAGG GCCTGGGCGT GCGGGGCAAC TTCGACGTCG TCGCGCCGTC GGCCTACGCC GGCCTCACCC ACGTCTGGCG GGACAACGAC GCCGCCGGCG CACCGTGGCG GGACCCATGG CAGACCGCGC AGGCCCTCGG CCGCGTCGAC GCGGTGACGA TGGTGCAGAG CACCCTGGCG AACCCGGGCC CGCTGGAGGT GGCCGTCCGC GTGGGCTCGC GGCTGTTCTT CCTGTGGCGG GACAGCACCG GGGCGTTCCA GTGGCGCGCG CCCGTCCAGC TCGCCCAGGG CGTCGGCGGC GTCCCGTCGC TGGTGCAGAG CAGGCTGGGC GGCAAGGGCA ACTTCGAGCT GCTGGCCCCG GCCGCTGATG TCGGCATCAT GCACATGTGG CGCAACCACG ACGTCTACGG CTACCCGTGG AGCGCGCCGA AGCTGTTCGC GGCCAACCTG GGACGGGTCG ACGCGGTCAG CCTGATCCAC GGGACGCTGG GCGGCGGGGC CGGGATGCTG GAGGCCGTCG CCCTGGTCGG CACCCGACTG GTGCACCTGA CGCGCGACCA GGCAGCCGTC TGGCGCACCG GCGGGATCTT CGCCGAGGGG GCGTCCGGCA ACCCGGCGCT GATTCAGAGC GTGTTCCCGG GGGCGCGCAA CTTCGAGGTC GTGGTGCCGT CCGCCGGAAC CGGGCTGATC CACTTCTTCC GCGACAACAA CCGCGCCGAC CCGGTGTGGA GCGGCCCCCG CCCGTTCGCA GCGGAACTCG GGCATGTGGA CGCCGTCTCG ATGATCCAGA GCAACTACGA CGGGAACCTC GAGGTGCTGG CCCGGGTCGC GAACCGGCTG TACCTGCTGT ACCGCTCAGG CGCCGCGGCC GTGTGGTCGG CGCCGCGCCG CGTCTTCTGA
|
Protein sequence | MPSAAHRADN GASKRARAAG SRCHHQESDN NVDATRYNSA EHSLRKDTRL AMAGDVPADE RILITLDDPD PEADARRGLV GLSRNAAGGA EGELPDGSAG DVGGSGGTGG AGPSEISGPS GPGDPGAQDV TEDPRHRERA IHRYGRVRVV GRPLATAAVA TARGVSAGTT PVDGTVVSTL SLPGGLDRTE ALGVEAFRLR AGDDYRRRKR SRPRDAAPWD MNRPCTDIAP PRGSAGPGAT PLAPTVRSLL NGLGSDASGP EATGSPSGAV GTGTAGAPSA GALSSYLEGS VAVGLILVEG PTPALQLSAA ERTKIVAEVQ NGLSWYATQN PAAELTFSYD IQIVRLPTPA NPSANDLEAL WRDPTMSRLG YAANFDGVYD YVEALRARLR TRWAYCAFVT KYPLGHFAYA SVGGPRMVLA ADADGWGPDN IDRVFAHETG HIFGAPDEYG GAGCDCGGSW GRYGVPNGNC DSCAPAPVDC LMRANTFALC RYTPAHIGWG HGVSGNPVLL QAKGLGVRGN FDVVAPSAYA GLTHVWRDND AAGAPWRDPW QTAQALGRVD AVTMVQSTLA NPGPLEVAVR VGSRLFFLWR DSTGAFQWRA PVQLAQGVGG VPSLVQSRLG GKGNFELLAP AADVGIMHMW RNHDVYGYPW SAPKLFAANL GRVDAVSLIH GTLGGGAGML EAVALVGTRL VHLTRDQAAV WRTGGIFAEG ASGNPALIQS VFPGARNFEV VVPSAGTGLI HFFRDNNRAD PVWSGPRPFA AELGHVDAVS MIQSNYDGNL EVLARVANRL YLLYRSGAAA VWSAPRRVF
|
| |