Gene Information |
Locus tag | Franean1_5555 |
Symbol | |
ID | 5673885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6727674 |
End bp | 6730538 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244411 |
Product | patatin |
Protein accession | YP_001509815 |
Protein GI | 158317307 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03607] patatin-related protein |
|
|
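The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values listed, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 6727674
end_bp = 6730538
gene_length_bp = 2865
protein_length_aa = 954

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is made of whole codons. Assumption: the last codon is the stop
# codon and is not counted in the protein length.
codons, remainder = divmod(span, 3)

record_is_consistent = (
    span == gene_length_bp
    and remainder == 0
    and codons - 1 == protein_length_aa
)
print(span, codons, record_is_consistent)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.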
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.744518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCA GCGGCAACCA GCTCAATGTC GACCATGACG ACCTGGAGGA CATTCGAATC GCGGTCGTTC TCAATGGCGG GATAAGCCTC GCGGTGTGGA TGAGCGGGGT GGTCAACGAG ATCAACAGCC TTACCCAGCG CCGTCCGAGT GATCCGGCCC CGGGTAAGGC GGATCTGCGC TTTTCGGAAG CGGCGGCGGT CTACGGCGGA CTTCTTGATC TCGTCCACGG GAGGGCGCGG GCAGATGTGA TCGCGGGGAC GTCGGCCGGT GGCATCAACG GGGCGTTGCT GGCGTATGCC CAGGCCTACG GGGCCGATCT GCGCCCGCTC GGCGAGTTGT GGGCCGAGCT GGGCTCGTTC GACGCGCTGC TGCGCGACCC GCGTGAGACG CACCCGGCGT CCCTGCTGCG CGGCGATGAC TATTTCCTGC CCGAGCTGGT CAGTGCCTTC GAACGCATCG TGCCCGCGGG CGAACGATCC CAGCGGTATG TGCCGGCGAG CGAGCGGCCG ATCGACCTGA TCATCAACAC GACGCTGATG CGCGGCCAGC CGAAACAGCG CGTCGACGAC TTCGGGACGG AGATCATCGA ATCGGCCCAT ACCGGAGCGC TGCGATTCAC CCGCGCTGCG GACGCGTCAC CCGGACTTGA CCCTTTCTGG GACGCGAGGA TTACCCACCG GCTGGCGCTG GCGAGCCGGA GCACCGCGTC GTTTCCGGTC GCCTTCGAAC CGAGTTTCAT CCCGGTGGGG GAAGCGGGGA GGGATTCCTA CCATCCGGAC ATGGGCGCCG GCGCGGGCCT GCCGGCGGTG GCGCAGTTCG ACCGGAGCCG GTACGTCATG GACGGCGGAG TGCTGCTGAA CCAGCCGGTG AAACCGGCGC TCGCCGCGAT CTACGCCCAG CCGGCGGAGC AACAGGTACG GCGGCTCCTG GTGCATGTGA ACCCCGACCC GAGCAGTCCC GCCCCGGCCG AGGTCGCCGG CCTCGGCGAC GTCTACAGCC TCGGCAGCGA TACCGACGAC CGGCCGCCGA CGCCGGCCGC GGTGCTGCGG ACGCTGGCCA CGTTGCCCTA CGCGCAGTCG GTCGACGCCG AGCTCACCGA GATCCAGACG ACGAACGACC GGGTCCGCCG CTACCGCCCT GACCGCGCGA GGATCGTCCA CCACCTCAAC GAGAAGCTCG CCGAGAAGCT GATGGACGGC TACCGGAGCA CGCGCTCGTA CCAGGAGGCC GATCGCATCG GTGCGCTGCT GGCGGCGGCT GCCCCGCGGC CCCGGATCTG GTGGAGCCGT CCCGAGCTCG CAGCGGTGCT ACGGGGCGTC GGCGCGCGCG CCGACGGCGT CTCCTACATA CCGCCGGACG ACCATCTGCC AGCCGCTGAC GCCGACCCCG CGCGCTGGGC CTGGGGGGTC GAGCCGGTTG AGCGCATCGG CGCCCTAGCG GTCGACGTCT TCAAGCGCGC CATGTGGCTC GCCGATCCGA CCGACCCCGC CGAAGTGACT ACCCGCCGGC GGCTGCGCGG GCACCGCCGG CGCCTCCACG AAGCGCTGGC CGAGCTTCGT GAATGCAGCC GTGACGACGA GGTGTTCTGG CGGAGCTGGG CCACCCGGCC GCCGGCTCCC CCGGCGCGGG GCTCGGACGC GACAGAGCGT CAGGAGTGCC TTCGCTCCTG GGTGCAGGCC GCGCTGTCAC GCTGGCCGCT GCCGCCGGGG GCCAAGGAGG CCCCGGCCGC CTCCCTACCT CACCGGCTGG GCGCCGTCGC GGAGCGGATC GCGCGCGTCC TGGCCGAGGC GGGCGGCGAC 
CTGCGGCATG TCGCCGCGCG CGCCCGGCCA GGCGGTCCGC CGGCTGGCCC GCCCGGCGTT CGGCACGCGG CCGGTCCTCC CGACGCGGCT GGCGACTCCG ATGCGGCGGA CGAGGCCGAG TTGCTGCACA ACCTGGTCGA GGCGTTGTTT CCCGACGGCG CGCAGCCGTC CCCGGCGTCG TTGCTGCGCA GGCTGCTGGC GGTAGAGGTG TGCCAGATCG CGATCTCCGG CGCGCCGCCC GAGGTGGAGC AGGAGGTGCT GCTCCAGCAG ATCAGCGGCT TCACCCCCAA CTCGTTCGGC GGGCCCACCA CCCCGGCCAA GATCGCCGGA ACGGAACTGT TCTGGTTCGG CGGGTTCGCC AGCAGGTCCT GGCGGGTCAA CGACTGGATC TGGGGCCGGC TGGACGGCGC CACGCGCATG GTCCAGACCG TCCTCGACCC GGGCCGGCTG CGCCAGCTCG GCCTGTCGGC CAAGGACACT CACGACCAGC TGCGCCGGCT CGCCGTCGGG GGCAGCTATC AGACCCGGTT GGGCGAGCAC TTTGACGCGC AGAGCGAGGC GATCCTCGCG GAGCTGTCCT TCCTCGACGA TCCGGAGGCA AGAGCAGAGC CCCTCGTGGC CACTGCGCTC GCGGTCGCCC GGCGGCTGCA CGCCGACATC CTCACCGAGG AGCTGCCCCG CCTCGCCGTC GCGATCGAGG AGGATCGGAA GGAAGGCGGC CTTCCCACCG CGGGGGCACG TTTCCTGCGG GTGTGGACGG CCACACCCGA CCCCGGCCTG GACGAGCTCT TCGCCATGTT CGCCGACGCC GAGATCGCCA CGGAGTCATT CGCCGAGGCG GTCTCAGCGG GGCTCCTCTG GAGATCGGCT GCCGTCGCCA CACGGTTTGC CGCGAGCCTG TTCGCCCCGG TCCCGCTGGC CTACACCCCG ATGCTGCAGA CACTGGAACT GGCCTTCGAT GTCGTCCGGA AGGAGGCCGG TAGCGTGCTG ATGCTGCCCG CCGCGGTGAT GAGATGGGCC GGCCGGCAAA CCACCGCGAG GATCCGGGAA CACATCGGCG GATGA
|
Protein sequence | MSRSGNQLNV DHDDLEDIRI AVVLNGGISL AVWMSGVVNE INSLTQRRPS DPAPGKADLR FSEAAAVYGG LLDLVHGRAR ADVIAGTSAG GINGALLAYA QAYGADLRPL GELWAELGSF DALLRDPRET HPASLLRGDD YFLPELVSAF ERIVPAGERS QRYVPASERP IDLIINTTLM RGQPKQRVDD FGTEIIESAH TGALRFTRAA DASPGLDPFW DARITHRLAL ASRSTASFPV AFEPSFIPVG EAGRDSYHPD MGAGAGLPAV AQFDRSRYVM DGGVLLNQPV KPALAAIYAQ PAEQQVRRLL VHVNPDPSSP APAEVAGLGD VYSLGSDTDD RPPTPAAVLR TLATLPYAQS VDAELTEIQT TNDRVRRYRP DRARIVHHLN EKLAEKLMDG YRSTRSYQEA DRIGALLAAA APRPRIWWSR PELAAVLRGV GARADGVSYI PPDDHLPAAD ADPARWAWGV EPVERIGALA VDVFKRAMWL ADPTDPAEVT TRRRLRGHRR RLHEALAELR ECSRDDEVFW RSWATRPPAP PARGSDATER QECLRSWVQA ALSRWPLPPG AKEAPAASLP HRLGAVAERI ARVLAEAGGD LRHVAARARP GGPPAGPPGV RHAAGPPDAA GDSDAADEAE LLHNLVEALF PDGAQPSPAS LLRRLLAVEV CQIAISGAPP EVEQEVLLQQ ISGFTPNSFG GPTTPAKIAG TELFWFGGFA SRSWRVNDWI WGRLDGATRM VQTVLDPGRL RQLGLSAKDT HDQLRRLAVG GSYQTRLGEH FDAQSEAILA ELSFLDDPEA RAEPLVATAL AVARRLHADI LTEELPRLAV AIEEDRKEGG LPTAGARFLR VWTATPDPGL DELFAMFADA EIATESFAEA VSAGLLWRSA AVATRFAASL FAPVPLAYTP MLQTLELAFD VVRKEAGSVL MLPAAVMRWA GRQTTARIRE HIGG
|
| |
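As a spot check that the gene and protein sequences correspond, the first 30 bases of the gene sequence can be translated by hand and compared with the first 10 residues of the protein sequence. This is a minimal sketch: the codon dictionary below holds only the nine codons needed for this prefix (all of which have the same meaning in translation table 11 as in the standard code), and `translate` is a hypothetical helper, not part of any database tool.

```python
# First three 10-base blocks of the gene sequence, as printed in the record.
gene_prefix = "ATGAGCCGCA GCGGCAACCA GCTCAATGTC"

# First 10-residue block of the protein sequence, as printed in the record.
protein_prefix = "MSRSGNQLNV"

# Partial codon table: only the codons that occur in gene_prefix.
# A complete table would have 64 entries.
CODONS = {
    "ATG": "M", "AGC": "S", "CGC": "R", "GGC": "G", "AAC": "N",
    "CAG": "Q", "CTC": "L", "AAT": "N", "GTC": "V",
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "")
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


translated = translate(gene_prefix)
print(translated == protein_prefix)
```

Checking the whole sequence the same way would need the full 64-codon table and handling of the final TGA stop codon.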