Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2294 |
Symbol | |
ID | 5670693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2743192 |
End bp | 2745522 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641241214 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001506635 |
Protein GI | 158314127 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.237204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.953855 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGTC CGGTCGAGTC GGCGCTCACC GGCGTCCCGT CCCGGACCGC CGCGGCGCGG TCTGAGACCA CCGCGGCACC GGCTGGCGCC CTGCCGGCCG GTACCGTGCC GGTCCAACCG GTGGACTCGC CGGTCCTGGG AGCGGTCGAG CCGCCGCTCC CGGCACCGCG GGACCTGGTG GACGAGTGGG CCACGCCGCC GAGCGTGCGC GGGCCCTCAC CCGGCCCGGA CGGCTCGATC GCCTTCGTCG GGGACGCCAC CGGGCGCCCG GCCCTGTGGG TCCGCGCGGC CGACGGGACG GAGCGGGTCC TCGACACCGG CCCGGCGCAC GTCCGGTCGG CGCTGTGGTC CCCGGACGGC GCCTGGATCG CGATCACCGT GGCCCCGGGC GGCGGGGAGC ACACCGAGGT CCACCTTGTC CGCCCGGACG GCAGGGTCCG CCCGGACGGC AGGGTCCGCC CGGACGGCAC GGCGGCGCAC CGGCTGGCCG GTGGCGTCCG GCCCGGCACG GCGGGAGCCG TGGAGGCGTG CGCGGCCACG GTCTCCCGGT GGGCGGCGGG CGGCCGGCTG CTCGTGGTCA CCGAGTCGGC CCGCTCCGGG CTGACCCACG CGGTCGCCGT CGACCCGGCC GGCCACCGCC GCCACCTCGC GGTGGGGCTG GCCCTGCAGG TCTGCGCCGT CCACGAGACC GCCGACCGGT GGCTGCTGCT CCTGCGGGAG GGCCCGCGGG GCGCCCGGCG CGTGCTGGTG GCCCGGGTGG ACGCGCCCGA CCCGCTGGCA CCGGCCGCCG CCCCGCTCGA GGCGTTCCCG CTCAACGACG AGATGGCCGG CGGAGTGGCC GGCAGCGGGG GGACCGTCGG AGGGGTGGCG ACCGAGGCGT TCGAGGTCGC GGGCGGCACC ACCACGGCCG TCTCAGGCAC CTTCGCCGCG GACGCCTCGC GTGCCCTGCT CGCCTGCGAC CTGGGCCGCG AGCGGCCGGG CCTGCTCGAG GTGCCCCTGG ACCCGCACGG CCGGCCGGGG CCGACCCGAC TGCTGGCCGG CCGGGACGAC GCGGACCTGG AGCGCTTCCT CCTCCTCGAC CCGGCCACGG CCGTGCTCGG CTGGAACGTC GGCGGGCGCA CCGAGCTCGC CGTCCACAGC CTGGACGACG GGACGTCGCG GGCCCTGCCG CCGCTGCCGC GCGAGGTGGT CACCGGCCTG CTCCCCGGGC CCGGCGGGGC GAGCCTGCTC CTGGCGCTGG ACGGCTCCAC CGCCCCCAGC GAGGTGTGGA CCTGCGACCT GACCGGCACC GCGGCGGGCA TTCCGCCCGG CACCGCGTCG GACGCCCCGG ACGGCACCTC GGCGGGCACC TCGGACAGCG GCGTGCCGGC CTACCGCTGC CTGGTCTCGC ACACGCCGAC GGCGTACCCC ACCGTCGAGA TCACGGCCGT GGCCGCGAGC GGGACGCACA CCGCGGGCCC GGGCCCGTGC CCGGCGGGGG CGCCCGAGGT GCCCGGCGGG ACGGAGGACG TGGGCTACCG GTTCGTCCGC CCGATCGCGC GCCGGTTCCT CGCGCACGAC GGGCTGGAGC TCACCGGCTG GTGGTACCGC CCGCGGGTGG CCCCGGGGCC GGTGCCCACC CTGCTCTACT TCCATGGCGG CCCGGAGGCG CAGGAGCGCC CCGTCCTCAA CCCGCTCTTC CACGCGCTGC TCGCCCGCGG CATCGCCGTG TTCGCGCCGA ACGTGCGCGG CTCCACCGGG TTCGGTCGCT CGTTCGAGGA GGCCGACCAC CTGGCCGGCC GCTTCGCCGG CATCGCCGAC GTCGCGAGTG CCGTGACGCA CCTGGTCACC GAGGGCCTGG CCGCGCCGGG CCACATCGGC GTGGCCGGCC GGTCCTACGG CGGGTACCTG ACGCTGGCCG CGTTGGTCTG CCACCCCGAG CTGTTCGCCG TCGGTGTCGA CGTGTGCGGG ATGGTCGATC TGGAGACCTT CTACCGGCAC ACCGAACCGT GGATCGCCGC ACCGGCGGTC ACCAAGTACG GCGACCCGGC GACCGACCGC GACCTGCTAC GGGCTCTGTC GCCGTTGCAC CGGATGGATG CGCTCGCGGC CCCGCTGCTG GTCGTGCACG GGGCCAACGA CACCAACGTC CCCGTGTGTG AGGCCGAGCA GACTGTCGCC GCGGCCCGGG CCCGCGGGAT CCCGTGCGAG TACCTGCTCT TCGAGGGCGA GGGCCACGAG GTCGCCGAGC GCGCGAACCG GCTGGTGTTC GTCCGCGCCG TGGTGGAGTT CGTCGCGGCG TGCCTGACCG GCGCGCAGGC GCCGGCGGAC ACCCTCGGCG AGGCCGTCTG A
|
Protein sequence | MIRPVESALT GVPSRTAAAR SETTAAPAGA LPAGTVPVQP VDSPVLGAVE PPLPAPRDLV DEWATPPSVR GPSPGPDGSI AFVGDATGRP ALWVRAADGT ERVLDTGPAH VRSALWSPDG AWIAITVAPG GGEHTEVHLV RPDGRVRPDG RVRPDGTAAH RLAGGVRPGT AGAVEACAAT VSRWAAGGRL LVVTESARSG LTHAVAVDPA GHRRHLAVGL ALQVCAVHET ADRWLLLLRE GPRGARRVLV ARVDAPDPLA PAAAPLEAFP LNDEMAGGVA GSGGTVGGVA TEAFEVAGGT TTAVSGTFAA DASRALLACD LGRERPGLLE VPLDPHGRPG PTRLLAGRDD ADLERFLLLD PATAVLGWNV GGRTELAVHS LDDGTSRALP PLPREVVTGL LPGPGGASLL LALDGSTAPS EVWTCDLTGT AAGIPPGTAS DAPDGTSAGT SDSGVPAYRC LVSHTPTAYP TVEITAVAAS GTHTAGPGPC PAGAPEVPGG TEDVGYRFVR PIARRFLAHD GLELTGWWYR PRVAPGPVPT LLYFHGGPEA QERPVLNPLF HALLARGIAV FAPNVRGSTG FGRSFEEADH LAGRFAGIAD VASAVTHLVT EGLAAPGHIG VAGRSYGGYL TLAALVCHPE LFAVGVDVCG MVDLETFYRH TEPWIAAPAV TKYGDPATDR DLLRALSPLH RMDALAAPLL VVHGANDTNV PVCEAEQTVA AARARGIPCE YLLFEGEGHE VAERANRLVF VRAVVEFVAA CLTGAQAPAD TLGEAV
|
| |