Gene Franean1_5550 details


Gene Information

Locus tag: Franean1_5550
Symbol:
ID: 5673880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6720875
End bp: 6722932
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 67%
IMG OID: 641244406
Product: prolyl oligopeptidase
Protein accession: YP_001509810
Protein GI: 158317302
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
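
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 6720875
END_BP = 6722932
GENE_LENGTH_BP = 2058
PROTEIN_LENGTH_AA = 685


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS of this length, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 2058
print(coding_length_aa(GENE_LENGTH_BP))  # 685
```

Both results agree with the Gene Length and Protein Length fields listed in the record.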


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.11141
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACG ATGATCCGCA CTGCTGGCTT GAGGACGTCG CCGGGGAGGC CGCCCTCGCC 
TGGGTGAGGG AACACAACGC CGAGACGTTC GCAGAGCTCA CCCAGTCGGC CGAGTTCGCC
GGGCTGCGCG CGGAGATCCG CGAGGTGCTC GACTCGGATG ACCAGATTCC CTACCCGGTC
ACGAGGGGCG AGTACCTCTA CAACTTCTGG CGCGACGCGG GTCATCCGCG TGGCCTGTGG
CGACGCACCT CGCTGGCGTC GTATCGCGAT GCCAGCCCGG ACTGGGAAGT CGTGCTCGAC
GTCGACGCGC TCGCCGCCCG CGAGGGCGAG AACTGGGTGT GGCACGGCGC GCGGGTGCTG
CGACCCGAAT ACCGGCTGGC TTTGGTCGAG CTGTCCCGTG GCGGCGCGGA CGCCTCCGTC
ATCCGTGAGT TCGACCTGGT CACGAAGTCC TTCGTGGAGG ACGGCTTCTT CCTGCCCGAA
GCCAAAAGCT CGGTCGGTTG GCTCGATGAA GACCGGATCT ATGTCGGTAC CGATTTCGGC
GAAGGATCGT TGACCACTTC CGGCTACCCC CGCGTGATCC GAGAATGGCG GCGCGGCACT
CCGCTCGACG ATGCCGTCAC AGTCTTCGAA GGCGAGGTGG ATGACGTCTC GGTCGACGCC
TACCACGACC CGACCGAGGG TTTCGTGCGC GACTTCGTCG ACCGAAGTAT CGACTTCTAC
CACACCGAGA CGTTCCTGCG TATCCCGACG GGGCTGGTCC GCATCGACGT CCCGGACGAC
GCGCACACCT CGGTTCACCG GGAGTGGCTG CTCATCACCA CCCGGTCGGA GTGGGCTGTC
GACGAGAAAG TCTATCCGGC GGGCGCCCTG CTCGCCGCCG ACTTCGAAGC ATTTCTCGCC
GGACGACGAG AGCTCGAGCT CCTCTTCGAG CCGGACAAGC ACACCTCGCT CGCGTACCAC
GCGTGGACTC GCAATCATCT CATCGTCGCC ACTTTGCGGG ACGTGAAAAG CCAGTTCACC
GTGCTCACGC CGACAGCCTC GGGCTGGACG AGGGCGCCGC TGGCCGACCT CCCGGACACC
GGCTCCGCCT ACATCAGCGA CACCAGCCCC GACGTCGACG ACGAATACTA CCTCGGTGCC
AGCGGCTACA CCCAGCCAGC GACCTTGTAC CGCGGCGAGA TCGGCGTGGA GCCCGAGATC
CTCAAGCAGG CACCGGCGTT CTTCCCGACC GACGGCCGCA CCGTCAGCCA GTATTTCGCC
GTGTCCGACG ACGGCACCCG CGTCCCCTAT TTCGTCGTCG GTACCGGAAA ACCCGGGCCG
ACGCTGCTCT ACGGGTACGG CGGCTTCGAG GTCTCGCTTA CCCCCAGCTA CAGCGGTACT
GTCGGCCGGG CCTGGCTGGC GCGCGGCGGC ACCTATGTCG TCGCCAACAT CCGCGGTGGC
GGCGAATACG GCCCGGACTG GCATCAGTCA GCCATCCGCG AGAACCGGCT GCGTGCCTAC
GAGGATTTCG CCGCCGTGGC CGGCGACCTC GTCGGCCGGG GGATCACGAC GCCGGCCCAG
CTCGGCATCG AGGGCGGTTC CAACGGCGGG CTGCTCATGG GCGTCATGCT CACCCGCTAC
CCGGAGCTGT TCGGAGCCGT CGTCTGCTCG GTGCCGCTGC TTGACATGCG CCGCTACCAC
CAGCTGCTGG CCGGAGCCTC GTGGATGGCC GAGTACGGCG ACCCTGACGA TCCAGCCGAC
TGGGCGTTCA TCAAGGAATA CTCGCCGTAC CAGAATGTCC GGCCCGGCCG TCGTTACCCG
CCGACGTTCA TCACCACGTC AACCCGCGAC GACCGCGTCC ACCCCGGGCA CGCCCGCAAG
ATGGTCGCGC GGCTGCGCGA ACTCGGCTAC GCCATCCGCT ACTACGAGAA CATCGAAGGC
GGTCACAGCG GCGCCGCCGA CAACGAGCAG CTTGCCCACA AATCAGCCCT GATCTACGAG
TTCCTCTGGC GCACCCTCGG CACCGGTGCG CTGTCAGACG GAGACCGTCC TGCCGCCGGG
GCCGGTCGGA CCGAGTGA
 
Protein sequence
MADDDPHCWL EDVAGEAALA WVREHNAETF AELTQSAEFA GLRAEIREVL DSDDQIPYPV 
TRGEYLYNFW RDAGHPRGLW RRTSLASYRD ASPDWEVVLD VDALAAREGE NWVWHGARVL
RPEYRLALVE LSRGGADASV IREFDLVTKS FVEDGFFLPE AKSSVGWLDE DRIYVGTDFG
EGSLTTSGYP RVIREWRRGT PLDDAVTVFE GEVDDVSVDA YHDPTEGFVR DFVDRSIDFY
HTETFLRIPT GLVRIDVPDD AHTSVHREWL LITTRSEWAV DEKVYPAGAL LAADFEAFLA
GRRELELLFE PDKHTSLAYH AWTRNHLIVA TLRDVKSQFT VLTPTASGWT RAPLADLPDT
GSAYISDTSP DVDDEYYLGA SGYTQPATLY RGEIGVEPEI LKQAPAFFPT DGRTVSQYFA
VSDDGTRVPY FVVGTGKPGP TLLYGYGGFE VSLTPSYSGT VGRAWLARGG TYVVANIRGG
GEYGPDWHQS AIRENRLRAY EDFAAVAGDL VGRGITTPAQ LGIEGGSNGG LLMGVMLTRY
PELFGAVVCS VPLLDMRRYH QLLAGASWMA EYGDPDDPAD WAFIKEYSPY QNVRPGRRYP
PTFITTSTRD DRVHPGHARK MVARLRELGY AIRYYENIEG GHSGAADNEQ LAHKSALIYE
FLWRTLGTGA LSDGDRPAAG AGRTE
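
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the gene sequence is given 5' to 3' on the coding strand. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in the start codons it permits, and this gene starts with ATG). The helper names `clean` and `translate` are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence block.
first_line = "ATGGCCGACG ATGATCCGCA CTGCTGGCTT GAGGACGTCG CCGGGGAGGC CGCCCTCGCC"
last_line = "GCCGGTCGGA CCGAGTGA"

print(translate(first_line))  # MADDDPHCWLEDVAGEAALA
print(translate(last_line))   # AGRTE
```

The two outputs match the first 20 and last 5 residues of the protein sequence. Passing the full gene sequence block to `translate` would allow the whole protein to be compared in the same way.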