Gene Franean1_2294 details


Gene Information

Locus tag: Franean1_2294
Symbol:
ID: 5670693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2743192
End bp: 2745522
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 78%
IMG OID: 641241214
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001506635
Protein GI: 158314127
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
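
The coordinates and lengths listed in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends with a single stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2743192
END_BP = 2745522
GENE_LENGTH_BP = 2331
PROTEIN_LENGTH_AA = 776


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, the 2331 bp span corresponds to 777 codons, of which the last is the stop codon, which matches the listed protein length of 776 aa.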


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.237204
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.953855
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGTC CGGTCGAGTC GGCGCTCACC GGCGTCCCGT CCCGGACCGC CGCGGCGCGG 
TCTGAGACCA CCGCGGCACC GGCTGGCGCC CTGCCGGCCG GTACCGTGCC GGTCCAACCG
GTGGACTCGC CGGTCCTGGG AGCGGTCGAG CCGCCGCTCC CGGCACCGCG GGACCTGGTG
GACGAGTGGG CCACGCCGCC GAGCGTGCGC GGGCCCTCAC CCGGCCCGGA CGGCTCGATC
GCCTTCGTCG GGGACGCCAC CGGGCGCCCG GCCCTGTGGG TCCGCGCGGC CGACGGGACG
GAGCGGGTCC TCGACACCGG CCCGGCGCAC GTCCGGTCGG CGCTGTGGTC CCCGGACGGC
GCCTGGATCG CGATCACCGT GGCCCCGGGC GGCGGGGAGC ACACCGAGGT CCACCTTGTC
CGCCCGGACG GCAGGGTCCG CCCGGACGGC AGGGTCCGCC CGGACGGCAC GGCGGCGCAC
CGGCTGGCCG GTGGCGTCCG GCCCGGCACG GCGGGAGCCG TGGAGGCGTG CGCGGCCACG
GTCTCCCGGT GGGCGGCGGG CGGCCGGCTG CTCGTGGTCA CCGAGTCGGC CCGCTCCGGG
CTGACCCACG CGGTCGCCGT CGACCCGGCC GGCCACCGCC GCCACCTCGC GGTGGGGCTG
GCCCTGCAGG TCTGCGCCGT CCACGAGACC GCCGACCGGT GGCTGCTGCT CCTGCGGGAG
GGCCCGCGGG GCGCCCGGCG CGTGCTGGTG GCCCGGGTGG ACGCGCCCGA CCCGCTGGCA
CCGGCCGCCG CCCCGCTCGA GGCGTTCCCG CTCAACGACG AGATGGCCGG CGGAGTGGCC
GGCAGCGGGG GGACCGTCGG AGGGGTGGCG ACCGAGGCGT TCGAGGTCGC GGGCGGCACC
ACCACGGCCG TCTCAGGCAC CTTCGCCGCG GACGCCTCGC GTGCCCTGCT CGCCTGCGAC
CTGGGCCGCG AGCGGCCGGG CCTGCTCGAG GTGCCCCTGG ACCCGCACGG CCGGCCGGGG
CCGACCCGAC TGCTGGCCGG CCGGGACGAC GCGGACCTGG AGCGCTTCCT CCTCCTCGAC
CCGGCCACGG CCGTGCTCGG CTGGAACGTC GGCGGGCGCA CCGAGCTCGC CGTCCACAGC
CTGGACGACG GGACGTCGCG GGCCCTGCCG CCGCTGCCGC GCGAGGTGGT CACCGGCCTG
CTCCCCGGGC CCGGCGGGGC GAGCCTGCTC CTGGCGCTGG ACGGCTCCAC CGCCCCCAGC
GAGGTGTGGA CCTGCGACCT GACCGGCACC GCGGCGGGCA TTCCGCCCGG CACCGCGTCG
GACGCCCCGG ACGGCACCTC GGCGGGCACC TCGGACAGCG GCGTGCCGGC CTACCGCTGC
CTGGTCTCGC ACACGCCGAC GGCGTACCCC ACCGTCGAGA TCACGGCCGT GGCCGCGAGC
GGGACGCACA CCGCGGGCCC GGGCCCGTGC CCGGCGGGGG CGCCCGAGGT GCCCGGCGGG
ACGGAGGACG TGGGCTACCG GTTCGTCCGC CCGATCGCGC GCCGGTTCCT CGCGCACGAC
GGGCTGGAGC TCACCGGCTG GTGGTACCGC CCGCGGGTGG CCCCGGGGCC GGTGCCCACC
CTGCTCTACT TCCATGGCGG CCCGGAGGCG CAGGAGCGCC CCGTCCTCAA CCCGCTCTTC
CACGCGCTGC TCGCCCGCGG CATCGCCGTG TTCGCGCCGA ACGTGCGCGG CTCCACCGGG
TTCGGTCGCT CGTTCGAGGA GGCCGACCAC CTGGCCGGCC GCTTCGCCGG CATCGCCGAC
GTCGCGAGTG CCGTGACGCA CCTGGTCACC GAGGGCCTGG CCGCGCCGGG CCACATCGGC
GTGGCCGGCC GGTCCTACGG CGGGTACCTG ACGCTGGCCG CGTTGGTCTG CCACCCCGAG
CTGTTCGCCG TCGGTGTCGA CGTGTGCGGG ATGGTCGATC TGGAGACCTT CTACCGGCAC
ACCGAACCGT GGATCGCCGC ACCGGCGGTC ACCAAGTACG GCGACCCGGC GACCGACCGC
GACCTGCTAC GGGCTCTGTC GCCGTTGCAC CGGATGGATG CGCTCGCGGC CCCGCTGCTG
GTCGTGCACG GGGCCAACGA CACCAACGTC CCCGTGTGTG AGGCCGAGCA GACTGTCGCC
GCGGCCCGGG CCCGCGGGAT CCCGTGCGAG TACCTGCTCT TCGAGGGCGA GGGCCACGAG
GTCGCCGAGC GCGCGAACCG GCTGGTGTTC GTCCGCGCCG TGGTGGAGTT CGTCGCGGCG
TGCCTGACCG GCGCGCAGGC GCCGGCGGAC ACCCTCGGCG AGGCCGTCTG A
 
Protein sequence
MIRPVESALT GVPSRTAAAR SETTAAPAGA LPAGTVPVQP VDSPVLGAVE PPLPAPRDLV 
DEWATPPSVR GPSPGPDGSI AFVGDATGRP ALWVRAADGT ERVLDTGPAH VRSALWSPDG
AWIAITVAPG GGEHTEVHLV RPDGRVRPDG RVRPDGTAAH RLAGGVRPGT AGAVEACAAT
VSRWAAGGRL LVVTESARSG LTHAVAVDPA GHRRHLAVGL ALQVCAVHET ADRWLLLLRE
GPRGARRVLV ARVDAPDPLA PAAAPLEAFP LNDEMAGGVA GSGGTVGGVA TEAFEVAGGT
TTAVSGTFAA DASRALLACD LGRERPGLLE VPLDPHGRPG PTRLLAGRDD ADLERFLLLD
PATAVLGWNV GGRTELAVHS LDDGTSRALP PLPREVVTGL LPGPGGASLL LALDGSTAPS
EVWTCDLTGT AAGIPPGTAS DAPDGTSAGT SDSGVPAYRC LVSHTPTAYP TVEITAVAAS
GTHTAGPGPC PAGAPEVPGG TEDVGYRFVR PIARRFLAHD GLELTGWWYR PRVAPGPVPT
LLYFHGGPEA QERPVLNPLF HALLARGIAV FAPNVRGSTG FGRSFEEADH LAGRFAGIAD
VASAVTHLVT EGLAAPGHIG VAGRSYGGYL TLAALVCHPE LFAVGVDVCG MVDLETFYRH
TEPWIAAPAV TKYGDPATDR DLLRALSPLH RMDALAAPLL VVHGANDTNV PVCEAEQTVA
AARARGIPCE YLLFEGEGHE VAERANRLVF VRAVVEFVAA CLTGAQAPAD TLGEAV