Gene Franean1_0787 details


Gene Information

Locus tag: Franean1_0787
Symbol:
ID: 5669203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 913228
End bp: 914904
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 79%
IMG OID: 641239715
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001505151
Protein GI: 158312643
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
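
The length fields above are consistent with the listed coordinates. The following is a minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that matches the values in this record). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(913228, 914904)
print(length)                  # 1677
print(protein_length(length))  # 558
```

Both results agree with the Gene Length and Protein Length fields.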


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.252141
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCACC GAGGCGCCGC CGGCGGGGCC CGGGCCGGGC GGCACCCGTC GGCCGGAGTC 
ACCGGCGGGG TGCTGCTCGC GGTGCTGTCG GCGGTGCTGC TACCCGGCGC CGCGGCCGGC
ACGGCGGCCG CCGCGACGCC CCAGCCGGCG GCCCCGGGTG CGAGCGCCAG GCCGGCCCCC
CGGACAACCC CCGCGCCGAC GGCCTCGGCA CCGGGCCGCT CCCCGTCGGG CGGGCTGCCA
TCCAGCGGGT TCTCGTCCGC CGACGCGCAG TGGTACCACG AGAGCCTCCG GCTCGCCGAG
GCCCACCGGG TGAGCCGCGG CAGGGGCATC ATCGTGGCGA TCATCGACGG CGGGGTGGAC
GCCACCCATC CGAAGCTGCG CGGGCAGCTC CTCTCCGGCG CCGGGGTCGG TGCGGACGCC
GCTCTCGACG GGCTGCGCGA CGACGACCCC GACGGCCACG GCACGGCCAT GGCCGGGCTG
GTCGCCGCCC GCGGTGACGT CGGTGACCCG GCGGTGTGGG GGGTCGCGCC CGAGGCGAAG
ATCCTGCCGA TCTCCACCGG CGAGGAGGCC GACTCCGAGG AGGTCGCCCG TTCGGTGCGG
ATCGCCGTCG ACCGGGGGGC GGGCGTGATC AGCATGTCCC TCGGCTCGGT GGGGCGGGCG
ACGGGCGCGG AGGAGAGCGC CGTCCGCTAC GCGCTCGCGA ACGACGTGGT GGTCGTGGCC
TCGGCCGGCA ACACCGAGCC GGGCGACACC GAGGTGAACT CCCCGGCGAA CATCCCGGGC
GTGATCGCCG TGACCGGATC GGACTACCGC GGGATGTTCT GGGGCGGCTC GGTCCAGGGG
CCGGAGGCCG TCCTGGCCGC GCCCGGACCG GGCATCCGGG CCCCGGTGCC GACCAGGGTG
TCGCCCGACG GCCTGGACAC CGGAGGCGGC ACCAGCAACT CGGCGGCGAT CGTCGCCGGG
GTGGCCGCGC TCGTCCGCGC CGCGAAGCCC GGCCTGGACG CACCCAACGT CATCAACCGC
CTGATCCGCA CCGCGCTCGA CATGGGCCCG GTGGGGCGCG ACAGCCAGTA TGGCTTCGGG
CTGGTCGAGC CGGTGGCCGC GCTGACCGCC GAGCTCCCCC TCGTGGACGC GAACCCACTG
CTCACCGCCC CGATCCCACG CACCGGCAGC AGCGCGGACG GCGGCAGCGG GGCCGGGGGC
GGCGCCACCC CCGACGGGGC CATCCCGGCC CTGCCGACAC CGCCGCCGCC CGCGACCGCG
GCTCCCCCCG TGGGCGCGGC CGGAGCCGGC CCGGACGGGG GTGGGGACGA CCCGTCCGTG
CTGACCTGGG TCGCCGGGCT CAGCCTCGCG GCGTCGGCGG GCGTCCTGCT CGGCGCGCTG
GCGTACGTGC TGGCCGGCGC CCGGTTCGCC GCGACGCTGG GCCGCCGGGG CCGGGCCACG
GCCCATCTGG TCACGGAGCA GGGTGCTGGC CCGCCCGCGA CGGCGCCCGT CCCGCCGGGC
CCCCCGCCCG GTTGGACCAC GCAGCCCGGT TGGGCGGCTG GGCCGACCCG CGCCGCACCA
CCCGACCGGA CAGGGCAGCC CGTCCCGCCG CACCCGGCCG GGCCGGCGGC GGGCGGCCCG
CGCACCCCCG GCGGGGGCGT TCCCGTGGAC ACCCGCGGGT GGCGGACACC GCACTGA
 
Protein sequence
MGHRGAAGGA RAGRHPSAGV TGGVLLAVLS AVLLPGAAAG TAAAATPQPA APGASARPAP 
RTTPAPTASA PGRSPSGGLP SSGFSSADAQ WYHESLRLAE AHRVSRGRGI IVAIIDGGVD
ATHPKLRGQL LSGAGVGADA ALDGLRDDDP DGHGTAMAGL VAARGDVGDP AVWGVAPEAK
ILPISTGEEA DSEEVARSVR IAVDRGAGVI SMSLGSVGRA TGAEESAVRY ALANDVVVVA
SAGNTEPGDT EVNSPANIPG VIAVTGSDYR GMFWGGSVQG PEAVLAAPGP GIRAPVPTRV
SPDGLDTGGG TSNSAAIVAG VAALVRAAKP GLDAPNVINR LIRTALDMGP VGRDSQYGFG
LVEPVAALTA ELPLVDANPL LTAPIPRTGS SADGGSGAGG GATPDGAIPA LPTPPPPATA
APPVGAAGAG PDGGGDDPSV LTWVAGLSLA ASAGVLLGAL AYVLAGARFA ATLGRRGRAT
AHLVTEQGAG PPATAPVPPG PPPGWTTQPG WAAGPTRAAP PDRTGQPVPP HPAGPAAGGP
RTPGGGVPVD TRGWRTPH
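
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one possible way to clean a block, compute its GC percentage, and translate it in Python. All names (`clean`, `gc_percent`, `translate`) are illustrative. The codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1; the alternative start codons that table 11 allows are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard amino acid assignments (shared by translation tables 1 and 11),
# listed in TCAG order for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
fragment = clean("ATGGGTCACC GAGGCGCCGC CGGCGGGGCC")
print(translate(fragment))  # MGHRGAAGGA
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the GC content and protein fields of the record to be checked the same way.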