Gene Francci3_3830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3830 
Symbol 
ID3905578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4591139 
End bp4592686 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content74% 
IMG OID637881156 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_482909 
Protein GI86742509 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.778945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0341938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGCTC CGGGCGTGGG TGCTCCGGGG GCGGGTGCTC CGGGGGCGGG TGTTTCGAGG 
ACGGGCGTTT CGGGTCCGGG GGCGCCGGGC GCCCGGGCGG GCTGCGACCC GGGCGCCGCC
GCGACCGACC GCGGGCGCGC TGCGGCGGAC GGGAACCCGC GGGAGCCGTC GGGAGGCTTG
CCGACGGTGA CCGCGGTGCG TGGCTTCGGC ACGCAGCCGC CGGCCCCACC GCCCGCCGAG
ACGGGCTCCC CGCGGGCGTG GTCGCCGACC TCGACGCCGT ATCCGCTGCC CGCGGACTCC
GACGGCGGCG GCTCCGAGAG CGGCGGCTCC GACAGCGGCG GCTCGAACGC CGCGCCGTTC
GCCGTGCCGT TCGCCGATGT CCTGACTGGT CGTTCGGCGC CGCGGGGGCG CCTGGTCGCT
CCTGGTCGCC CGATCGTTCT CGGTCACCGG GTCCTCCTCG CGAGCATCCT GGTCGCGGGT
CTGCTGGGTG GGGGGATCGG TGCGCTGCTC GTCTCGATGA CCGAGCGCGG CAGCGACCAA
CAGGCGTCGG CGGCCCTGCC GGTCGGTACC GCCGCGCCCA GCTCCGACCG CAGGTCGGTC
GCGTCGATCG CCGCGACGGC TCTGCCCAGT GTGGTGACGA TCGACGCCGG TGGCGGGGGG
GACGGCGATG CCGCGGGTAC CGGCTCCGGG GTGATCATTA GCCCGGAGGG GTACATCCTG
ACGAACAACC ACGTCGTGGC GTCCGCGGTC GCCGCCAGAA CTCCGATCTC CGTGCGCCGC
TACCAGGAGT TCGGCCAGGT CCGGGCCGAC CTGGTCGGGC GGGATCCCAA GACTGACATT
GCCGTTCTGC GGATCCCGGC CCCGCAGCCC CTGCCGGCGG TGACGCTCGG CCAGTCCGGC
TCGCTGGTCG TGGGAGCGCC GGTGGTTGCC ATCGGGGCGC CGTTCGGCCT CGCCGGCACC
GTGACGACCG GCGTCGTCAG CGCGCTCGAC CGCAATCCCA TGGTCCCGGC CGAGAGCGGC
ACCGACCCCA CGGTGCTCAT CGGGGCGATC CAGATCGACG CGGCGATCAA TCCGGGCAAC
TCCGGTGGTC CGCTGCTCGA CGGCCTCGGC CAGATGGTGG GGATCAATAC CGCGATCGCC
GCGGTGCCCG GCCATGAGTC CCAGAGCCAG AGCGGCAGCA TCGGCGTGGG GTTCGCCATC
CCGATCGACT TCGCCCGCTC GGTCGCTCAG GAGATCATTT CGACCGGTCG GGCGACCCAC
CCGTACCTCG GCGTGTCAGC CGCGACCGTC ACCGCCGGCC AGGCGAAGGC CATGGGGACC
ACGTCGGGTG CCCGCGTCGT CAACCTGGCG CCCGGTGGCC CCGCCGAGCG CGCCGGGCTG
CGCGTCGGCG ACATCATCAC GCGGGTGGAC ACCAGGGTTA TCAGCGGGAT GAACGACCTC
ATCGTCGCAG CCCGGCTCCA CCGGGTAGGT GACCGGGTGT CGGTCGCCTA CGAACGCGCC
GGTGCCACCG CGACCACCCA GCTGACGCTG CAGGAACAAC AGCGCTGA
 
Protein sequence
MGAPGVGAPG AGAPGAGVSR TGVSGPGAPG ARAGCDPGAA ATDRGRAAAD GNPREPSGGL 
PTVTAVRGFG TQPPAPPPAE TGSPRAWSPT STPYPLPADS DGGGSESGGS DSGGSNAAPF
AVPFADVLTG RSAPRGRLVA PGRPIVLGHR VLLASILVAG LLGGGIGALL VSMTERGSDQ
QASAALPVGT AAPSSDRRSV ASIAATALPS VVTIDAGGGG DGDAAGTGSG VIISPEGYIL
TNNHVVASAV AARTPISVRR YQEFGQVRAD LVGRDPKTDI AVLRIPAPQP LPAVTLGQSG
SLVVGAPVVA IGAPFGLAGT VTTGVVSALD RNPMVPAESG TDPTVLIGAI QIDAAINPGN
SGGPLLDGLG QMVGINTAIA AVPGHESQSQ SGSIGVGFAI PIDFARSVAQ EIISTGRATH
PYLGVSAATV TAGQAKAMGT TSGARVVNLA PGGPAERAGL RVGDIITRVD TRVISGMNDL
IVAARLHRVG DRVSVAYERA GATATTQLTL QEQQR