Gene Francci3_3796 details


Gene Information

Locus tag: Francci3_3796
Symbol:
ID: 3906081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4548847
End bp: 4550148
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 70%
IMG OID: 637881122
Product: peptidase M16-like
Protein accession: YP_482875
Protein GI: 86742475
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
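
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4548847
end_bp = 4550148

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1302 bp
print(protein_length, "aa")   # 433 aa
```

Under these assumptions the computed values agree with the listed Gene Length (1302 bp) and Protein Length (433 aa).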


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0607988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.721107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGTCGG CCCTGCCGGC ATCCTCCTAT CCGATCGAGC GGACCCGGCT CGGTAACGGC 
CTGCGCGTGC TGCTCGCCCC CGACCACACC GCGCCGGTGG TGGCCGTCTC GGTGCACTAC
GACGTCGGGT TTCGATCCGA GCCCGAGGGC CGTACCGGAT TCGCTCACCT GTTCGAGCAC
CTGATGTTCC AGGGCAGCGA GAACGTCGGT AAGGCCGAGC ATCCGAAGCA CGTCCAGGCC
GCCGGCGGAA TCTTCAACGG ATCGACGCAC CCGGACTACA CGGACTATTT CGAGCTGCTC
CCGGCCGGGG CGCTCGAACT GGCCCTGTTC CTGGAGGCGG ACCGGATGCG GGCGCCGAAG
ATCACCCGCC AGAACCTGGA CAACCAGATC GCCGTGGTGC AGGAGGAGAT CCGGGTCAAC
GTCCTGAACC GCCCCTACGG GGGATTTCCC TGGATCAAGC TGCCGCCGGT CGCGTTCGAC
ACCTTTCCGA ACGCCCACAA CGGCTACGGG GATTTCTCCG AGCTCGAGGC CGCGAGCTTG
GACGACGCCG AGGACTTCTT CGACAAGTAC TACGCACCGG GCAACGCCGT GCTGACCATC
GTCGGCGACA TCGACCCGGA GGAGACGCTC ACCTTCGTCC ACCGGTACTT CGGTGACATC
CCCGCCCGCT CGGTGCCGAC GCGGGTGAGT TTCGCCGAGC CGGTGCCGAG TACCGAGCGC
CGGGCGGTGC TGACCGACCC GCTCGCGCCG CGCGCCGCCC TGGCGGTCGG CTACCGGGTG
CCCGACCCGA TCGGAGACCT GTCCACCTAC CTGTCCTACT ACCTGCTCAC CGAGATCCTC
AGCGACGGCG ACGCCAGCCG GCTCGAACGC CGCCTGGTGC AGAAGGATCG CTCGGTCATC
GGTGTGAGCA CCTACCTTGG CACCTTCGGG GATCCGTTCG AGCAGCGTGA CCCGCTGCTG
CTGACCCTGG AGGCCCGCCA GTCCGAGGAC GCGAGCGCGG ACGCCGTCCT CGCCGCCGTC
GACGAGGAAC TGGCGCGGCT GGCGGGCGAG GGCCTGGCGG ACGGCGAGCT GGAGCGGGTG
CAGGCGCGGG TGGCGTCCTC GCTGCTGCGT GAGTCCGACG ACGCGCTGGG ACGGGCACTC
GCCATGGCCG TGCATGAGCT GCAACGGGGA CGTCCCGAGT TGGTGAACGA ACTGCCCGCG
GAACTGTCCG CGGTGACCGG GCAGGCCGTC GCCGCGGCCG CCCGGACGCT TCTCGACCAG
GGCCGCTCGG TCCTGGAGCT GCGTGCCGGC GCCGCCTCAT GA
 
Protein sequence
MKSALPASSY PIERTRLGNG LRVLLAPDHT APVVAVSVHY DVGFRSEPEG RTGFAHLFEH 
LMFQGSENVG KAEHPKHVQA AGGIFNGSTH PDYTDYFELL PAGALELALF LEADRMRAPK
ITRQNLDNQI AVVQEEIRVN VLNRPYGGFP WIKLPPVAFD TFPNAHNGYG DFSELEAASL
DDAEDFFDKY YAPGNAVLTI VGDIDPEETL TFVHRYFGDI PARSVPTRVS FAEPVPSTER
RAVLTDPLAP RAALAVGYRV PDPIGDLSTY LSYYLLTEIL SDGDASRLER RLVQKDRSVI
GVSTYLGTFG DPFEQRDPLL LTLEARQSED ASADAVLAAV DEELARLAGE GLADGELERV
QARVASSLLR ESDDALGRAL AMAVHELQRG RPELVNELPA ELSAVTGQAV AAAARTLLDQ
GRSVLELRAG AAS
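
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the tool that produced this record. It assumes the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code) and assumes that the first codon, here GTG, is read as methionine when it initiates translation. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is read as 'M',
    even when it is an alternative start such as GTG.
    """
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAGTCGG CCCTGCCGGC ATCCTCCTAT"))  # MKSALPASSY
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence, and the gene sequence ends in TGA, which this table treats as a stop codon.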