Gene Franean1_0117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0117 
Symbol 
ID5668542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp138819 
End bp140615 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content75% 
IMG OID641239045 
Productpeptidase C2 calpain 
Protein accessionYP_001504490 
Protein GI158311982 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGATCA AAGGGCAGTG GATCGAAGGG CAGTGGATCG AAGGGCAGTG GATCGAAGGG 
GCTCCGGTCG CCGCCGCGGC GGCGGTGCTG CGCCGGGTCG CGGTGGAGAG CTCGTCGTTG
CTTGCGCCGG TGATCGTCCT GGACAACGAC GAGACCTGGC GGTCACCCTG GCAGCGCCGC
TGCCATGGCC GCATCGACGA GTGGACGTCC GCGCTCGCGG CCGGCGCCCG TGCGGCGCGG
GAGCAGGCGG TGCGTCTGGA GTTCCTTGCG GCTCGGGGCG CCTGGGGAGC CGTGACGCTC
GGGCCGGCGG CGGTGAGCGC CCCGCGCTCG CCCGTCTCTC ATCCGGCCGT CGTGGACGTT
GCCTTCTCTC CGGAGCGGCT CGCGGAGCTG GCCCGCCGGC TCCGGCGGGC CGCGGACGAC
GCGGCCCGGC TCGCGGCCGA GGTTAGGCGC GTCGCGCACG CCCTGCCCGG TGACATGGGT
GGTGCCCTGG TGGCGGAGGG GTGCTGTCGG GGCCTCGACC GGGTGGCGGC CGAGGCACCC
GACATGGCCG GCGCCATCGA CGGCAGACTG GCCTACGCCG CGTCTGGCGG GTCTGTCGCG
TCTGCCGGGT TCGGCGGGTC CGGCGGGTTT GCGACATCCG CTCCGGCCGT GGAGCGGGCG
CCGGGTCCCG AGATGCAGCA CGCCGCCCGG CTCACCGAAC CGCGGTCGGC GGGCGCGGAC
ACAGCGCGCA CCGTGGCCGA GCTGACCGCG TCGATCGGCG CGGACCCGTT CCGCCTCGAT
CCGCGTGAGC TGAGCGAGCT GTCCACCCGG CTGGGCCGGC TCGGCCCGGC CGAGCTGCGG
GCGGTCATCG GCGGGCTGCG GGGGCGCCCG CTGGAGGTGC TCGCCGCCGC GGTCCGTATC
GCGCCGACCC GGGTGGAGCT GCGCACGCTG GGCCTCCTGC CGGCCGTCTT CTCCCTTGGT
GACCTGCTCC TGCGCGGGGC GCCGGCGTCG ATGGTGGCCG AGATCGCGCG GCTGTTCCCC
GGCCTCGAAC CGCCCATGGG CGGGCGGTAC CGGCCGTGGG CGTACACCGC GGGGCATCCG
CCGTCCACCG GCGACCTGAC GATGCGCGAC CACGGTCGCG ATCCGGTCTG GGCGAGCGGT
GTCACCCCGT CGGACGTCGG CCAGGGCGCG GTGGGGGACT GCTACCTCCT GGCCGCGCTC
ATCGGTATCG CGCAGGCCGA TCCGGGGCTG CTGCGCCGCA ATCTGCGGGA GAACCCGAAC
GGGACGGTCA GCGTCACCGT CCACCTTCCT ACCGGCGCGG TCCCGGTCAC GGTCACCAGA
AGCCTGCCTG CCCGGGCGGG GGCCGGGCAG GAGATCGCCG CGGACGCGGA CAACGCCGCG
GGCGAGCCGG AGCTCTGGGC CGCGCTCTAC GAGAAGGCCT GTGCGCGGAT GGCTGGCAGC
TACGCGCGTC TTGAGGGCGG TGACCCCGCC ACCGCGATGG AGTACCTGAC CGGCACAGCC
GCGGTCCGCC GCTCGCCGTC CGCCGTGGGT GTCGACGAGC TCGCCGCCAG GCTGGCAGCC
GGTGGGGTGG TGACCGTCGT GACGCGTTCG GACCTGCCGC CGGGTTGCGG GCTGGTGCCC
AACCACGCCT ACGCGGTGCT GAAGGCGGAC GCGCGGACGG GGCAGGCCCT CCTCCGTAAC
CCGTGGGATC AGGCCGGCAC CGACGACCGG CTGGAGTGGC ACGACTGGGA CGGTCTGAAG
CCGGCGCTGG CGGGCGTCCA GTGGGCTTCG ACCGGCCGCG GTCCGCCGGC TCGATGA
 
Protein sequence
MSIKGQWIEG QWIEGQWIEG APVAAAAAVL RRVAVESSSL LAPVIVLDND ETWRSPWQRR 
CHGRIDEWTS ALAAGARAAR EQAVRLEFLA ARGAWGAVTL GPAAVSAPRS PVSHPAVVDV
AFSPERLAEL ARRLRRAADD AARLAAEVRR VAHALPGDMG GALVAEGCCR GLDRVAAEAP
DMAGAIDGRL AYAASGGSVA SAGFGGSGGF ATSAPAVERA PGPEMQHAAR LTEPRSAGAD
TARTVAELTA SIGADPFRLD PRELSELSTR LGRLGPAELR AVIGGLRGRP LEVLAAAVRI
APTRVELRTL GLLPAVFSLG DLLLRGAPAS MVAEIARLFP GLEPPMGGRY RPWAYTAGHP
PSTGDLTMRD HGRDPVWASG VTPSDVGQGA VGDCYLLAAL IGIAQADPGL LRRNLRENPN
GTVSVTVHLP TGAVPVTVTR SLPARAGAGQ EIAADADNAA GEPELWAALY EKACARMAGS
YARLEGGDPA TAMEYLTGTA AVRRSPSAVG VDELAARLAA GGVVTVVTRS DLPPGCGLVP
NHAYAVLKAD ARTGQALLRN PWDQAGTDDR LEWHDWDGLK PALAGVQWAS TGRGPPAR