Gene Francci3_2591 details


Gene Information

Locus tag: Francci3_2591
Symbol:
ID: 3906497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3056696
End bp: 3058066
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 73%
IMG OID: 637879916
Product: Dyp-type peroxidase
Protein accession: YP_481682
Protein GI: 86741282
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01412] Tat-translocated enzyme; [TIGR01413] Dyp-type peroxidase family
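The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3056696, 3058066)
print(length)                     # 1371
print(protein_length_aa(length))  # 456
```

Both results agree with the listed Gene Length (1371 bp) and Protein Length (456 aa).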


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.278958
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGACA TTGTGGACGC TGCCGATGTA ACGGATGTCG CCGATGCGAC GGATGCGGTC 
CGGGAGAACC GCCCGATCGG TCCCCACCGC CGCAGGGTGC TGCGGGGCGC CGGGCTGCTC
GGCTTCGGCG GTGGTGTCGC GATCGGGGGA GGCCTCGGGG GAGTCATGGC CCTCGACGGC
GGTGTCGCCG TTGCCGCGGA ACGTCCCCAG GAGGACCAGT CGGGCGGGAC GGACCGCGTC
GCCGCGGTCC GTCGTGAGGC GCTGGCCGGG GTGGTGGGCG ACGGCCGTCA CCAGCCCGGA
ATCGCCGACC GCCCTCCCGC GCAGCTCGTC TTCACCGCCT ACGACCTGAC GACGTCCGGC
CCGGCCGCGG TCCGCACATC CCTCGCCGCG GTGCTGCGCA CCTGGACGGC CGCCGCCGCG
GTGCTGATGC GAGGCGAGCC GCTCGACGGC GCTGAGCGGG ACACCCAGGG ACTCGGACCG
GCGGGGCTGA CGATCACCAT CGGGCTCGGC GCGTCCGCGC TGCGCCGCGC CGGTCTCGAC
GCGCAGATAC CCGCGGAGTT CGCGGACATT CCCGCGATGC CCGGTGACCA GCTCGATCTG
GCGCGCAGTG GCGGGGATCT CGGCGTGCAG GTCTGCGCCG AGGATCCGAT GGTCGCGGTC
TCGGCTTCCC GGCAGATGCG TCGTCTCGCC GCTCAGGATG CCAGGCCCCG CTGGATCCAG
CGGGGATTCC TTCGTTCCGC GGCGGCCGCG TTCAACCCCG GGTCCACCCC ACGCAACCTG
ATGGGGCAGA TCGACGGCAC GGACAACCCG GGACCGGGCA CGCCGCGGTT CGACCGGGCG
GTGTGGGTGT CCAGCGGGCC GGAATGGATG CGGGACGGGT CCTACCTGGT CTGCCGGCGT
ATCCGCATGC TGCTCGATGC CTGGGCGCGG CTGGACGAGA CGGCTCAGAG CGCGGTCATC
GGCCGCCGTA AGTCCGACGG TACCGCGCTC TCGGCACCGC CGGTGGGGCA GGGCGGCGCT
GAGACCATCC AGCCGGACTT CACTGCCCGC GCCGCCGACG GCAGTCTCGC GATCGCCGGC
AACGCCCATG TCCGGCTGTC CCACCCGTCG TTCCACGGTG GGATCGCCAT GCTTCGTCGC
GGCTACTCCT ATGACGACGG CCTGGACTCA GCCGGCGAGC CGGACGCCGG CCTGTTCTTC
GCGGCCTACC AGGCCGACCC GCGTACGGCG TTCGTCGCGG TGCAGCGGAC CCTCGCGGCC
GGCGACGCAT TGAACACCTT CATCCGGCAC ACCTCCAGTG CCCTGTTCGC CGTCCCGCCC
GCGGCGCCCG CGGGCGGCTT TCTCGCGCAG GGGCTCTTCG GCGCCGGCTG A
 
Protein sequence
MSDIVDAADV TDVADATDAV RENRPIGPHR RRVLRGAGLL GFGGGVAIGG GLGGVMALDG 
GVAVAAERPQ EDQSGGTDRV AAVRREALAG VVGDGRHQPG IADRPPAQLV FTAYDLTTSG
PAAVRTSLAA VLRTWTAAAA VLMRGEPLDG AERDTQGLGP AGLTITIGLG ASALRRAGLD
AQIPAEFADI PAMPGDQLDL ARSGGDLGVQ VCAEDPMVAV SASRQMRRLA AQDARPRWIQ
RGFLRSAAAA FNPGSTPRNL MGQIDGTDNP GPGTPRFDRA VWVSSGPEWM RDGSYLVCRR
IRMLLDAWAR LDETAQSAVI GRRKSDGTAL SAPPVGQGGA ETIQPDFTAR AADGSLAIAG
NAHVRLSHPS FHGGIAMLRR GYSYDDGLDS AGEPDAGLFF AAYQADPRTA FVAVQRTLAA
GDALNTFIRH TSSALFAVPP AAPAGGFLAQ GLFGAG
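
The gene sequence in this record is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as the GC content field. The following is a minimal sketch, not the database's own method; the helper names are hypothetical, and the example input is just the first two blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C in a whitespace-formatted sequence."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence in this record.
example = "GTGTCGGACA TTGTGGACGC"
print(clean_sequence(example))  # GTGTCGGACATTGTGGACGC
print(gc_fraction(example))     # 0.6
```

Passing the full gene sequence text to `gc_fraction` gives the fraction for the whole gene, which can then be compared with the GC content field.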