Gene Franean1_7007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7007 
Symbol 
ID5675318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8544478 
End bp8545932 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content72% 
IMG OID641245853 
ProductDyp-type peroxidase family protein 
Protein accessionYP_001511244 
Protein GI158318736 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.54132 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGAC TGCCCGATCC CAGCGGCCCG GACGCCGTCG CCGCCGAGGA CCCCACCACC 
GCCGTCACCT CCAGCGGCGC CGGCGTGAAC GACGCCGGCC CCACCACCGC CGTCACCTCC
AGCGACGCCG GCGCGGGCGA CGCCGACGGC GACGGCCCGC GCAGCGGCCG CGGGAGCCAG
CCGGCGCCCG CGGCACGCGG TTTCAGCCGT CGCCGCATGG TCGCGTTCCT CGGCGGCGCC
GCGGCTGTCG GCGCGGGCGG CACCGCCGCC GGCTTCGTCG CCTCCAACTC GGAGGAGGAG
TCGCCGGGGT CGGGCCAGAG GGTTCCGTTC TTCGGTTCGA ACCAGGCCGG AATCGTCACC
CCGGTACAGG ACAGGCTGCA CTTCGCCGCC TTCGACCTCA GCCCGGCGGC CACCCGCGAC
GACCTGATCG CGCTGCTCAC CGCTTGGACC AACGCGGCGT CCAGGATGAC CGCCGGGCTG
GACGTCGGCA CCGGCGCTGT CACCGGTGCG CCCGGCTCCC CGCCGGACGA CACCGGTGAG
GCGCTGGGGC TGTCGCCCGC CCGGCTGACC CTCACACTCG GCTTCGGCAC CAGCCTGTTC
ACCGACGCCA GCGGCAAGGA CCGTTTCGGG ATCGCCGCTT CGCGACCGGC GCAGCTCGCC
GACCTGCCCG CCTTCCCCGG CGACGCCCTC GACCCAGCGT CCAGCGACGG CGACCTGTGC
GTGCAGGCCT GTGCCGACGA CCCGCAGGTG GCCGTGCACG CCATCCGCAA CCTGGCCCGC
CTCGCCCGGG GCGCCGCCTC GGTGCGCTAC TCCCAGCTCG GGTTCGGCCG CACCAGCTCC
ACCTCGACCG GCCAGGCCAC CCCACGCAAC ATGATGGGTT TCAAGGACGG CACCGCCAAC
ATCAAGGCCG AGGACGCGGC CACGATGAAC ACCCACGTCT GGGCCCAGCC CGGTGACGGG
CCGGACTGGA TGACCGGCGG CAGCTATCTC GTCAGCCGCC GCATCCGCAT GCTCATCGAG
CCCTGGGACA GCACCCCGCT CACCGAACAG GAACGGGTCA TCGGCCGCGC CAAGGGAAGC
GGAGCTCCGC TCGGCCAACG GGACGAGTTC GACCCGTTGG ACTTCGCGGC GAAGGACTCC
GCCGGAGAGC TGGTCGTCGA CACCAAGGCC CACGTACGGC TCGCCCACCC GACCCAGAAC
AACGGCGCCG TGATCCTGCG CCGTGGCTAC TCCTTCACCA ACGGCACCGA CAACCTCGGC
CGCCTCGACG CCGGGCTGTT CTTCATCGCC TATCAGCGGG ACCCGCGGAC CCAGTTCGTC
ACAATTCAGA AATCACTGGC CGGCAGGTCC AACGACGCGC TCAACGAATA CATTCAGCAC
GTCGGCAGCG GCCTGTACGC CTGCCCGCCG GGCGTCCAGC CAGGACAGTA CTGGGGCCAG
AAGCTCTTCG CCTGA
 
Protein sequence
MSRLPDPSGP DAVAAEDPTT AVTSSGAGVN DAGPTTAVTS SDAGAGDADG DGPRSGRGSQ 
PAPAARGFSR RRMVAFLGGA AAVGAGGTAA GFVASNSEEE SPGSGQRVPF FGSNQAGIVT
PVQDRLHFAA FDLSPAATRD DLIALLTAWT NAASRMTAGL DVGTGAVTGA PGSPPDDTGE
ALGLSPARLT LTLGFGTSLF TDASGKDRFG IAASRPAQLA DLPAFPGDAL DPASSDGDLC
VQACADDPQV AVHAIRNLAR LARGAASVRY SQLGFGRTSS TSTGQATPRN MMGFKDGTAN
IKAEDAATMN THVWAQPGDG PDWMTGGSYL VSRRIRMLIE PWDSTPLTEQ ERVIGRAKGS
GAPLGQRDEF DPLDFAAKDS AGELVVDTKA HVRLAHPTQN NGAVILRRGY SFTNGTDNLG
RLDAGLFFIA YQRDPRTQFV TIQKSLAGRS NDALNEYIQH VGSGLYACPP GVQPGQYWGQ
KLFA