Gene Franean1_1533 details

Gene Information

Locus tag: Franean1_1533
Symbol:
ID: 5669937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1835124
End bp: 1837355
Gene Length: 2232 bp
Protein Length: 743 aa
Translation table: 11
GC content: 71%
IMG OID: 641240453
Product: catalase/peroxidase HPI
Protein accession: YP_001505879
Protein GI: 158313371
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0376] Catalase (peroxidase I)
TIGRFAM ID: [TIGR00198] catalase/peroxidase HPI
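
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the coordinates are 1-based and inclusive (the record does not say so, but the listed lengths are consistent with it) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1835124
END_BP = 1837355
GENE_LENGTH_BP = 2232
PROTEIN_LENGTH_AA = 743


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2232
print(expected_protein_length(GENE_LENGTH_BP))  # 743
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.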


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0328118
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGAGA ATCACGAGAC AGTTGTTTCC GAACTGAACG AGGAAAGCGG TGGCGGCTGC 
CCGGTCGCGC ATGAGCGGGC TCCGCATCCC ACCCAGGGTG GCGGGAACCG TGGTTGGTGG
CCGAACCGGC TCAACCTGAA GATCCTCGCC AAGAATCCCG CGGTGGCCAA TCCGCTCGGT
GAGGAGTTCG ACTACGCCGC GGCGTTCCGG ACGCTCGACC TCCCCGCCGT CAAGCGGGAC
ATCGCGCAGG TGCTGACGAC GTCGCAGGAC TGGTGGCCGG CCGACTACGG TCACTACGGC
CCGTTCATGA TCAGGATGGC GTGGCACAGC GCGGGTACCT ACCGCATCAG CGATGGCCGT
GGTGGCGGCG GCGCCGGGCA GCAGCGTTTC GCGCCGCTCA ACAGCTGGCC GGACAACGGG
AACCTGGACA AGGCGCGCCG CCTGCTGTGG CCGGTGAAGA AGAAGTACGG TCAGGCGCTC
TCCTGGGCGG ACCTGATGAT TCTCGCCGGC AACGTCGCCC TGGAATCGAT GGGTTTCACG
ACCTTCGGCT TCGCCGGTGG CCGCGAGGAT GTCTGGGAGC CGGACGAGGA CGTCTACTGG
GGCCCGGAGA CCACCTGGCT CGGCGACGAG CGCTACACCG GTGACCGGGA GCTCGAGAAT
CCGCTCGGGG CGGTCCAGAT GGGCCTCATC TACGTCAACC CGGAGGGCCC GAACGGCACC
CCGGACCCGC TGGCCGCGGC CCGCGACATC CGCGAGACGT TCCGCCGCAT GGCGATGGAC
GACGAGGAGA CGGTGGCGCT CATCGCCGGT GGCCACACGT TCGGCAAGAC CCACGGCGCG
GGCGACCCGG ACAACGTCGG TCCGGAGCCC GAAGGTGCCC CCCTTGAGAC GCAGGGCCTC
GGCTGGAAGA ACGCCTTCGG TACCGGCAAG GGCGCGGACG CGATCACCAG CGGGCTCGAG
GGTGCCTGGA CTCCCACCCC GGTGAGCTGG GACAACAGCT TCTTCGAGAC GCTTTTCGGC
TACGAGTGGG CGCTGACGAA GAGCCCGGCC GGGGCCTACC AGTGGAAGCC GAAGGGCGGC
GCGGGCGCTG GCACCGTCCC CGATGCCCAC GACGCGGCGA AGAGCCACGC CCCGACGATG
CTGACGACCG ACCTCGCCCT GCGGTTCGAC CCGGTGTACG AGCCGATCTC GCGGCGGTTC
CTGGAGCACC CGGACGAGCT CGCGGACGCG TTCGCCCGGG CGTGGTTCAA GCTGACCCAC
CGTGACATGG GGCCGGTCGC GCGCTACCTC GGCCCGGAGG TCCCGGCCGA GACGCTGCTG
TGGCAGGACC CGGTGCCGGC GGTGGACCAC GAGCTCGTCG ACGCCGCGGA CGTCGCCGCG
CTGAAGGTCC GGGTCCTCGC CTCGGGCCTG TCGGTCTCCG AGCTGGTCGC GACCGCGTGG
GCGTCGGCCT CGACGTTCCG TGGCGGTGAC AAGCGCGGTG GTGCCAACGG CGCGCGCATC
CGCCTCGAGC CGCAGCGCGG CTGGGAGGTG AACGAGCCGG ACCGGCTGGC GGCGGTGCTG
GGGACGCTGA CCGGCATCCA GGAGGAGTTC CACGCCGCCC GGACCGACGG CCGGCGGGTC
TCGCTCGCCG ACCTGATCGT GCTCGCCGGT GGTGCCGCTG TCGAGCAGGC AGCCCGCGAA
GCCGGCTTCG ACGTCGAGGT CCCGTTCACC CCGGGCCGGA CCGACGCGTC CCAGGAGCTG
ACCGACGTCG AGTCGTTCGC GGCGCTCGAA CCGGCCGCGG ACGGGTTCCG TAACTACCTC
GGGAAGGGCC AGCGCCTGCC GGCCGAGTAC CTGCTGCTCG ACCGGGCGAA CCTGCTGACC
CTGAGCGCCC CCGAGCTGAC GGTCCTCGTC GGTGGCCTGC GGGTGCTGGG GGCGAACTTC
CGGCAGTCCT CGCTGGGGGT CCTCACCGCG ACGCCCGGGG TGTTGACCAA CGACTTCTTC
GCCAACCTGC TCGACCTGGG CACGACGTGG CGCCCGAGCG GCGAGGACGA CAACGTCTTC
GAGGGCCGCG ACGCCGCCAC GGGCGAGCTG ACCTGGACCG GTAGCCGCGT CGACCTCGTC
TTCGGCTCGA ACTCCGAGCT GCGCGCGTTC GCGGAGGTGT ACGCGAGCGA CGACGCGCGG
GAGAAGTTCG TACGCGACTT CGTCGCGGCC TGGGCCAAGG TGATGAACCT CGACCGCTAC
GACCTCGCCT GA
 
Protein sequence
MSENHETVVS ELNEESGGGC PVAHERAPHP TQGGGNRGWW PNRLNLKILA KNPAVANPLG 
EEFDYAAAFR TLDLPAVKRD IAQVLTTSQD WWPADYGHYG PFMIRMAWHS AGTYRISDGR
GGGGAGQQRF APLNSWPDNG NLDKARRLLW PVKKKYGQAL SWADLMILAG NVALESMGFT
TFGFAGGRED VWEPDEDVYW GPETTWLGDE RYTGDRELEN PLGAVQMGLI YVNPEGPNGT
PDPLAAARDI RETFRRMAMD DEETVALIAG GHTFGKTHGA GDPDNVGPEP EGAPLETQGL
GWKNAFGTGK GADAITSGLE GAWTPTPVSW DNSFFETLFG YEWALTKSPA GAYQWKPKGG
AGAGTVPDAH DAAKSHAPTM LTTDLALRFD PVYEPISRRF LEHPDELADA FARAWFKLTH
RDMGPVARYL GPEVPAETLL WQDPVPAVDH ELVDAADVAA LKVRVLASGL SVSELVATAW
ASASTFRGGD KRGGANGARI RLEPQRGWEV NEPDRLAAVL GTLTGIQEEF HAARTDGRRV
SLADLIVLAG GAAVEQAARE AGFDVEVPFT PGRTDASQEL TDVESFAALE PAADGFRNYL
GKGQRLPAEY LLLDRANLLT LSAPELTVLV GGLRVLGANF RQSSLGVLTA TPGVLTNDFF
ANLLDLGTTW RPSGEDDNVF EGRDAATGEL TWTGSRVDLV FGSNSELRAF AEVYASDDAR
EKFVRDFVAA WAKVMNLDRY DLA
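
The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. Under translation table 11 (the bacterial code listed in this record), GTG can act as an alternative start codon, and an initiating codon is read as methionine. The sketch below spot-checks the first and last codons of the gene sequence against the protein sequence. It is a minimal illustration, not the database's own method: the function name is hypothetical, and the start-codon set is a deliberately incomplete subset of the starts allowed by table 11.

```python
from itertools import product

# Codon table in the compact NCBI layout (bases ordered T, C, A, G).
# Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are listed here.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    If is_cds_start is true and the first codon is a known start codon,
    it is translated as M.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in COMMON_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGTCCGAGA ATCACGAGAC AGTTGTTTCC"))   # MSENHETVVS
# Last 12 bases of the gene sequence (not the start of the CDS).
print(translate_cds("GACCTCGCCT GA", is_cds_start=False))  # DLA
```

The two outputs match the first ten and last three residues of the protein sequence listed above.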