Gene Franean1_4377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4377 
Symbol 
ID5672730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5223940 
End bp5224899 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content71% 
IMG OID641243246 
Productintradiol ring-cleavage dioxygenase 
Protein accessionYP_001508663 
Protein GI158316155 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3485] Protocatechuate 3,4-dioxygenase beta subunit 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGACC GTCAGCAGCC AAGCCGGATG CCGACGTACG AGGGGCGTGC ACTGGCCCGT 
CCTGAGGAGG AGATCGTCGA TCAGGGGCTG GGCTTCGACA TGGGCACGGT GGTGAGCCGG
CGGCGGATGC TGGCCTTCTT CGGTGTGGGT GCCGCGGCAG CAAGCCTAGC CGCCTGCACT
CCAGGCCAGG TCGGGTCCTC GGGGGCGTCC GCTGCTACGG CGTCCGCGGT TGCAGGGGAG
ATCCCCGAAG AGACCGCGGG CCCCTACCCG GGCGACGGGT CCAACGGGCC GGACGTCCTC
GAGCAGAGCG GTGTGGTCCG CAGTGACATC CGGTCCAGCT TCGGCGACTC GACCGGCACC
GCCGAAGGCG TCCCCATGAC ACTGGCGCTG ACGGTCCGCG ACCTCGCGAA CGGTGGCACG
CCCTTCGCCG GGGTGGCCGT GTACGTGTGG CACTGCGACC GCGAGGGCCG CTACTCGCTG
TACTCCGACG GCGTCACCGA CCAGAACTAT CTGCGTGGCG TCCAGATCAC CGACACCGCC
GGCACGGTCC GTTTCACCAG CATCTTCCCC GCCTGCTACT CAGGACGCTG GCCCCACATC
CACTTCGAGG TCTACCCCGA CCAGGGCAGC ATCACCGACG CCACCACGGC CATCGCCACC
TCCCAGGTCG CACTCCCCCA AGACGTCTGC ACCACGGTCT ACGCCCAGCA GGGCTACGAG
GCGTCCGTGA GCAACCTGGC CCAGGTCAGC CTCTCCAGCG ACAACGTCTT CGGCGACGAC
TCCGGCGCCA GCCAACTCGC CACCGTGACT GGCGACGTCA CCGGCGGCTA CACCGTCTCC
CTTCCTGTCA GCGTCGACAC CGCCACCACC CCCGGCGGCG GCGGCCAAGC CCCCGGAGGG
GGCGGCGGCC AGCCGCCGTC CGGCGGGCCA GGTGGTCAGC CGCCGGCCAC ATCGAGCTGA
 
Protein sequence
MADRQQPSRM PTYEGRALAR PEEEIVDQGL GFDMGTVVSR RRMLAFFGVG AAAASLAACT 
PGQVGSSGAS AATASAVAGE IPEETAGPYP GDGSNGPDVL EQSGVVRSDI RSSFGDSTGT
AEGVPMTLAL TVRDLANGGT PFAGVAVYVW HCDREGRYSL YSDGVTDQNY LRGVQITDTA
GTVRFTSIFP ACYSGRWPHI HFEVYPDQGS ITDATTAIAT SQVALPQDVC TTVYAQQGYE
ASVSNLAQVS LSSDNVFGDD SGASQLATVT GDVTGGYTVS LPVSVDTATT PGGGGQAPGG
GGGQPPSGGP GGQPPATSS