Gene Franean1_3783 details


Gene Information

Locus tag: Franean1_3783
Symbol:
ID: 5672147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4485921
End bp: 4486907
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 73%
IMG OID: 641242662
Product: intradiol ring-cleavage dioxygenase
Protein accession: YP_001508082
Protein GI: 158315574
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3485] Protocatechuate 3,4-dioxygenase beta subunit
TIGRFAM ID:
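
The start, end, gene length and protein length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single untranslated stop codon at the end of the CDS; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 4485921
end_bp = 4486907

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 987 328
```

Under these assumptions the computed values agree with the reported gene length (987 bp) and protein length (328 aa).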


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.83161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGATC ACCGGAGTCC CAGCCGCACG CGGGCGTACG AGGGGCGCCC GCTGGCCCGG 
CCCGACGAGG AGATCGTTGA CCAGGGCCTC GCCTTCGACG TCGGCACACT GCTGAGCCGG
CGGCGGATGC TGGCGTTCTT CGGCCTCGGC GCCGCGTCGG CCGGCCTGGC GGCCTGCGGC
TTGGACTCCG CCGGATCCAC TGCCTCAGCC ACCTCGGGGA CGTCCGCCTC GGCGGCCTCG
GCCGCGACGA CGTCGGCCAC GATCGGCGCG GCGGCCGGGG AGATCCCCGA GGAGACCGCG
GGCCCCTACC CGGGCGACGG GTCCAACGGG CCGGACGTCC TCGAGCAGAG CGGTGTGGTC
CGCAGTGACA TCCGGTCCAG CTTCGGCGAC TCGACCGGTA CCGCCGAAGG CGTCCCCATG
ACGCTGGCGC TGACGGTCCG CGACCTCGCG AACAGCGGCA CGCCCTTCGC CGGGGTGGCC
GTGTACGTGT GGCACTGCGA CCGAGAGGGC CGCTACTCGC TGTACTCCGA CGGCGTCACC
GACCAGAACT ACCTGCGCGG GGTCCAGGTC GCCGACTCCG CCGGCATGGT CCGTTTCACC
AGCGTCTTCC CGGCGTGCTA CTCGGGACGC TGGCCGCACG TCCACTTCGA GGTCTATCCC
GACCAGGCCA GCATCACCGA CTCGTCCAAG GCCATCGCCA CCTCGCAGCT CGCGCTGCCG
CAGGACGTCT GCGCCAAGGT CTTCACGCAG CCGGGCTACG AGGCGTCCGT GCGCAACCTG
GCGCAGGTCA GCCTCGACAG CGACGGTGTC TTCGGGGACG ACGGGGCTGC CAGCCAGCTC
GCCACCGTCA CCGGTGATGT CACCGGCGGC TACGCGGTCT CTCTCGCCTT GGGCGTCGAC
ACGTCGACGG CCGCGGGCGG CGGTCAGATC TCCGGCGGCG GCGCGCCGGG CGGCGCACCC
GGGGGCCGGG CGCCTGGCGG GCGGTGA
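
The record reports the GC content as a percentage. As a rough sketch of how such a figure can be recomputed from a sequence pasted in the 10-base block layout used here, the hypothetical helper `gc_content` below strips whitespace and returns the fraction of G and C bases. It is an illustration only and not necessarily how the database derives its value.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G/C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Small demonstration on the first 10-base block of the gene sequence.
print(gc_content("ATGGCGGATC"))  # 0.6
```

Passing the full gene sequence (all lines, spaces included) to the same function gives the fraction for the whole gene, which can be multiplied by 100 to compare against the reported percentage.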
 
Protein sequence
MADHRSPSRT RAYEGRPLAR PDEEIVDQGL AFDVGTLLSR RRMLAFFGLG AASAGLAACG 
LDSAGSTASA TSGTSASAAS AATTSATIGA AAGEIPEETA GPYPGDGSNG PDVLEQSGVV
RSDIRSSFGD STGTAEGVPM TLALTVRDLA NSGTPFAGVA VYVWHCDREG RYSLYSDGVT
DQNYLRGVQV ADSAGMVRFT SVFPACYSGR WPHVHFEVYP DQASITDSSK AIATSQLALP
QDVCAKVFTQ PGYEASVRNL AQVSLDSDGV FGDDGAASQL ATVTGDVTGG YAVSLALGVD
TSTAAGGGQI SGGGAPGGAP GGRAPGGR
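
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons, so this sketch uses a plain standard-code lookup. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The function name `translate` and the table construction are illustrative.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCGGATC ACCGGAGTCC CAGCCGCACG"))  # MADHRSPSRT
```

The output matches the first ten residues of the protein sequence listed in the record.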