Gene Franean1_1630 details

Gene Information

Locus tag: Franean1_1630
Symbol:
ID: 5670032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1946364
End bp: 1948145
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 71%
IMG OID: 641240548
Product: hypothetical protein
Protein accession: YP_001505974
Protein GI: 158313466
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
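
The coordinate and length fields in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions match the values listed here, but the record does not state them. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1946364, 1948145)
print(length, protein_length(length))  # prints "1782 593"
```

Both results agree with the Gene Length (1782 bp) and Protein Length (593 aa) fields.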


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.468389
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCACAA ACCCGGCGGG GGTGCTTTCC CGCGGCGTTC ACCCTGAGTC CACCGCCCGG 
CAGCCGCAGG CTCCGAGAAT TTCTCTCAGC CGCTCCTCCG ACGGGTCGGC GGGAGGTTTT
CTCGTGGACC ACGCGGTTTA CCACGTGCTC GAGCAGAGCC GGGAATACGA GGTCGATCTT
GTCGTCACCG CCGGCCAGTG GCCGGACGGG CTGACCGGGT TCGCATTTGT CGTCGGGCCG
GCCCAGCCGA CGGTGCTGGA CTTCGCGCCG AGCGGACCGG GAATGCTGAC CCGGGTCGAT
CTGGCGAAAC GCACCTGGCG GACCCGCCGA GTGGTCACCC CGGACCTGGC GATGCTCGGC
GGGCTGCGTG CCGCACTGGC TCCGGAGGAG CTCCAGCGGC TGCTGACCGG CGCCCACCCG
TCACTCAGCC ACACCGCGCC GCACTTCTTC GGCGACCGGC TGCTGCTCAC CGCCGACCGG
CAGCGGCCGG TGGAGCTCGA CCCGGTCACG ATGACCTACC GGACGTTCCT CGGCGCGGTA
CACGAGTACC CCCAGGTCGG GGCGCACCCG CTGTTCCCCG GGGTGCAGAC GACGTCCCAC
CCGGTGGTCG ACCCCGACGA GGGCTGTCTG TGGTGGAGCA ACATCCACCT GCGCCCGCGC
GGCCGGTCGA CCACCGACGT GGAGGGCCCG CTGTCGGTGG TGCGCTGGGA CGGCCACGGT
GAGCTGGAGA CCTGGGAGGT ACCCGGGGCG CGCATCACCC AGGGCACCCA CGAGATCGCG
GTGACCCGCG ACTACGTCAT CTTCACCGAG ATCGGCTTCC AGCCGGAGCC CGGCAGCGTC
GCCGGCCGCG GCCGCACCAG GCCGCACCTG CCGTTCACCG ACATCTATCT GGTCGCCAAG
CGGGACCTGA CCCGTGCCCG GGTCGGCTCC GCCGTGCCGG TGACGCACGC GCGGGTCCCC
TACGAGTCGT TCCACGAGTT CGCCGACTAC GGCCAGGACG GCGACGACGT CACGATGTAC
GTCGCGCACT CGAACGGCTG GGACATGAAC TACGCCATCA CCCGCTCGGA CACCGTTTGG
CGCACCGGGG ACAGGCTGCG CAGCTGCCTG TCCGGGTTCA TGCCGACGCC GGTCGACGCC
GCACCGGTCG GGCGGCACGT CATCGACGGC CGTACCGGGC AGGTGCGGCA GAGCAGGTAC
TTCCTCGACC CGCAGCGGCA CTGGGGAACC CTGCTCTACG CCCGCGACAC CCGCCCGGCG
GCGCTCGAGC GGGGCCGCCA CCTGTGGCAG GCGTACTGGG GCGCCACGCC CGACACGATG
GTCTCGGCGA TCGTCGAGAT GTACGCCGAC CATCCGTTCC GGGTGGTCGG CGTCGACGAC
CTGCCCACGA CCGAGATCCC GTCGTCGCTG GTCTGCATCG ACCTGGAGTC GATGACCGAG
CAGTCGGCGT GGACGTTCCC GGCCGGGACG ATCTGCGAGT CCCCGGTCTT CGTGCCGGAC
AAGGCCGGCG GCGATGGCTG GGTGGTGGTC TTCGTCAAGC ATGCGGACCG CACCGAACTG
CAGGTCTTCG ACGCCCTGGC GCTGGATCTC GGCCCGTGCG CCGTGGTGAC GGCGCCAGGC
CTGCGAATGC CCGTGCTGTT CCACTCGGGC TACACGGAGA CCATCCGCTC CCCCGGTACC
GACTACCGGC GCTCGTTCGC CGCCGACCTC GGCACCGGAT GGCGCGACCT CTCCCCGGCC
GCGCGCGCCA TCGTCACCGA GATCGTGGAG GCGTTCGGCT AG
 
Protein sequence
MGTNPAGVLS RGVHPESTAR QPQAPRISLS RSSDGSAGGF LVDHAVYHVL EQSREYEVDL 
VVTAGQWPDG LTGFAFVVGP AQPTVLDFAP SGPGMLTRVD LAKRTWRTRR VVTPDLAMLG
GLRAALAPEE LQRLLTGAHP SLSHTAPHFF GDRLLLTADR QRPVELDPVT MTYRTFLGAV
HEYPQVGAHP LFPGVQTTSH PVVDPDEGCL WWSNIHLRPR GRSTTDVEGP LSVVRWDGHG
ELETWEVPGA RITQGTHEIA VTRDYVIFTE IGFQPEPGSV AGRGRTRPHL PFTDIYLVAK
RDLTRARVGS AVPVTHARVP YESFHEFADY GQDGDDVTMY VAHSNGWDMN YAITRSDTVW
RTGDRLRSCL SGFMPTPVDA APVGRHVIDG RTGQVRQSRY FLDPQRHWGT LLYARDTRPA
ALERGRHLWQ AYWGATPDTM VSAIVEMYAD HPFRVVGVDD LPTTEIPSSL VCIDLESMTE
QSAWTFPAGT ICESPVFVPD KAGGDGWVVV FVKHADRTEL QVFDALALDL GPCAVVTAPG
LRMPVLFHSG YTETIRSPGT DYRRSFAADL GTGWRDLSPA ARAIVTEIVE AFG