Gene Franean1_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0235 
Symbol 
ID5668660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp287881 
End bp289170 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content74% 
IMG OID641239164 
Producthypothetical protein 
Protein accessionYP_001504608 
Protein GI158312100 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.27533 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGA GCACCAGGCC GGCGACGCCC CGGGCCGGGC GGCCCGGGTT CTCCCAGCAC 
ACGGAATACC TGCGCGCGCG AGAGAACCAG GCGATGGTCC GACGCGACCG CCTGGCGCGC
GAGGTGCCGC GCGCGGCCAT CCGCGCCGCG GTGGTCGGCA TGGGCCTCGG CCTGGTCATC
GGGTTCGGGC TCGGGCTGCC CGGTCTCGGC GTGGCCGTCT TCGTGGTCCT GCTCATCGTC
TGGCCCGGTG GCATCGCCGT GGCCGCCTTC GGTGTCTCAC CGGACGTCGA GACCCTGCGC
GAGGCGGCCG AGGCTGAGCG CAAGACCGCC CGCGCGATCT CCCGGCTCCG CCGACACGGC
TATGTGATCA TGCACGACCG GGCCGTCCCC TACTCGCAGG CCACAATCGG GCACCTGCTG
ATCGGTCCCG GCGGCGTCAT GATCCTCGGC AGCGACACCA ACAAGGGCAT CGTCCGCTAC
GCCAAGGGCG GCGCCATGGT GGACGGCGAG TCGCTCAAGC CCGCGATCGA CAAGACCTCA
TGGCTCGGCG GCGAGGTGCG CAACCAGGTC CGCGCCGCCC TGCCCACCAC GAAGATCCCG
GTCTACCCGG TCCTCGTGAT GGTCGAGGCG AGCGTCCTGT GGAGCGACGG CGCGCTGGAC
GGCGTCACGA TCATCAGCGT CAAGGATGTC GTCAAGTACG TCCGGAGCAA GCCCGGGCGG
CTCAACCCCG GGCAGGTCCA GCAGGTCCTC GCCGCCGCCC AGCGGCTCTT CCCGCCGTAC
TCCTCCAACC GGCTCGCCGA GCACGTCGTC GTCGACCGCG ACCAGTGGCT CACCCTGATG
GACGCCCTGC GCACAATCCG CGAGCGCGGC GGCGACGCCT CCGAGATGCT CGAGCGCCTC
GCCCAGATCG AAGCCGACCT CGGCCGCCAG GCCGATCTCA TCGACCGCGC CGGCATGCCC
CTCGCCCGGG CCGCCGACCA GCCCGACGGC CCGACCGACA GCCCACCGCC CGCTTCCGGG
ACGGACACGG CGACCGACGC GATCGGCCTG CTGGACGTGG ACGGAACAGG CACCGCCAAG
TCCCTGGAGG GACCCCCGCG GGCCCGCCCC GGCGAGGGCC GGCGCGGCCG CATCCTGGCC
GCCGTCCGCC AGCCACGGGG CAGCGAGTCC ATCAGCACGT CGAGCCGGCC CCCCGGGGGC
GACGGCCCGA CGACGGCAAA GGGCGACCAG CCCCCCGCCC CCGGCGACGA CCGCGCCCAC
CCGACCTCCG GGCCCGGATC CGGGTCGTAG
 
Protein sequence
MATSTRPATP RAGRPGFSQH TEYLRARENQ AMVRRDRLAR EVPRAAIRAA VVGMGLGLVI 
GFGLGLPGLG VAVFVVLLIV WPGGIAVAAF GVSPDVETLR EAAEAERKTA RAISRLRRHG
YVIMHDRAVP YSQATIGHLL IGPGGVMILG SDTNKGIVRY AKGGAMVDGE SLKPAIDKTS
WLGGEVRNQV RAALPTTKIP VYPVLVMVEA SVLWSDGALD GVTIISVKDV VKYVRSKPGR
LNPGQVQQVL AAAQRLFPPY SSNRLAEHVV VDRDQWLTLM DALRTIRERG GDASEMLERL
AQIEADLGRQ ADLIDRAGMP LARAADQPDG PTDSPPPASG TDTATDAIGL LDVDGTGTAK
SLEGPPRARP GEGRRGRILA AVRQPRGSES ISTSSRPPGG DGPTTAKGDQ PPAPGDDRAH
PTSGPGSGS