Gene Franean1_2051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2051 
Symbol 
ID5670452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2469864 
End bp2471444 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content78% 
IMG OID641240973 
ProductPucR family transcriptional regulator 
Protein accessionYP_001506394 
Protein GI158313886 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.4241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGCGG CGGCCGACGC CCGCCTGCAC GCCGGCCTGC TGGGCAACTA TCTGGAGGTG 
CTCGCCTCCG CGGCCGACAG CGGCCGGCGG CTGTCCCGCG CCGAGCTCGA CGTCTTCCGG
GCGCTCGGCC AGGCCGCCGC CGAGTCCGGC GCCTCGCTGC CGGCGCTCGT CGACCTCTAC
CTCTCCGCGA CCTGGCGGAT CTGGCCGTCG CTGCCCGTCG TCCGCCAGGC CGACCGGGAC
GCGCGCGACC TGGGACGGAC GCTGCCGGCG GTCCTGGACG GGCGGGTGCC GGCCGTCGCC
CACGCCGAGC ACTCCCAGCA CACCGAGGAC GGCGGTGATC CCACCGATCT CAGCGGGCTG
ACCGTCGGGG CGGTCTCGCG AACCCGCGCG GCGGCGTCGG CCGTCCTGCG GGCCAGCGAC
GACGCGGTCG CGGCGGTGTG CGAGGGCTAC GAACGGGCCC GCGCGGCGCG GGCGCGCTCG
GAGGAGGCGA TGCGCCGCGA GCTGGTGGAC GACCTGCTCA CCGGCACCTC CGAGCTCGGG
CCGCTGCTCG AGCGGGCGGC CGCCTTCGGG CTCCGGCTGG AGGCGCCACA CGTCGTCCTC
GTCGCGGCGG GCGGGCGGCG CTTCCTGGAC GGGCGGGCGG TCGTGCGCGG GATCGAGGAG
GCGCTGCGCG CGCAGTGCGC CACGGAGCCC CTCGTCGCGA CCAAGGACGG GCTGCTCGTC
TGCGTCGTCC CGCAGGAGAC CGACCTGACC CTGCCCGTCC CGCCCACGGC GGCGATCACG
CCGGACGACG GCGCCGCACC GGACCGCCGC CCCGCCGCCG GCCCGACGGC CGGGTCGGCC
AGGGCCGGGG TGACCAGCGT GAGCGATCCC GCGTCGGGAC GTCACCATCC TGTGCATCAC
CCCGTGCACG CCCGGCCGGA CGGCGGCTCC GGGCACCCGA GGGCCGACCG GCCGACCCTC
GACCGGGAGA TCGCACGCCC ACGGGCCACG GCGCCGCGGG GCCGGCGCCG GATGGACCCG
CCCGGCCCCG GCGACGCCGG GTTCGCGCCG CTGCCGGCGT TCTCCCCCGC GGACGCGGCG
CCGTCCACCG CCATCAGGGC GGTCATCGGG CGGCTGGGGG TGGAACCGGA GCTCGTGTGG
CGGCTTGGGG TCAGCCGGCC CCGCAGCGGC GTCGCCGGCG TGCGCATCGG CTATGAGGAG
GCCCGCAACG CCGTCGAGCT GGCCGGGCGG ATGCGGCTGG ACGGGCAGGT CGTGCACGCC
GACGACCTGC TCATCTACAA GGTGCTGCTC CGCGACCGGG AACCGCTGGA GGAGCTCGTC
GAGGCGGTGC TCAGCCCGCT GCGGGCGGCG CGGGGCGGCG CGGGGCCGCT GATCGAGACG
CTCGACGCCT ACTTCGCGAC CGGCGGCGTG GCGCTCGCGG CGGCCCGCCG GTTGCACCTG
TCGGTGCGCG CGCTCACCTA CCGGCTCGAC CGCATCCACG CCCTGACCAG GCATGACCCG
ACCTCTCCGA CGGACCGGTA CGTCCTGCAG ACCGCGGTGC TCGGTGCCCG GCTGCTCGGC
TGGGAGGGCA CCTCGCGCTG A
 
Protein sequence
MLAAADARLH AGLLGNYLEV LASAADSGRR LSRAELDVFR ALGQAAAESG ASLPALVDLY 
LSATWRIWPS LPVVRQADRD ARDLGRTLPA VLDGRVPAVA HAEHSQHTED GGDPTDLSGL
TVGAVSRTRA AASAVLRASD DAVAAVCEGY ERARAARARS EEAMRRELVD DLLTGTSELG
PLLERAAAFG LRLEAPHVVL VAAGGRRFLD GRAVVRGIEE ALRAQCATEP LVATKDGLLV
CVVPQETDLT LPVPPTAAIT PDDGAAPDRR PAAGPTAGSA RAGVTSVSDP ASGRHHPVHH
PVHARPDGGS GHPRADRPTL DREIARPRAT APRGRRRMDP PGPGDAGFAP LPAFSPADAA
PSTAIRAVIG RLGVEPELVW RLGVSRPRSG VAGVRIGYEE ARNAVELAGR MRLDGQVVHA
DDLLIYKVLL RDREPLEELV EAVLSPLRAA RGGAGPLIET LDAYFATGGV ALAAARRLHL
SVRALTYRLD RIHALTRHDP TSPTDRYVLQ TAVLGARLLG WEGTSR