Gene Franean1_1682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1682 
Symbol 
ID5670084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2013637 
End bp2014695 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content75% 
IMG OID641240600 
ProductLacI family transcription regulator 
Protein accessionYP_001506026 
Protein GI158313518 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.374774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.894525 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCGA TGTCTACAGA CCGCCACATC GAGCGCATCA CGTCCCGCGA GGCGACGGTG 
AAGCGGCCCG CCACGATCCG CGAGGTCGCG GCGCTCGCCG GCGTCAGCGT GTCGACCGTC
AGCAACGCGC TGGCCGGCCG GCGATCGGTC AGCGGAGCGT CCAGCGCCCG GGTTCGCGCC
GCGGCCCACC GTCTCGGGTA CCGGCAGGTC GACGCCGCGC GGCCGGTGCG GACGCCGACC
CGGCACGCCA TCGGCCTCAT CGTCCCGGAC GCGAGCAACC CGTTCTTCGC CGAGATCGCG
CACGGCGTCG AGACGGTGGC CCAGTCGTCC GGCTGGGCGG TGTTCCTCGG CAACACCGAC
CTCGACGACG CGCGCGAGGC CGACTACCTC GACCGCCTGG CCGGGGCGGC CGACGGGCTC
CTGGTCTGCT CGGCGTCCGG GCATACCGAG CAGCTGCAGC ACCTGGTCGA CAGCGGCGTC
GCCGTGGTGG CCTGCGACGA GCGCCTGGAG CTGACCGGTG CCGGCGGGGT CTTCGCCGAC
GACGACGCGG CCGGCCGACT CGCCGCCGGG CACATCCTCG CCCGCGGCGC GCGCCGGATC
GCGATGATCT GCGGGCCGGA GCACCTCACC ACGGCCCGCG AGCGCCGCAC CGGCTTCCGC
GCAGAGTTGC AGGCGTCGGG ACGCTCGCTG CCGCCGTGGC GCTCGATCGC CAGCCGGTAC
ACGATCGAGG CGGGGCGCTG GGCGGCCGAC CAGCTCCTCG CCGCCGATCC ACAGATCGAC
GCGATCTTCT GCTCAAACGA TCTGCAGGCC GTCGGCGCGG TGCGGGCGTT GCGGCACGCG
GGCCGGCAGG TGCCCGGCGG TGTGCTGATC ATCGGGATCG ACGGGATCTC CTGGGGCGAG
CTCACCGAGC CGTCGCTGAC GACGGTGGCG CGTCATCCCG AACGGCTGGG GGCCGAGGCC
GCCCGGTTCC TCATCGAGAT GGTCGGTGAC GGCGCCCGGC CCCGCGAGGT CGTGCTGCCG
GTGGAGCTGG TCGAGCGGGA GAGCACCCGC CGCGCCTGA
 
Protein sequence
MESMSTDRHI ERITSREATV KRPATIREVA ALAGVSVSTV SNALAGRRSV SGASSARVRA 
AAHRLGYRQV DAARPVRTPT RHAIGLIVPD ASNPFFAEIA HGVETVAQSS GWAVFLGNTD
LDDAREADYL DRLAGAADGL LVCSASGHTE QLQHLVDSGV AVVACDERLE LTGAGGVFAD
DDAAGRLAAG HILARGARRI AMICGPEHLT TARERRTGFR AELQASGRSL PPWRSIASRY
TIEAGRWAAD QLLAADPQID AIFCSNDLQA VGAVRALRHA GRQVPGGVLI IGIDGISWGE
LTEPSLTTVA RHPERLGAEA ARFLIEMVGD GARPREVVLP VELVERESTR RA