Gene Franean1_0436 details

Gene Information

Locus tag: Franean1_0436
Symbol:
ID: 5668859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 517180
End bp: 518205
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 71%
IMG OID: 641239368
Product: LacI family transcription regulator
Protein accession: YP_001504807
Protein GI: 158312299
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
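
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 517180
end_bp = 518205


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1026 341
```

Under these assumptions the computed values agree with the listed Gene Length (1026 bp) and Protein Length (341 aa).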


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.197252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCTCGA CGCCGGACAG CCAGGAGGCG CGCCGCGTCA CCATCCAGGA CGTGGCACGC 
GAGGCCGGCG TGTCGGTCTC GGCCGTCTCC AAGGTCGTCC GGGACGCGTA TGGCGTCAGC
GAGGGTATGC GGGAGAAGGT CACCGCCGCC ATCGACGCGC TCGGCTACCG CCCGCACACC
GGCGCCCGGG CCATGCGCGG CCGCTCGTAC TCCGTCGGCG TGATGCTCAC CGAGCTGACC
TCGCCGTTCC AGCCGCAGAT CATCAACGGC ATCACGGCAC AGTTCGAGCC GACGCCGTAC
CAGGAGATCC TGATCGCCGC CGGCACCTCG CCCGACCGGC AGAAGCGCAG CATCGAGGCC
TTGATCGACC GGCAGGTCGA CGGGCTGATC GTCATCGCGC CCTGGATGGA GCAGGCGTGG
CTGGAGAAGC TGGGCGCGAG CCTGCCGACG GTCGTGCTGG CCCGGCATGG CGGCTCCGGC
ACCTTCGACA CGATCGTCGG CGACGACTTC GAGGGCGCCC GCCTCATGGT CGACCGCCTG
GTGGCCCTCG GGCACCGGCG CATCGTGCAC ACCAGCCAGC CCTCGGGCGG CCTGGAACGC
CCGTACGTCC TGTCGCACAC GCCACGGCTC GACGGCTACG AGGAGACGAT GCGAAGGCAC
GGGCTGGAGC CGGACGTCAT CGTCACCAGC TACTCGGAAG AAGGCGGATA CGAGGCCGCC
CGGCAAGCGC TGGCCCGTCC CATCCCGCCG ACCGCCATCT TCGCCGGCGC GGACATCGCC
GCGCTGGGCG TGCTGCGCGC GGCCGAGGAA CTCGGGCTGC GGGTTCCGGA AGACCTCAGC
GTCGCCGGGT ACGACAATAT CTACATGTCG ACGATCGGCC GCATCTCGCT GACCACGATC
GACCAGTCGG CCCAGCTCAC CGGTTCCCGC AGCGCCCGGT TGCTGCTGGA GCGCATCGAC
GGCCGCACCC AACCGGTGCA CTACCTCATC GCGCCGCGCC TGGTGGCTCG CGACACGACC
GCCTGA
 
Protein sequence
MVSTPDSQEA RRVTIQDVAR EAGVSVSAVS KVVRDAYGVS EGMREKVTAA IDALGYRPHT 
GARAMRGRSY SVGVMLTELT SPFQPQIING ITAQFEPTPY QEILIAAGTS PDRQKRSIEA
LIDRQVDGLI VIAPWMEQAW LEKLGASLPT VVLARHGGSG TFDTIVGDDF EGARLMVDRL
VALGHRRIVH TSQPSGGLER PYVLSHTPRL DGYEETMRRH GLEPDVIVTS YSEEGGYEAA
RQALARPIPP TAIFAGADIA ALGVLRAAEE LGLRVPEDLS VAGYDNIYMS TIGRISLTTI
DQSAQLTGSR SARLLLERID GRTQPVHYLI APRLVARDTT A
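
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon that is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a minimal illustration, not the pipeline that produced this record: the function name `translate_cds` is hypothetical, and the `INITIATORS` set is an assumed subset of the table 11 start codons.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard table for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
INITIATORS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in INITIATORS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGTCTCGA CGCCGGACAG CCAGGAGGCG"))  # MVSTPDSQEA
```

The output matches the first ten residues of the protein sequence listed above.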