Gene Franean1_1530 details


Gene Information

Locus tag: Franean1_1530
Symbol:
ID: 5669934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1832504
End bp: 1833715
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 61%
IMG OID: 641240450
Product: XRE family transcriptional regulator
Protein accession: YP_001505876
Protein GI: 158313368
COG category:
COG ID:
TIGRFAM ID:
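
The listed coordinates and lengths are mutually consistent, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Coordinates taken from the record above.
gene_bp, protein_aa = cds_lengths(1832504, 1833715)
print(gene_bp, protein_aa)  # 1212 403
```

The result matches the Gene Length (1212 bp) and Protein Length (403 aa) fields of this record.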


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0255908
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGAGA TCCGAGCGTT GGGAGACCGC GTCGCTCAGG TACGAGTACG ACGGTCCATG 
ACCCAGGCCG AGCTAGCGGA GCGGGCGGAC GTATCCGTTG ACCTGGTTAC GAAGCTTGAG
CAGGGGCAAC GTGATGGAAT CCGCATCACC ACGTTGCATA GCCTTGCCAG GGCGTTGGAA
GTACCTACAG CAACGTTCTT CAAGGTGGAG GTGGAGACAG CGATGAGCGA CAGCGAAGCG
CTGATGCCGC TCCGGCGGTT GCTACTACCC GGGCCCTCAG GCAGTCGGGC GGAGGAGCCT
GCGCTTGGGC TCAGGCCGCT ACGGGATCGG CTGGTTGCCC TGACTCAGGA CTACCATCAT
GCCCGGTACT CGCAAGCGGT ACGGACAGCT CCCGCGCTGA TTGAGGACAT CACGGCCGCG
AGCGCTCTCC ACCAAGGGGA AGATCAAGAA GCGGCTTACC GCCTTTTGTC GCATGCCTAC
ATCATGGCGG CAAGTGTGTT GATCCAGTTG GGGGGCGAAG ACCTCGCGTG CGAGGCCATC
CGGCGCAGCA TGGAAGCGGC AGACTCGGCC GGCGACCAGA TTCTCCGTGC TAGTGGTGTG
GTTTATTACC GCTGGGCTTT CATCAGACAA GGACGATTCG ACGATGCCGA GAGAGTGGCC
GTCGAGATGG CAGCAGAGAT TGAGCCGTCA ATCATGCGGT CCACCCCCGA ACACCTAGGC
GTATGGGGTC GACTTATGAC GGGTGCCAGC GCTGCCGCCG CGCGGAACAA CCGTCCGGAA
ACTGCCAGTG AACTACTATC GTACGCTCGG TCGTCGGCTG CGCGAATTGT CGATGGAAAG
ATGGACTATG CAAAATACTG GGCGGCATTT GGTCCATCTC AGGTGGATGC AATAGAGGTG
GAAAACGCCA TGACTCAGGG AGACGCGCCC CGCGCGTTGC AACTGGCAGG GTCCGTCCAC
CGGACCGAAA ATATGCCACT GAGTAACTGG ACACGTCATC TGTTGGCCGT AGCGGAAGCC
CAAACAGCCA CCAGAGACTA TCCCAATGCC ATACGGACCG TACAGGAGGC CCGTACGTTG
ACCCCTGAAT GGCTCCGAGA GCAGCGGCTA GCCGGGAAGG TGGTTCGGGA CCTCTTGGAC
GCGACCAGCG TCCGCCGTGC TCGCAAGAGC GGCCTAGCCG AGTTGGCTTC GTTCGTTGGG
GTTCAGCCCT AA
 
Protein sequence
MREIRALGDR VAQVRVRRSM TQAELAERAD VSVDLVTKLE QGQRDGIRIT TLHSLARALE 
VPTATFFKVE VETAMSDSEA LMPLRRLLLP GPSGSRAEEP ALGLRPLRDR LVALTQDYHH
ARYSQAVRTA PALIEDITAA SALHQGEDQE AAYRLLSHAY IMAASVLIQL GGEDLACEAI
RRSMEAADSA GDQILRASGV VYYRWAFIRQ GRFDDAERVA VEMAAEIEPS IMRSTPEHLG
VWGRLMTGAS AAAARNNRPE TASELLSYAR SSAARIVDGK MDYAKYWAAF GPSQVDAIEV
ENAMTQGDAP RALQLAGSVH RTENMPLSNW TRHLLAVAEA QTATRDYPNA IRTVQEARTL
TPEWLREQRL AGKVVRDLLD ATSVRRARKS GLAELASFVG VQP