Gene Franean1_4833 details

Gene Information

Locus tag: Franean1_4833
Symbol:
ID: 5673174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5772147
End bp: 5775131
Gene Length: 2985 bp
Protein Length: 994 aa
Translation table: 11
GC content: 75%
IMG OID: 641243689
Product: LuxR family transcriptional regulator
Protein accession: YP_001509105
Protein GI: 158316597
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
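
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon, which is consistent with the values listed here but is not stated by the record itself. The helper name `cds_lengths` is hypothetical.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(5772147, 5775131))  # (2985, 994)
```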


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.010143
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCACTGG TTGAACGTGA CGACATTCTC CGTCAGCTCG ACCTTCTCCT GGCGGAATGT 
ACGGCGGGAA GCGGCCGCGT CGTATTGATC GACGGCCCGG TCGGCACCGG TAAGACGGAG
CTCGTCCGCT ACTGCGGCAC CCGGGCCGCC GGCAGCGGCG TGACAGTCCG CGCCGCGACC
TGCGCCCGTG CCGAACACGT CCTCCCGCTC GGAGTGGTCG GCCAGCTCCT GCGCGGCCTG
CCGGCGGGCG ACGGCGAGAC CGGCGCCGGC ACCGTCTGCA CCGGTCCCGA CGGTGCGGCC
GACCCGGACA CCGCCCGTGG CACGGACACA CTGTGCGAGG AGGCGACCCA GGACGGGTGC
CGAGGCCGCG AGGCGCGGGC GGACGGGGCG GGCCGGGCGG CCCGCATCGC GGAGCTCCTC
GACGTCGGTG CCGCGATCGC CGGCAGGCTG GGGCCGGATC CGGATCGCGA ACTGGCCCAG
ATCCACCAGG AACTCACGCT CCTCATCCTC GACCTCAGCA AGCGCGGTCC CGTCCTGCTG
TGCATCGACG ACGTGCAGTA CGCGGACGTC CCCTCCCTGC ACTTCCTGCT GCACCTCGTG
CGCCGGCTCG GATCGGCGCG CATCGCCGTG CTGCTCGCCG GTGATCTGGC TTCGCACCCG
CTGGACCTGC CGTTCCGCGC GGAGCTCGTC CGGGCGCCCG AGTTCCGGTC ACTGCGGGTC
GGCCCGCTCT CGCCCGCCGG CTCCCGCGAC CTCCTCGCCG CGGCCACGCC GGTCACTCCG
AACCCTTCTG CCACAGACCC GTACCAGGTC ACGGGGGGCA ATCCACTGCT GCTGACCGCG
CTGGTGCAGG ACGACCGCGG GTTCGGCCAG CCCGGTCCGG AGGTCTTCGG TCTGGCCCTG
CTGAGCTGTC TGCACCGCGG TGAGCCCGTC GCGGTCCAGG TGGCCCGGGC CCTGGCGGTG
CTGGAGACGC CGGTGCCCGA CGGGGTCGTC GCGCCGGACA CCGCGACGCT GGCGGGGATG
ATCGGCGCGG ACGTCCCCGC CGCGGCACGC GCCCTCGACG CCATGAATGC CGCCGGGCTC
CTCGATGACG GCCGCTTCCG CCACAAGGTC GCCCGGGACG CGGTGCTGGG CGATCTGACG
GCGTCCGAGC GGACTGATCT GCACCGGCGG GCGGCGCGGC TTCGCCACCA GCAGGGCGCG
CCGGCGGCGA CCGTGGCGGC ACACCTCGTC GAGGGCAACG ACGTGCAGCC CCCGTGGGGA
ACCGGCGTAC TCGTCGAGGC GGCCGAGCAG GCGCTGCTCG ACGGGCGGCC CGAGCGGGCC
GCCGCCTGCC TGAGGCTCGC GGCCCGGTCC GCCGCCGGTG AGCGGGAACG CGCCGCGATC
CGGGCCCGGC TCGCGCATGC CGAGTGGCAG ACGAGCCCGG CGGCGGCCGC GCGCCATCTC
TCCCCGCTGG TGAGCGCTGC CCACGCCGGC CTGCTGGAAC AGCGGACCAA CGCGGCCCTG
GTACGTCAGC TGCTGTGGCA CGGACGCTCG GCCGAGGCCG AGGATCTGCT CGCCCGGATG
CGTGCCGTGG CCCAGGCGGA GCCCGAGGAC GACGCGGCCG AGGTGCACGA CCTGGAGGTC
TGGCTCGCCA CCGTCCATCC GCCGCTGGCT CGCCGCCGGC GTGGGCCGGC CGCGGGCAGC
GCGGTGCCGG CGGCACCGGC GGCGGACTCG TGGCTGCGGT CAGCGGCGCT GCTCGCCGAC
GCGGTGGCCG CGGGCGGTCA CCGGACGGGG CGCGAGCCGG GGCCCTCCGG CATCACCGGC
GCTGGTGTGG CCGGCACCGG CCCGGTGGGG TCCGGTCTCG TCGGGGCGGA CCGGGCCGAG
AGTGCCCTGC GCGATCTGCA CCTGGCGCGA GCCGATCCGT GGGCCGGTGA GGTGGCGCTG
CTCGCCCTGC TGTTGCTGAC CAGGGCTCCG CGCATGGACG CCGCGGTCGC CTGGTGCGAG
CGCGTCCTGG CCGATCCCGA CCTGGAGGAC CAGACCGCGC GGGCGATGGC GACGGCGGTG
CGTGGCACCC TCGCGCTCGA GCAGGGCGAT CTGGCGGTCG CCGCCGACCA CGCCCGTGCC
GCGCTGCGTC GCCTGCCGGC CAAGGCGTGG GGAGTCGCGA TCGGCCTGCC GCTGGGCACC
CTGGTACTGG CCGCCACCCG GACCGGGGAC CTGGAGGAGG CGGCGAGGCA GCTCGTCCAG
ACCGTGCCCG AGGAGATGGT CGCCAGTCGC TACGGACTGG ACTACCTGCA CGCCCGTGGC
CACTACCATC TCGCCGCGAA CCACGCCCAC GCCGCGCTCG CCGACTTCCT GGCCTGCGGT
GACCTCATCC GCGGCTGGGG CCTCGACGCG GCGGCTCTCG TCCCGTGGCG CATCGGCGCC
GCCGAGGCGT GGCTGCGGCT GGGAAACGTG GACCAGGCCC GCCAACTGGC CCAGGAACAA
CTGAGCCGGC CCGGTGCCAC TCGGGTCCGC GGTCTGTCCC TGCGCCTGCT CGCCGCCGCC
AGCCCGCCCG GGCGCCGGCT TCAGCCGCTC ACCGAGGCGC TCGAGCTCCT CGAGGCCACC
GGGGACCGGC TCGGGCAGGC CTACGTCCTG GCGGACCTGA GCCGCGTCCA CGATCATCTC
GACCAGCGGC GGCGGGCGCG GCTGCTGCTG CGGCGGGCGC TGCACATCGC CACGATGTGC
GGTGCGCGAC CGCTCGCCCA GGAGCTCCTC GCGATCTCCG GCGACGGCAG AGCTGTCTTC
GGGCTCGGTG CCGACCAGGA GATGATCACC GGCCTGACGG ACTCGGAGCG GCGGGTGGCG
TCACTGGCGG TGATGGGCTA CACGAACCGG GAGATCGCGC TGCGGCTCTA CGTCACGCCG
AGCACGGTCG AGCAGCACCT GACCCGGGTC TACCGAAAGC TCAACGTCAA ACGCCGTCAG
GACCTCCCCG CCGACCTGTG GACGGACGCC ACCCACACCG GTTGA
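
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how one might do that and compute a GC fraction to compare against the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch counts only G and C characters, treating anything else as non-GC.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence; paste the full block to check the whole gene.
first_line = "GTGGCACTGG TTGAACGTGA CGACATTCTC CGTCAGCTCG ACCTTCTCCT GGCGGAATGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full sequence, `len(seq)` should match the Gene Length field, and the rounded GC fraction can be compared with the reported GC content.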
 
Protein sequence
MALVERDDIL RQLDLLLAEC TAGSGRVVLI DGPVGTGKTE LVRYCGTRAA GSGVTVRAAT 
CARAEHVLPL GVVGQLLRGL PAGDGETGAG TVCTGPDGAA DPDTARGTDT LCEEATQDGC
RGREARADGA GRAARIAELL DVGAAIAGRL GPDPDRELAQ IHQELTLLIL DLSKRGPVLL
CIDDVQYADV PSLHFLLHLV RRLGSARIAV LLAGDLASHP LDLPFRAELV RAPEFRSLRV
GPLSPAGSRD LLAAATPVTP NPSATDPYQV TGGNPLLLTA LVQDDRGFGQ PGPEVFGLAL
LSCLHRGEPV AVQVARALAV LETPVPDGVV APDTATLAGM IGADVPAAAR ALDAMNAAGL
LDDGRFRHKV ARDAVLGDLT ASERTDLHRR AARLRHQQGA PAATVAAHLV EGNDVQPPWG
TGVLVEAAEQ ALLDGRPERA AACLRLAARS AAGERERAAI RARLAHAEWQ TSPAAAARHL
SPLVSAAHAG LLEQRTNAAL VRQLLWHGRS AEAEDLLARM RAVAQAEPED DAAEVHDLEV
WLATVHPPLA RRRRGPAAGS AVPAAPAADS WLRSAALLAD AVAAGGHRTG REPGPSGITG
AGVAGTGPVG SGLVGADRAE SALRDLHLAR ADPWAGEVAL LALLLLTRAP RMDAAVAWCE
RVLADPDLED QTARAMATAV RGTLALEQGD LAVAADHARA ALRRLPAKAW GVAIGLPLGT
LVLAATRTGD LEEAARQLVQ TVPEEMVASR YGLDYLHARG HYHLAANHAH AALADFLACG
DLIRGWGLDA AALVPWRIGA AEAWLRLGNV DQARQLAQEQ LSRPGATRVR GLSLRLLAAA
SPPGRRLQPL TEALELLEAT GDRLGQAYVL ADLSRVHDHL DQRRRARLLL RRALHIATMC
GARPLAQELL AISGDGRAVF GLGADQEMIT GLTDSERRVA SLAVMGYTNR EIALRLYVTP
STVEQHLTRV YRKLNVKRRQ DLPADLWTDA THTG