Gene Franean1_3827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3827 
Symbol 
ID5672191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4545250 
End bp4548558 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content74% 
IMG OID641242706 
Producttranscriptional regulator domain-containing protein 
Protein accessionYP_001508126 
Protein GI158315618 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2909] ATP-dependent transcriptional regulator
[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0262799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGG CGGGGGATGC GTCGCGGCGT GTACAGATCT ATGCCGGCGG GAATGCCGAA 
CTTGACGCCG CGCCTGGAAC AACCTCACGG GGAACGGCGA TGACCGAGGC GCCGGTTCCC
CCGATCCTCG CGGCCCGCGT TCGGCTACCC GCCCCCGCCG GCATGCACCG GCCGCGGCTG
GCCGGACCGC TCATCGTCCC GGCCGCCTAC CGGGTCGGAA CCGTGGTCGC GCCGGCCGGC
GCCGGGAAGA CAACCCTGCT TGTGGAGGTG GCGCAGGCGT GTGCGTGGCC GACCGCCTGG
CTCACGCTCG ACGACCGGAT CGGCGGCCTC GACTCGTTCC TCGCCCACCT GCATGCCGCC
GCGGCAGCGG CGGCCGGTCT GCCCGCCGGA TCCTGGACCA GCATCGAGAA CGCCATCGTG
GACCTCGACC GGCACCTGAC CGACAACCTG CTCATCGTCC TGGACGACCT GCACGCGGTG
GACGGACAGC CGGCCGAGGC GGCGGTCCAA CTCCTGCTCG ACTACCTCCC CCCCAAGGTC
CGGGTTCTCG TCGCCGCCCG CTGGCGCCCG AACCTCGATC TGCACCGGCT GCGTCTCGCC
GGGCAGATAC GGGAGGTGGA CGCCGACGCG CTGCGTTTCC GGACCTGGGA GGTCGAGGAG
CTGTTCCGGG ACTGCCACGG CGTCCGGCTG CGACCCGAAG AGGTAACCGC GCTGACCCGC
AGCACGGACG GCTGGGCCGC CGGTTTACAG CTGTTCCACC TCGCCACCCG CGGCCGGCCC
GCGTCCGAAC GGGCGAAGAT TCTCTCCGGG CTCGGTAACA CGCGGCTCAC CCGCGAATAC
CTGACGATGC ACGTGCTGTC CGCGGTGAGC GCGGACGAGC GGGACTTCCT GGTGCGGACG
AGCGTCTTCG ACCGGCTGAC CGAGAAACGC TGCGACAGCC TGCTGGGACG CACCGGGAGC
GGTCTTCAGC TCGCCGAGCT GGAACGGCGT GGGCTGTTCA CCTTCGTCGA GGACGACGGG
CGGACCTACC GTTACCACGA GGTGTTACGA ATTCACCTGC TGGAACGGCT CGTGCTGGAA
CACGGGGAGG AACGAGCCCA GGAGGTGCAC GGGGCGGCGG CGCGGCTGTG TGAGGAGGAC
GGCGCGCTCA CCGAGGCGCT CCTGTCGTAC TGCCGCAGCG GGGACTGGGA GCAGGTGCGG
CGGCTGCTCG GCCAGGGCGG CCGGCGCCTG GCCGACGACC CGGCCGCCTG GTTCGAGCTG
CTGCCGGCCG CGATCCGCGA CTCCGACCCG TGGGTGCTGC TCGCGATGGC CCGCCGGCTC
GCCGCGGACG GGTCGCTCGA AGCCGCGGCC GATACCTATC GCGATGCCGC GACGGCGTTC
GGTTCCGAAA CGCCGGCCCG GGTTCACCGC GAGCTCGCCG AACTCGACGA CTGGCTCCAC
CCCGCGCCGC GCCGCGGCAC GACCTGGCTG CGGACTCTTC GGGCGGTGCT GGCCGATCCC
GGCCCGCATG TCGAAGGATC CGCCGACCCC CCGCAGACGG CGCTCGTGCG CGGGATTGCC
GCCTTCGTCG CGGGAGAGAT CGCGCTGGCC CGCCAACGCT TCGACACGAT CACCGCGGGT
ATCGGCGCAA GTCCCGTGAT CGAGACGGTC GCCGCGGTCG GGCGGGCCGC GTGCGCCCTG
CTCACGCAGG ACGCACAGGC CGGCGAGACG GTCGAGGAGG GAGTGGCCGC GGCGGAGCTG
CTCGGGCATG CGGCCGTGCT CCGCGTTGCG GCCGCCGTCC GCGCCCTGTA CTTCGGCTAC
GGCAACCGGC ACGTCCTGCA CCTGCTCGAC GCCGCCGGCC GGGCCGGTGA CCGGTGGGGG
GTGGGGGCTC TCGTCCTGCT CGGCAGCTTT GCCGGCCTCC CGGTGCCGGA CGACGTCATG
TTCCCGGCCG ACGAGCTTCC CGCGCCCGCC CTGCCGCCGT CGACCGGCCT GGCGCCGGGG
CAGGTGTGGC CGGACGACGA GCCGATGCAC CTGCGCGCCT ACCGACAGCT GGCGATGGCA
CTGGACCACA CGGCAGGGAT CTTCGACGAG CTGACCGCCG GTGCGCTCGC GACCTGGGCG
CGGGCGTCCG CGCTGCTGGC CTGGCGGCGG GCGGGCGCGC CGCTGGACGA GGACGAGGCA
GCGCGGGTGG TCCGGGCAGC GGCCCGGCTG GGACCGGCCC CGCACGCCCT CGCCCTCGTC
GGTAGCGTCC CACCGCCCTC GGCCGCCCCA CCGCCGTCGC ACGCGGCCGG TCTGCCGCCG
TCGCCCTCTG CGGCGTCCGG TGCCCCGTTC GCCCCAGGGT CCGGCGCGAC ATCCGGCCAG
ACCCATCCGC ACGAGGTGGC GCGCCGGATC GCCGCCGAGG CGGGCGTCGG AAGCTGGGTG
GAAAGGCTGA TCGCGTCGCT GTCGGGCGGC GGCCGCCCGC CCGCTGCCGT CATGCCCGCG
GGCATGACGG GCACCGTCGC TGCCGCGGTC GACGCCGACG CGGCCAGCCC GGCGCCGGCC
GGTGAACCGC CCGCTGCGGA GCGCCGGCTG ACCGTCCGCT GCCTGGGTCG GTTCGAGCTG
GAGGTCGACG GAGTGCCGGT CGGCCTGGCC GGGGTACAGC CGCGCAACCT CGAGCTCCTT
CATGTGCTCG CGGTGCACGC GGACCGGCCG TTGCATCGGG ACCAGCTGAT CGAGCTGATC
TGGCCGGACG CCGCGCCGGC ACAGGCGAAC CACAGCCTGC AGGTCGCGGT CAGCGCCCTG
CGCGGGCTGC TCGAACCGGC GGTCCCGCGC GGCCGGCGCG GGCTGCTCCG CAGGGCCGGG
CAGGGCTACC TCCTGGCCCT CGTCGATCCG GACGACCACG ACATCCGCCG GCTCGAACGA
CTGCTCGATG CCGCGGCCGC GGCACGACGC TCCGGTGACG GCGAGGCCGA GTACCAGCGG
CTGGCGAGCG CGATCGAGCT CTACGCCGGG GACGTCCTGC CCGGCGGTGG CCCGGCGGAC
TGGCTGCTGG CCGAACGGGA CCGGCTGCGG ACGACCATCG CCGCGGCCTG TGAACGCGCC
GCGGCCATCT CGACCGAGAC AGGCCGGCAC AAGGATGCGG TCCGGCACGC CGAACTGGGG
CTCGAGGTCG ACCGGTACCG GGACGGGCTG TGGCGGACGC TCGTGGTGGC GCTCCGATCG
GGCGGGCAGC CGGTCGCGGC CGCCCGCGCG GAGGCCCGGT ACGAGGCGAT GCTCGCCGAC
CTGGGCATCG AACTCACACC GGCCGAACCT CCACCCGGTG CGGGACATGG GCCGGCTTGG
CCGACCTGA
 
Protein sequence
MPEAGDASRR VQIYAGGNAE LDAAPGTTSR GTAMTEAPVP PILAARVRLP APAGMHRPRL 
AGPLIVPAAY RVGTVVAPAG AGKTTLLVEV AQACAWPTAW LTLDDRIGGL DSFLAHLHAA
AAAAAGLPAG SWTSIENAIV DLDRHLTDNL LIVLDDLHAV DGQPAEAAVQ LLLDYLPPKV
RVLVAARWRP NLDLHRLRLA GQIREVDADA LRFRTWEVEE LFRDCHGVRL RPEEVTALTR
STDGWAAGLQ LFHLATRGRP ASERAKILSG LGNTRLTREY LTMHVLSAVS ADERDFLVRT
SVFDRLTEKR CDSLLGRTGS GLQLAELERR GLFTFVEDDG RTYRYHEVLR IHLLERLVLE
HGEERAQEVH GAAARLCEED GALTEALLSY CRSGDWEQVR RLLGQGGRRL ADDPAAWFEL
LPAAIRDSDP WVLLAMARRL AADGSLEAAA DTYRDAATAF GSETPARVHR ELAELDDWLH
PAPRRGTTWL RTLRAVLADP GPHVEGSADP PQTALVRGIA AFVAGEIALA RQRFDTITAG
IGASPVIETV AAVGRAACAL LTQDAQAGET VEEGVAAAEL LGHAAVLRVA AAVRALYFGY
GNRHVLHLLD AAGRAGDRWG VGALVLLGSF AGLPVPDDVM FPADELPAPA LPPSTGLAPG
QVWPDDEPMH LRAYRQLAMA LDHTAGIFDE LTAGALATWA RASALLAWRR AGAPLDEDEA
ARVVRAAARL GPAPHALALV GSVPPPSAAP PPSHAAGLPP SPSAASGAPF APGSGATSGQ
THPHEVARRI AAEAGVGSWV ERLIASLSGG GRPPAAVMPA GMTGTVAAAV DADAASPAPA
GEPPAAERRL TVRCLGRFEL EVDGVPVGLA GVQPRNLELL HVLAVHADRP LHRDQLIELI
WPDAAPAQAN HSLQVAVSAL RGLLEPAVPR GRRGLLRRAG QGYLLALVDP DDHDIRRLER
LLDAAAAARR SGDGEAEYQR LASAIELYAG DVLPGGGPAD WLLAERDRLR TTIAAACERA
AAISTETGRH KDAVRHAELG LEVDRYRDGL WRTLVVALRS GGQPVAAARA EARYEAMLAD
LGIELTPAEP PPGAGHGPAW PT