Gene Franean1_4842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4842 
Symbol 
ID5673183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5806001 
End bp5808940 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content74% 
IMG OID641243698 
ProductSARP family transcriptional regulator 
Protein accessionYP_001509114 
Protein GI158316606 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0556456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGATTCC ACGTGCTGGG GCCGCTGGAA GTGATCCGGG ACGGCCAGCC CATCGTCGTG 
CCCGGCGTCC ATCAGCGTGC CACGCTCGGT TTCCTGCTGT TGCATCCGAA CACGGCGGTG
GCCACCAGCA GGCTGCTCCA GGCCCTGTGG GACCAGGAAC CGCCGGTGAC CGCGCGCAAG
ATGGTGCAGA ACGCGGTGGC GGGCCTTCGC CGGACGCTGG GCGGCGGGGC GGACGTGCTG
ACCCGCCCAC CTGGATACCA GCTGAGCATC CGCGCCGACC AGGTGGACCT GGCCGCCTTC
CAGGAACTGA GCGCGCGTGG CCGGGCGGAG CTGGCCGCCG GTTCGTGGGC TGCCGCTGCC
GCGACCCTGT CGGCCGCGCT CGGCCACTGG CGTGGGCCCG CGCTGGCCGA CCTGGCAGAG
GAGGGCCTGG CCTGGCCGGA GGCCGTCGCC CTGGACAGCG CACGCATGGT CACCTTCGAG
GACTGGGCGG AGGCGCAGCT CGCCCTCGGC CACCACCAGG AGCTCGTGCC CGAACTGGAG
TCGGCCGTCG AGACGACCCC GCTGCGGGAG CGGCTGACCG GGCTGCTGAT GTTGGCGCTC
TACCGCTGCG GCCGGCAGGC GGACGCGCTC GAGCGCTACC GGCGCACCCG GGGCGCTCTC
GTCGACCAGC TCGGTCTCGA TCCCGGGCAC GAGCTGCAGA ACCTGGAACG GGCGATCCTC
GACCACGAGC CCTGGTTGAG CTCACGGCCG ACGGGCGCGC TCACCTCCCG CCACGCCGGG
GTCCTCGCTC CGGGCCGGAT GATCGCGGGC AGCGGCGCTC TCGCGGGGAC TGGCGCTCTC
GCGAGGGCCG GTGCGGTCGC GGGAACCGGT GGCCTGGCGG CCGGCGGTGT GGTCGCGGCG
GCCGGAGTGG CTGGCATGGG CGGCGACGGC GGCGCGGCCG GTGCGCGGCT GCCGGAGGTC
TTCCCGGCCC GGGCTCCCGG CGTGCACGCC GCCCGGAAAT GGGTCAGTGT CCTCGCCGGT
CGACTCGACA TCGGGTCGGG GGCGGAGCTC TCCGATCCGG AGGACGCCGA CCGGGTCCTG
CGGGCCGTCA CCGCCACGGT CCGGGCCGAG ACCGCCCGTT TCGGCGGCGA GGTCCACACG
GCGCTCGGCT CGGTGTGGAT CGCGGTCTTC GGCCTGCCCC GAACCCGGGA GGACGACGCG
GAGCGCGCCG TCCGCGCGGG CCTGGCCATC CGGCGGGCGG TCGCCACATC GGCGAAGGCG
TCCTCGTACG GTGTCGGACA GTACGACCTG CGGATGGCTG TCACTACCGG GGAGGTCCTC
GCAACCCTCA CCCAGGCCAC GCCGGCGGCC GCGGCTTCGG CGGCTGCCGC CGCGATGTCC
GGGGATGTAA CCGGCCGGTG CATGCGCCTG CTGGCGTCTG TGCCCAGGGG TGAGCTGCGG
GTCTGCGAGG CGACCAGGGC GGCGTCCGAC GGGGCCTTCA CCCATGCTTC GACGGACGAG
CACGGCCGAG GCGTCGGCGG CGTGGTGGCG GTGCGCCTGG GCGGCTTGGC GAGTGCGTTC
GGGGCCGGTC CGGCGGGCCG TGAAGGAGCC CTCCACGTGC CATTCCTCGA GCGGGAGCAC
GAGAAGGACC TGATGCACCG GCTGCTCGCC GAGACCGCCC GCCGCCGCCG GCCACGGCTG
GTGACGATCT TCGGTGAGCC CGGCATCGGC AAGAGTCGGC TGCTGTGGGA GTTCTGCCGG
GAGGCCGGCG ACGCGGCTGT CTCGCGCGAC GTCCAGTTCC TGCACGGCCG GATCGGGAGG
TTCCAGAGCG GCCCGTTCGA GGTTCTTACG GACATCACCC GCACGTGGGC GGGGATCGCC
GAGGACGACT CCCGTGAACT GGCGGTGACG AAGCTCGCCG CCGCGGTCGG GATGACGGGC
GCCACCTTCG AAGTCTCGCG CCGCCTGGCG GCCGAGCTGG TTCCGCTGCT GGACCCAGCC
TGGCCGGGCG GCGAGGACGG AGCACGCGTG CTCGCGGCCT GGCGTCGGAC ACTTGGGAGG
ATCGCGGCCG TGGCCCCGAC GGTGCTCGTC GTCGAGGACC TGCACCGGGG AGACGACCGG
CTGCTCGACT GTCTGGAGGT GCTCGACGAG CAGCTGGGCC GCGTTCCGCT GCTTGTCGTC
GCCAGCACCC GGCCGGACCT GCTCGAACGG CGGCCGTGCT GGGCCGGCGG CAAGCGTGAC
GCGACGACGA TGTCCCTCGA CCAGCTCTCC GCCGCCGCCA TCCAGCATCT GATGGTCGAT
CTGGCGGTGT TGTACGGGCT GCGCCCCGCG TCGTCGGCGC CGCCGCTCGC GGGCGCGGGC
GGCGGCGGGC CCGCGCCCGA GACCGCGGCC CACGAGTGCG GGACGGTCGC CAAGGTCGGC
GGCAATCCGC TGTTCGCGGT CGCCCTCGTC GAGATGCTCC GCGCGGCACG GGCCGCCAAG
GGTGACGGCG TCTCCCCGGC TCTGCTGATC GACAGTGCGG GCCCGATCGG CCCGGGCTCC
GCGATGGTGT CGATCCCGCG CACGGTTCAC GGCGTGATCG CGGCGTGGCT GGACACTCTG
CCGTCACGCA GCCGTCTGGT GCTGCAGGGC GCGGCGGTGT TCGGCGCGAC CGTCTGGGCC
GCGCCGGTCG CGGCCGAGTG CGAGCTGACC CGCGAAGACG TCCTACGGGA GCTGGAGTAC
CTCGAACGCC GGGGCGTGCT GCGGCAGATG CCGACCGTTG CCGACCCCGA CCCGCACTAC
GAGTTCCGGC ATGCCTTCGT CCGGGACGTC GCCTACTCGC AGATCCCGCG TGCCGTCCGG
GGAGGGTGGC ACCTGTGCTT CGCCCGCTGG TTGGGTGACG AGTACGGGCG GGTCGCACAG
GACGATCTCC GCCGGCACCA CCATCGGCGC GCCGACAGCC TCACCGCCGC GGCCGGCTAG
 
Protein sequence
MRFHVLGPLE VIRDGQPIVV PGVHQRATLG FLLLHPNTAV ATSRLLQALW DQEPPVTARK 
MVQNAVAGLR RTLGGGADVL TRPPGYQLSI RADQVDLAAF QELSARGRAE LAAGSWAAAA
ATLSAALGHW RGPALADLAE EGLAWPEAVA LDSARMVTFE DWAEAQLALG HHQELVPELE
SAVETTPLRE RLTGLLMLAL YRCGRQADAL ERYRRTRGAL VDQLGLDPGH ELQNLERAIL
DHEPWLSSRP TGALTSRHAG VLAPGRMIAG SGALAGTGAL ARAGAVAGTG GLAAGGVVAA
AGVAGMGGDG GAAGARLPEV FPARAPGVHA ARKWVSVLAG RLDIGSGAEL SDPEDADRVL
RAVTATVRAE TARFGGEVHT ALGSVWIAVF GLPRTREDDA ERAVRAGLAI RRAVATSAKA
SSYGVGQYDL RMAVTTGEVL ATLTQATPAA AASAAAAAMS GDVTGRCMRL LASVPRGELR
VCEATRAASD GAFTHASTDE HGRGVGGVVA VRLGGLASAF GAGPAGREGA LHVPFLEREH
EKDLMHRLLA ETARRRRPRL VTIFGEPGIG KSRLLWEFCR EAGDAAVSRD VQFLHGRIGR
FQSGPFEVLT DITRTWAGIA EDDSRELAVT KLAAAVGMTG ATFEVSRRLA AELVPLLDPA
WPGGEDGARV LAAWRRTLGR IAAVAPTVLV VEDLHRGDDR LLDCLEVLDE QLGRVPLLVV
ASTRPDLLER RPCWAGGKRD ATTMSLDQLS AAAIQHLMVD LAVLYGLRPA SSAPPLAGAG
GGGPAPETAA HECGTVAKVG GNPLFAVALV EMLRAARAAK GDGVSPALLI DSAGPIGPGS
AMVSIPRTVH GVIAAWLDTL PSRSRLVLQG AAVFGATVWA APVAAECELT REDVLRELEY
LERRGVLRQM PTVADPDPHY EFRHAFVRDV AYSQIPRAVR GGWHLCFARW LGDEYGRVAQ
DDLRRHHHRR ADSLTAAAG