Gene Franean1_1585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1585 
Symbol 
ID5669988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1893060 
End bp1896002 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content76% 
IMG OID641240504 
Producttranscriptional regulator 
Protein accessionYP_001505930 
Protein GI158313422 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.79609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0274178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCGGA TCCGGCTGCT GGGCGAGGTC GGTGCCGTCA CGTCCGGCGG CGAGCCGGTC 
GACGTCGGCC CGGCCAAGTG TCAGGCGCTG CTCGCCGCGC TCGCGCTCGC GCCCGGCAAG
GTCGTGCCGG TCGGGAGGCT GGTGGAGCTG GTCTGGGGCC CGCACCCGCC CAGGACGGCC
GACCGCACCC TGCAGTCGTA CGCCACCCGG CTTCGCAGGG CGCTCGGTCC CGGTTCGATC
CGCCGGGTCG GCGCCGCCTA CCTGCTGGAT GTCGTGCCGG AGGCGGTGGA CGTCGTCCGC
TTCCAGCGAC TGCTCGACGA GCGGGACGTC GCCGCCGCCC TCGCCGAGTG GGCGGGCCCG
CCGCTGGCCG GGCTGGTCGT CCCCGGCCTG GCCGGAGCCG TCGACGGGCT GGTCGAGCAG
TGGCTGGCCG CCGTGGAGGT GGATCTGGCG CGCCAGGTGG AGACCGACGC GCGTTCCGCG
CTGGGCCGGC TCACCGAGCT GACCGCCGCC TACCCGCTCC GCGAGGGCCT GTGGGCGCTG
CTCATGACGG CGCTCTACCG GGCCGGGCGC CCGTCCGAGG CCCTGGCGGC GTTCCGCGCC
GCCCGCCGGC ACCTGGTCGA CACCCTCGGG GTGGAGCCGG GGCCGGCGCT ACGCGAGCTC
GAGGCACTGG TCCTGGAGCA CGACAGACGG CTCCTGCCGC CCGGCGAGGG ATTCGCCCAG
GTCGGCGAAC ATCCCGTCCC GGGCGAGCGC CCGGGCAATC TTCCCCGCCG GCTGGGGCGG
CTACTCGGCC GCGATCATGA TCTGGAGGCC GTCGGCGGCG CGCTGGCCGT CTCACCCGTC
GTGACCCTGG TGGGGCCGGG TGGGATCGGC AAGACCCGGC TGGCCCTCGC GGCGGCGGCG
CGGCACCTCG ACGCGGACGG CGTCTGGCTG GTCAAGCTCG CCGAGATCGC GGCGCCGTCC
GATGTGGCGC GGGCGGTCGC CGGTGTCGTG GGCGTGACGG AGAGCTCCGG GCGCCCGCTG
GGCGAGTCCG TCGTGGCGGC GCTGCGCCAT CGCCGGGTGC TGCTCGTCCT GGACAACTGC
GAGCATGTCG TCGACGGCGC CGCGGCGCTC GCCCAGGCGA TCGCCGAGGG CTGCCCCGAG
GTGCGGATCC TCGCCACCTC CCGGGAGCGG CTCGACCTCG GGCACGGCTT CGAGCGGCTC
GTCGCCGTGG GCCCGCTGGA GCCCGCAGGG GCGGCCGCCG AGCTGTTCGC GCAGCGTGCC
CGCGCGGTCT TCCCCGCCTT CGACGCGCGG GCCGGCCGCG CCGAGATCAC CGAGATCTGC
CGCCGTCTCG ACGGCCTCCC GCTGGCCATC GAGCTGGCCG CCGCCCGCAC CGCGGCCCTC
GGCCTCACCG ACCTGCTCGA ACGTCTCGAC GACCAGCTGC GCCTGCTCGT CGGCGGCGCG
CGGACGGCCG ACGAGCGCCA CCGGACGCTG CGCGCCACCG TGCGGTGGTC CTACGACCTG
CTCGCTCCCG CCGCGCAACG CCTGTTCCGG CAGCTGTCCG TCTTCGCCGG GCCGTTCGAC
CTGGGCGCAG CCGCCGCCGT CGCGGCAACC ACCGTCGCGG CAACCGCCGA AAGCAACGGT
GCGGACGGCT CGCGCGCCGC CGGCTGGCCG GGCGACCCGG GCACGGCCGA CGTGGGCGGC
GCGGGCGGCG CGATGGGCGA GGTGGAGGAC CTGCTCGGTG ACCTCGTGGC GCGGTCGATG
CTCGTGGCGG AGACCGGGCC GTTCGGGCGG CGCTTCCGGC TTCTGGAGAC GATGCGCCAG
TTCGCCGCCG AACGGCTCGC TGATGCCGGT GAGACCCGGC AGGCGACCGC ACGGCACGCC
CGGTGGTGCC TGAAGGCGGC CACCCGCGTC CAGGTGCTGC TCGCCGGTCC GGCCGAGGTC
GAGGGCGTCG CCCGTCTCGA CGAGCTCTGG CCCAACCTGC GGGCCGCGTT CGACCGGGCG
TGCGCGGCCG GTGACACGGC GCTGGCCCGG GCGCTCGTGC GCCGCGTCGT CGTGGAGATC
GTCCGCCGCA GCCGCCACGA GATCGGCGAC TGGATCGAAC GGCTCCTGGC GCTCGGGCAG
CCGGGGGCGG AGCCGGACCC CGACCTCGTC GTCTTCGCGC TCACCTGGGC GGCGCAGCGC
TACAAGATGA GCCAGGACAG CGCCGCCTAC GAACGACTCG TCGAACGCCA CGGCGAGCCG
GACCATCCGT TGGTCCGCCA TGCGCGGGCG TCCGTCTACG AGGACTACCC GGCCCATGTG
ACCTGGGCCG CGCCGGCCGT GGCGGAGCTG CGCCGGCACG GCGCGGACGA CCTGGCCGAG
CAGTTCGAGC TGGACGTCGG CGCGGCCCTG CTGTTCACCG GCCGGTTCGC CGAGCACGAC
GCCGCGGTCG GGGCGCTGGT CGAGCGGTAC CGCCGGCAGG GGCCGCCCAC CCTGCTGAAC
CTGACTCTGG TGCTGCTCGG CTACTCGGCG CTGCTGCGCG GCCGGCACGA CCACGCCGAC
CGGCTGTTCG ACGAGGCCGT CGGCGTCGAG GTGCCAGCCC GGACGCACTC GCCGAACCGG
GCCGTCCAGG CCCGGGCGGT CTTCCGACGC GGTGAGCGGG CACAGGCGTT CGCCATCCTG
CACTCGCACG TCGAGCAGCT GCTCGACGAC GGGAACATGC AGGCCGTCTG CGTCACCACG
GTCGAGTTCG TCAACATGGC GGCGGCCGTC GGCCGGCTCG AGGACGCGGC ACGCATGCTG
GGCTTCCTCG AGACGACCGG CCTGCTCGAG CATCCCGCGT GGCGGACGCT CGTCGACGGC
GCGACGGCCC GGGTCGCCGC CGACCCCCGC GTCGACCCGG AGCGGGAACG GGCCGGCGGC
CGGCGGCTGG ATGACCGCCA GGCGCTGGAG TACATGCGCG GCGTCCTGGG CCTCCTCACC
TGA
 
Protein sequence
MLRIRLLGEV GAVTSGGEPV DVGPAKCQAL LAALALAPGK VVPVGRLVEL VWGPHPPRTA 
DRTLQSYATR LRRALGPGSI RRVGAAYLLD VVPEAVDVVR FQRLLDERDV AAALAEWAGP
PLAGLVVPGL AGAVDGLVEQ WLAAVEVDLA RQVETDARSA LGRLTELTAA YPLREGLWAL
LMTALYRAGR PSEALAAFRA ARRHLVDTLG VEPGPALREL EALVLEHDRR LLPPGEGFAQ
VGEHPVPGER PGNLPRRLGR LLGRDHDLEA VGGALAVSPV VTLVGPGGIG KTRLALAAAA
RHLDADGVWL VKLAEIAAPS DVARAVAGVV GVTESSGRPL GESVVAALRH RRVLLVLDNC
EHVVDGAAAL AQAIAEGCPE VRILATSRER LDLGHGFERL VAVGPLEPAG AAAELFAQRA
RAVFPAFDAR AGRAEITEIC RRLDGLPLAI ELAAARTAAL GLTDLLERLD DQLRLLVGGA
RTADERHRTL RATVRWSYDL LAPAAQRLFR QLSVFAGPFD LGAAAAVAAT TVAATAESNG
ADGSRAAGWP GDPGTADVGG AGGAMGEVED LLGDLVARSM LVAETGPFGR RFRLLETMRQ
FAAERLADAG ETRQATARHA RWCLKAATRV QVLLAGPAEV EGVARLDELW PNLRAAFDRA
CAAGDTALAR ALVRRVVVEI VRRSRHEIGD WIERLLALGQ PGAEPDPDLV VFALTWAAQR
YKMSQDSAAY ERLVERHGEP DHPLVRHARA SVYEDYPAHV TWAAPAVAEL RRHGADDLAE
QFELDVGAAL LFTGRFAEHD AAVGALVERY RRQGPPTLLN LTLVLLGYSA LLRGRHDHAD
RLFDEAVGVE VPARTHSPNR AVQARAVFRR GERAQAFAIL HSHVEQLLDD GNMQAVCVTT
VEFVNMAAAV GRLEDAARML GFLETTGLLE HPAWRTLVDG ATARVAADPR VDPERERAGG
RRLDDRQALE YMRGVLGLLT