Gene Franean1_5445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5445 
Symbol 
ID5673776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6587426 
End bp6590911 
Gene Length3486 bp 
Protein Length1161 aa 
Translation table11 
GC content78% 
IMG OID641244300 
Producttranscriptional regulator 
Protein accessionYP_001509706 
Protein GI158317198 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.202606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGAATTC AGGTCCTCGG ACCGGTCGAG GTGCTCCTGG GCGGCGCGCC GCTCGATCTC 
GGCCCACCGA AACAGCGCGC GCTGCTCGCC CTTCTCCTGG CGGCCGGCGA CACCGTCATC
CCGGTCGAGA CGATCATCGA ACGGCTGTGG GACGGCCAGC CGCCGGCCAG CGCGACGGCG
AGCCTGCAGG TGTACGTCTC GAACCTGCGC CGTCTGCTCG AGCCGGAGCG GGACCCCCGG
GCGCGGGCGA CCGTGCTGGT GACCCAGGCG CCGGGGTACG TGCTGCGCAC CGGCGCGGCG
GAGGTCGACG CCCGGCAGTT CGTCGCGGCG CTCGGCGCGG CCCGCGGCTC GCTGGAGGCC
GGGCGGCCGG CCGAGGCGCT GCCCGCCCTC GACGCCGCCC TGGCGCTGTG GCGCGGTGAC
GCCTACGCCG ACGTGCGGGA CGCGTCCTGG ATCAGGCCGG AGGCCGGACG TCTCGAGGAG
CTGCGGCAGA GCGCCGTCGA GGACCGCGCG CGGGCGCTCG TCGAGCTGGG GCGGCACGAC
GAGGCCGCCG CCGATCTGGA GGCCCTCACG GCGCGTCACC CGCTCCGCGA GCGGGCCTGG
GGGCTGTTCG CCCTCGCGAT CTACCGGTGT GGCCGGCCGG CGGAGGCGCT GGCCGCGCTG
CGGGCGGCAC GCGCCCACCT GGCCGAGGAG CTGGGCGTCG ACCCCGGGCC GGAGCTCGCC
GCCCTGGAAC TCGCCATCCT GCGTCACGAC CCCGCGCTGG CCTGGGCCGG TGGCGCCCAC
GAGCCGCCCG CGGCGCAGCC GGCCGTGTCT CCGACGACGG GGCTGGACGG CCCACAGCCT
GTGCTCCCCG ACGTGTCCCC CGCCGCGGCC GGGGCCCCTG AGCCGTCCGG CCCGTTCGTG
GGCCGCGACG AGGCGCTGGC GGCGCTGCTC GCGGCGATCC CGGCGCCGGC CCGGCGCGGG
CGCGTGGTGC TGGTCGACGG CGAGCCGGGT GTCGGCAAGA CGCGGCTCGT CGCGGAGCTG
GGCCGCCGGT GCGGCCGGTT GACGGCCTGG GGAGCGTGCC CCGAGCACGA GGTGGCGCCG
GCCCTGTGGC CCTGGGAGCA GAGCCTGCGC GCGGTGGCCG CCGCGCGGCC GGACGTCCCG
GTGCCGCCCG GTGTGGCCGC GCTCATCGAC CAGCGGGCGC AGGAGCCCGG CGGGTACGAC
GCGGCCGGAG CCCGGCTGCG GCTGTACGAG GGAGTCGCCG GCTACCTCGC GGCGGCCGCG
CCGTTGGTCA TCGTCCTCGA CGATCTGCAC TGGGCGGACG TCTCCTCGCT GCGGCTGCTG
GCCCATGTCG GCGGGGCGGC GCCGCGGGGC GTGATGGTGA TCGGCACGTT CCGCACCCAC
GAGGCCGGCG CGCTGGCGGA CACCCGCGCC GCGCTGGCCA GGTCCGGCTG CGAGCGGCTC
CGCCTCGACG GGCTGGACGA CGCCGCCGTC CGCGAGCTGA TGCGGGCCAC CACCGGCACG
GACCCGGGAC CCGAGACGGC GAGCGCCCTG CGCTCCCGTA CCGCGGGCAA CCCGTTCTTC
ATCGGTGAGC TCGTCCGCCT GCGCGGGCCC GGCCGACGCG GCCTGGCGGG GACCGGCGCC
GCCGGGCCGG CCACGGACCG GGGCGCGGAG CAGGACACGG CGCAGGGCGC GCTGCCCGAT
CACGTCCGCG ACGTGCTGTC CCGGCGGGTC GCGCGCCTGC CGGAGCCCGC CGCCGCGCTG
CTCGCCACGG CGGCGGTGGC CGGCGGCGAG TTCGACGCGG ACGTGGTCGC CCAGGTCGCC
GGGCACACCT TCGAGACGTC CCTCGACCTG CTCGACGCCG CGCTCGCCGC CGGGCTCATC
ACCGAGGCCG GCGACCGGCT GGGCCGGTTC CGGTTCTCCC ACGCGCTGGT CCGCGAGGCG
CTCGACGCCG GCCACTCCCG GCTGCGCCGC GCGCACCTGC ACCGCCGGTA CGGCGAGGTC
ACCGCCGAGC GGTACGCCGG ACGTCCCGAG CGGGCCGGCG AGGTCGCGCG GCACTGGCTG
GCCGCCGCCG AGCTCGGCGC CGAGACGGCC CGTGCCGCGA CGGAGCACGC CGCCCGCGCG
GCGCGCGCGG CGGCCGACCG GCTCGCGCCC GAGGACGCGG CCGGCTACTG GCAGGAGGCG
CTGGCCGCGG CCGAGCTGGC GGGCGCCGAC CGCGGCACCC GGCTGGAGCT CCTCCTCGGC
CTGGTCAGCG CCCGGTACGC GGCGGGCCAG CTCAACGACG GGCTCGACGT GGTGGACCGG
GCCCTCGACG AGGCCGGCGA CGATCCGCAC CTGATCGTCC GGGTCGCCGA GGCCGCGATG
GGTGGCTCGC CGTGGTTCCC GTTCCCCTAC GGCACCGACC GCGGCCGGCT GCACCGCGCG
CTCGAGCGGG CGCTCGGCGG GCTGCCGACC GCCGGCCGCG ACCGGGCGCT GGCCGTCGGC
TGCCTCGCGG TGCTCGAATC CCACGTCGGG CGGATGGCCG AGGCCGAGCG GGCCGGCGCA
CTGGCCGTGG CCGCGGCGCG GGACGTCTCC GACGACCCTG CGCTGCTGCC GCGGGTCCTG
CACCTGCGCA GTCTCACCGT GACCGGGGTC GACTACGCCG AGCACCGGCA CACCTGCGCG
CGAGAACTGG CGTCCCTGCC GTCGACACCA CCGGAGCTGC TGGTCGGCGC CCACCTCACC
CTCGTCGACA ACCTGGTCCG CTTCGGCCAC GTCGCGCGGG CGCGGGCCGC GCTGGGCGAA
GCGGACACGA TGATCAGCCG GCTCGGCTCG CCGACCCTCG CCTACCAGGC GGCAATCATG
CGGGCGGCGC TGCTGGCCTT CTCGGGCGAG CTCGAGGAGT CCGCGGACGT GGCGACCGCC
GCCGTCGGGC GCCTCGGGCT GGCCGCGCAG AACGGGGTCG AGAGCTCGTT CGTCGCGAAC
GTCATCGACC GGGCCCTCCA GGCCGGCACG CTGGCCCGGT TGGCCGACAC GCTGGCCCGC
TCGCTGGCCT CCACCGGCAT CGAGGCCCTG CGCGGGTCGC TGGCGCTGGC CCTCGCCGCC
GCCGGGCGCC CCGAGGCGGG ACGCGCCGCG CTCGCCGAGA TGCGGCTGCC GCCGCGCGAC
TACACCTGGC TCAGCGCGGT CGTCATGCGG CTGAACGCCG CGGTGGCCCT CGGCGTGCTG
GACACCGTCG AGGAGGACAT CGCCCACCTG CGGCCGCACT CGGGCGAGCT GGCGAGCCTG
GGCACCTGCA CAGCGGTCGT CGGGGCCGTC GACAGCCACC TGGGCGAGGC CTACCTGGCG
CTCGGTGACC ACATGGCCGC CCGGGAACAC CTGACCGCGG CGCTCGCCCT GCTCGAGGCG
AACGACTCCC CCTACTGGGC GGCAAGAGCA CACCAAGCGC TTGCCAAGTG CCCGTCAAGC
GGCAACCGCC ACGCTGAGTC CCATCCGAAG GAAAACGGGA GGGAACCTCG ATGGCTCGCG
CACTGA
 
Protein sequence
MRIQVLGPVE VLLGGAPLDL GPPKQRALLA LLLAAGDTVI PVETIIERLW DGQPPASATA 
SLQVYVSNLR RLLEPERDPR ARATVLVTQA PGYVLRTGAA EVDARQFVAA LGAARGSLEA
GRPAEALPAL DAALALWRGD AYADVRDASW IRPEAGRLEE LRQSAVEDRA RALVELGRHD
EAAADLEALT ARHPLRERAW GLFALAIYRC GRPAEALAAL RAARAHLAEE LGVDPGPELA
ALELAILRHD PALAWAGGAH EPPAAQPAVS PTTGLDGPQP VLPDVSPAAA GAPEPSGPFV
GRDEALAALL AAIPAPARRG RVVLVDGEPG VGKTRLVAEL GRRCGRLTAW GACPEHEVAP
ALWPWEQSLR AVAAARPDVP VPPGVAALID QRAQEPGGYD AAGARLRLYE GVAGYLAAAA
PLVIVLDDLH WADVSSLRLL AHVGGAAPRG VMVIGTFRTH EAGALADTRA ALARSGCERL
RLDGLDDAAV RELMRATTGT DPGPETASAL RSRTAGNPFF IGELVRLRGP GRRGLAGTGA
AGPATDRGAE QDTAQGALPD HVRDVLSRRV ARLPEPAAAL LATAAVAGGE FDADVVAQVA
GHTFETSLDL LDAALAAGLI TEAGDRLGRF RFSHALVREA LDAGHSRLRR AHLHRRYGEV
TAERYAGRPE RAGEVARHWL AAAELGAETA RAATEHAARA ARAAADRLAP EDAAGYWQEA
LAAAELAGAD RGTRLELLLG LVSARYAAGQ LNDGLDVVDR ALDEAGDDPH LIVRVAEAAM
GGSPWFPFPY GTDRGRLHRA LERALGGLPT AGRDRALAVG CLAVLESHVG RMAEAERAGA
LAVAAARDVS DDPALLPRVL HLRSLTVTGV DYAEHRHTCA RELASLPSTP PELLVGAHLT
LVDNLVRFGH VARARAALGE ADTMISRLGS PTLAYQAAIM RAALLAFSGE LEESADVATA
AVGRLGLAAQ NGVESSFVAN VIDRALQAGT LARLADTLAR SLASTGIEAL RGSLALALAA
AGRPEAGRAA LAEMRLPPRD YTWLSAVVMR LNAAVALGVL DTVEEDIAHL RPHSGELASL
GTCTAVVGAV DSHLGEAYLA LGDHMAAREH LTAALALLEA NDSPYWAARA HQALAKCPSS
GNRHAESHPK ENGREPRWLA H