Gene Franean1_0001 details


Gene Information

Locus tag: Franean1_0001
Symbol:
ID: 5675805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 71
End bp: 1654
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 69%
IMG OID: 641238929
Product: chromosomal replication initiator protein DnaA
Protein accession: YP_001504376
Protein GI: 158311868
COG category: [L] Replication, recombination and repair
COG ID: [COG0593] ATPase involved in DNA replication initiation
TIGRFAM ID: [TIGR00362] chromosomal replication initiator protein DnaA
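
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 71
end_bp = 1654
reported_gene_length = 1584      # bp
reported_protein_length = 527    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

A gene length that is not a multiple of three, or a protein length that disagrees with the codon count, would suggest a frameshift, a partial CDS, or a different coordinate convention.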


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.501183
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACC TCAGCGCCGA CTCCGTCGCC GGTCAGCCGT CCCGGGGAGC CGCGCCCACC 
TCGGAGCCGG ACCTGTCGGC GGTGTGGGAG CAGGCAGTGG CGGGCGTCGC GGACGGGACG
CTCTCCGGCC AGCAGCGCGC GTGGCTGAGG CTGACCCGTC CACTCGGCCT CGTCCAGGAC
ACCGCGCTGC TCGCCGCGCC GAACGAGTTC ACGAAGGACC TGCTCGACTC CCGGCTTCGA
CCGTTCCTCT CCACCGCGCT GTCCACGGCC TACGGCCGGG AGATCCGGGT CGCGGTCACC
GTCGAGCACA CGCCGGACCC GGAACCGATG ACCGGTCCCC TCACTCTCCC GGGCACCGAC
CCCGGCGGCG GCCCCCGCTC GCTACTCGGA ACCGGCCACG AGCCCCGGCC GGGGGAGGAG
CACCGAGGCG AGGACCGCCG GAACGAAGAT CGTCTCGACG GACGTCTCGA GGGTCGGGTC
GACGGGGCGC CCCCCGGGCG GATGGCCCCG GGCCTGGGCC GCGACCCGTC GCCTCGCCCG
TCCGAACCAG CCCGGCTCAA CCCCCGCTAC CTGTTCGAGA CCTTCGTCAT CGGTGACAGC
AACCGCTTCC CACACGCCGC CGCCGTGGCG GTCGCGGAGG CCCCGGCAAA GGCCTACAAC
CCGCTGTTCA TCTACGGTGA CTCCGGGCTG GGCAAGACTC ACCTGCTGCA CGCGATCGGC
CACTACACGG TCAAGCTCTA CCCCGAGAGC AAGATCAAGT ATGTGAGCAT GGAGGAGTTC
ACCAACGACT TCATCGCCTC GATCCGCGAC GACCGTCAGC TCGCGTTCCA GCGCCGCTAC
CGGGACATCG ACGTCCTGTT GGTGGACGAC ATCCAGTTCC TCGAGAACAA GGAGCGGACG
CAGGAGGAGT TCTTCCACAC CTTCAACGTC CTGCACGACA CCGAGAAGCA GATCGTCATC
AGCTCGGACC GCTCGCCCAA GCAGCTCTCG GCCCTCGAGG ACCGCCTGCG CAGCCGATTC
GAGTGGGGGC TGATCACCGA CGTCACGCCG CCTGACCTCG AGACGCGGAT CGCCATCCTC
TCGAAGAAGG CCGCCACCGA GCGGCTGCCG GTCCCCCCGG ACGTCCTCGA GTACATCGCC
ACGCACATCG AGCGCAACAT CCGCGAGCTC GAAGGGGCGC TCATCCGGGT CGCGGCCTTC
GCCAGCCTGA ACAAGTCCCA CGTCGACCGC ACACTCGCCG AGATCGTGCT GCGCGACCTC
ATCCCCGACG CGGCCAACCC GGAGATCACC GCCGCGGCCA TCATGAACGC CACCGCGGCG
TACTTCGGTG TCTCCATGGA GGACCTGTGC GGGACGTCGC GCAGCCGGGT CCTGGTGACA
GCCCGTCAGA TCGCGATGTA CCTCTGCCGT GAGCTCACCG ACCTGTCCCT CCCGAAGATC
GGCCAGCACT TCGGCGGACG GGACCACACC ACCGTGATGC ACGCCGACCG GAAGATCCGC
GGCCTGATGG CCGAACGCCG AGCGATCTAC AACCAGGTAA CCGAACTGAC CAACCGCATC
CGCGCCCAAG CCCGACACGC CTGA
 
Protein sequence
MTDLSADSVA GQPSRGAAPT SEPDLSAVWE QAVAGVADGT LSGQQRAWLR LTRPLGLVQD 
TALLAAPNEF TKDLLDSRLR PFLSTALSTA YGREIRVAVT VEHTPDPEPM TGPLTLPGTD
PGGGPRSLLG TGHEPRPGEE HRGEDRRNED RLDGRLEGRV DGAPPGRMAP GLGRDPSPRP
SEPARLNPRY LFETFVIGDS NRFPHAAAVA VAEAPAKAYN PLFIYGDSGL GKTHLLHAIG
HYTVKLYPES KIKYVSMEEF TNDFIASIRD DRQLAFQRRY RDIDVLLVDD IQFLENKERT
QEEFFHTFNV LHDTEKQIVI SSDRSPKQLS ALEDRLRSRF EWGLITDVTP PDLETRIAIL
SKKAATERLP VPPDVLEYIA THIERNIREL EGALIRVAAF ASLNKSHVDR TLAEIVLRDL
IPDAANPEIT AAAIMNATAA YFGVSMEDLC GTSRSRVLVT ARQIAMYLCR ELTDLSLPKI
GQHFGGRDHT TVMHADRKIR GLMAERRAIY NQVTELTNRI RAQARHA
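
The relationship between the two sequences can be illustrated by translating the first and last codons of the gene sequence. The sketch below is a minimal illustration, not the database's own pipeline. It assumes NCBI translation table 11, whose amino acid assignments match the standard code and which accepts GTG among its alternative start codons (an initiating GTG is read as M). The names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_start=True):
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    If is_start is True and the first codon is a valid start codon,
    it is translated as M regardless of its usual meaning.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (first three blocks of its first line).
head = "GTGACCGACC TCAGCGCCGA CTCCGTCGCC"
# Last line of the gene sequence; it begins at base 1561, so it is in frame.
tail = "CGCGCCCAAG CCCGACACGC CTGA"

print(translate(head))                  # MTDLSADSVA
print(translate(tail, is_start=False))  # RAQARHA
```

Both results match the corresponding ends of the protein sequence: the initiating GTG is read as M, and the terminal TGA is a stop codon that contributes no residue.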