Gene Francci3_0001 details

Gene Information

Locus tag: Francci3_0001
Symbol:
ID: 3902947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 35
End bp: 1723
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 68%
IMG OID: 637877331
Product: chromosomal replication initiator protein DnaA
Protein accession: YP_479125
Protein GI: 86738725
COG category: [L] Replication, recombination and repair
COG ID: [COG0593] ATPase involved in DNA replication initiation
TIGRFAM ID: [TIGR00362] chromosomal replication initiator protein DnaA
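
The start, end, gene length and protein length listed above agree with one another, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(35, 1723)
print(length)                  # 1689
print(protein_length(length))  # 562
```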


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000690317
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCAACC TCCGCGCCGA CTCCGTCGCC GGTCTGCCGT TCGGGGACGA GCCTTCGGGT 
GACCCGGACC TGGCCGCGGT GTGGAGCCAG GCCGTCGCCG GGGTCGCCGA CGGCACGCTG
TCCGCCCAGC AGCGTGCCTG GCTGCGGCTG ACCCGCCCCC TCGGGCTCGT CCAGGACACG
GCTCTGCTGG CCGCGCCGAA CGAGTTCACC AAGGATCTCC TCGACTCGCG CCTGCGCCCC
TTCTTGTCCA CAGCGTTGTC CACAGCCTAT GGGCGGGAGA TCAGGGTCGC GGTCACCGTC
GAACACCTGC CCGATCCGGA ACCAATGAGC GGACCGATCC GGATCGTACG GCCGGTGGAT
GCCAGGGGCG ACACCACACC CGGCCAGGGC TCGGGCCCCG CCTCCGGTTC GGCGTTGAAC
GCGGGTACCG GATCAGGATC GACCGGCGCC GCCGCAGCCC CGGTGCCGCC GACGAGCCCG
GGCTCGTCGG CGGTGCCGGT GCCGGCGCCG GCACCGGCAC CAGTGCCGCC GGCGCCGGCG
GCACTGGTGA ACGGCGAACT GCCCTTCCCC GACGCCACCG AGGGAACACC ACCGGTACGG
GTCAGCGCGG GTCTCGGACG CGATGCGGCG CCGCACGAGA CCGAACCGGC CCAGGCCCGG
CTGAACCCCC GCTACATTTT CGAGACGTTC GTCATCGGCG ACAGCAACCG GTTCCCCCAC
GCGGCAGCGG TGGCCGTCGC CGAGGCACCC GCGAAGGCCT ACAACCCGCT TTTCATCTAC
GGGGACTCCG GGCTCGGCAA GACTCACCTT CTGCACGCGA TCGGTCACTA CGCACTCAAG
CTCTACCCGA ACATGCGGGT GAAGTACGTG AGCTCCGAGG AGTTCACCAA CGACTTCATC
AACTCGATCC GGGACGACCG CCAGCAGGCG TTCCAGCGGC GCTACCGTGA CATCGATGTC
CTGCTCGTTG ACGACATCCA GTTCCTGGAG AACAAGGAAC GGACGCAGGA GGAGTTCTTC
CACACCTTCA ACGTCCTGCA CGACGGCGAG AAGCAGATCG TGATCAGCTC CGACCGCTCG
CCCAAGCAAC TCTCGGCCCT GGAGGACCGG CTGCGCAGCC GCTTCGAGTG GGGGCTGATG
ACCGACATCA CCCCGCCCGA CCTCGAGACG CGCATCGCCA TCCTGTCGAA GAAGGCGGCT
ACGGAGCGCC TGCCGGTACC CCCGGATGTC CTCGAGTACA TCGCCACGCA CATCGAGCGC
AACATCCGTG AGCTGGAGGG GGCGCTGATC CGGGTCGCGG CCTTCGCGAG CTTGAACAAG
TCCCACGTCG ACCGCACGCT CGCCGAGATC GTGCTGCGTG ATCTCATCCC CGATGCCGGC
AATCCCGACA TCACGGCCGC CGCCATCATG AACGCGACGG CGGCGTACTT CGGCGTCTCG
ATGGAGGACC TGTGCGGCAC CTCACGTAGC CGCGTGCTGG TCACCGCCCG TCAGATCGCG
ATGTACCTGT GCCGGGAGCT GACCGACCTG TCGCTACCGA AGATCGGCCA GCACTTCGGG
GGTCGGGATC ATACGACGGT CATGCATGCC GATCGCAAGA TCCGCGGTCT GATGGCGGAA
CGGCGCGCGA TCTACAACCA GGTCACCGAA CTGACAAACC GCATCCGTCT GCAGGCCCGG
CAGGCCTAG
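
The GC content reported in the gene information can in principle be recomputed from a sequence block in this layout. The following is a minimal sketch: it assumes the sequence is given as whitespace-separated groups of bases, as above, and uses a hypothetical helper name `gc_percent`. It is shown on a toy input, not on the full gene sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input in the same space-separated layout (not the real gene).
print(gc_percent("GGCC AATT"))  # 50.0
```

Passing the full gene sequence to the function gives a value that can be compared with the rounded percentage listed in the record.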
 
Protein sequence
MSNLRADSVA GLPFGDEPSG DPDLAAVWSQ AVAGVADGTL SAQQRAWLRL TRPLGLVQDT 
ALLAAPNEFT KDLLDSRLRP FLSTALSTAY GREIRVAVTV EHLPDPEPMS GPIRIVRPVD
ARGDTTPGQG SGPASGSALN AGTGSGSTGA AAAPVPPTSP GSSAVPVPAP APAPVPPAPA
ALVNGELPFP DATEGTPPVR VSAGLGRDAA PHETEPAQAR LNPRYIFETF VIGDSNRFPH
AAAVAVAEAP AKAYNPLFIY GDSGLGKTHL LHAIGHYALK LYPNMRVKYV SSEEFTNDFI
NSIRDDRQQA FQRRYRDIDV LLVDDIQFLE NKERTQEEFF HTFNVLHDGE KQIVISSDRS
PKQLSALEDR LRSRFEWGLM TDITPPDLET RIAILSKKAA TERLPVPPDV LEYIATHIER
NIRELEGALI RVAAFASLNK SHVDRTLAEI VLRDLIPDAG NPDITAAAIM NATAAYFGVS
MEDLCGTSRS RVLVTARQIA MYLCRELTDL SLPKIGQHFG GRDHTTVMHA DRKIRGLMAE
RRAIYNQVTE LTNRIRLQAR QA
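
The gene sequence starts with GTG while the protein starts with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first and last codons of the gene sequence. It assumes the standard amino-acid assignments in NCBI codon order (T, C, A, G), and it handles only GTG and TTG as alternative starts, which is a deliberate simplification. The names `translate` and `ALT_STARTS` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only a subset of the alternative start codons is handled here.
ALT_STARTS = {"GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and
    stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in ALT_STARTS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGTCCAACC TCCGCGCCGA CTCCGTCGCC"))  # MSNLRADSVA
# Last 9 bases, ending in the TAG stop codon.
print(translate("CAGGCCTAG"))                          # QA
```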