Gene Francci3_0048 details

Gene Information

Locus tag: Francci3_0048
Symbol:
ID: 3903527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 59204
End bp: 62632
Gene Length: 3429 bp
Protein Length: 1142 aa
Translation table: 11
GC content: 72%
IMG OID: 637877378
Product: adenylate/guanylate cyclase
Protein accession: YP_479171
Protein GI: 86738771
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
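
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(59204, 62632)
print(gene_len, protein_len)  # 3429 1142
```

Under these assumptions the result agrees with the Gene Length (3429 bp) and Protein Length (1142 aa) fields.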


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGC CAGGCTGGCC GCCGGCGGAT GAGCCGTCCG GGTCGCGCCG TCCCGGCCGG 
GACCCGGCGG TGGCACCGGC GTGGGATGGG TCACCTCAGG CGCGTCGGGT ACGCGCATCC
CCCACCGCAT CGGGTCGCGA TCGGCCTACC ATCCGGCACA TGGTGCCGTG CCCGACATGC
GCGGAGGACA ACCACGAGCG GGCCCGCTTC TGCTCCGGAT GTGGGAACCC CCTTCCCACC
CCGGGGAGCG GCGCCCGGAA GACCGTGACG ATCATGTTCG TGGACATCAC CGGGTCCACC
GACATCGGCG AGAGAATCGA CTCCGAACCG CTGCAGCAGG TCATGTGGCG GTTCTTCACC
ACCGTCCGTG AAGTGATCTA CATCCACGGC GGATCGGTCG AGAAGTTCAT CGGCGACGCC
GTCTTCGCCG TCTTCGGCAT TCCCGTGCTG CACGAGGACG ACGCGCTGCG CGCGGTTCGC
GCCGCCCTCG ACATCCGGGC GGCGATGGAG GCGCTCAATG CCGACCTGCA GCGCGAGTGG
GGCCTGAAAC TGCGGGTGCG GATCGGGATC AACACCGGCG AGGTGACCGT CGCAGGCGGT
GGCGTCACCG GCGATCCGGT GAACGTCGCC TCCCGGCTGG AACACGCCGC TCCCCCCGAC
GAGATCCTCA TCGGGGACAC CACCTACCGC TTCATCCGGC ACAGCGTCAC GGTAAGCCCG
CTCGGCCCCC TGGTCGTCCA GGGCAAGCGC GACCCGCTGC GGGTGCACCG GTTGATCGGG
CTGGTAGACC CGACCGCGGG TCCCGGCAGC CGGGCCCCGA CCCACGGCTT CACCGCCCCG
GTCATCGGGC GTAACCGGGA ACGCCGCCGG CTGCAGGACG CCTTCGAGGC CGTGGTCGAG
GAACGCACCT GCCACCTGTT CACCGTGCTG GGCCCGGCGG GCATCGGCAA GTCCCGGATG
GTCGGCGAGT TCTGCGCGGC CATGGCGAAC CGGGCGACCG TGCTCACCGG GCGTTGCCTG
TCCTACGGCG AGGGCATCGC CTACTGGCCG CTGATGGAGA TGGTGCGCCA GGCCACTGGT
CTGTCCGCTG ACGAGAGCGC CCCGGACGGG CGGCGTCGGC TGCGGGAACT GCTCGACGGT
GTCACCCAGG CGGCGGAGAT CGTCGAGGCG CTCGCGCCCC TGGTGGGGCT GGGCGGCGCG
GAACTCGGTC CGCAGGAAAG CTTCTGGGCC GTCCGGTCCT TCTTCCAGGC GCTGGCGGCC
CGACGCCCGC TGATCCTGTG GTTCGACGAC GTGCACTGGG CCGAGCCGAC CCTGCTCGAC
CTGCTGGAGA ACATCGCCGA CTGGTCCCGC GACGCCCCCA TCATGCTGCT GTGCCTGGCC
CGCCCGGAAC TGCTGGAGGA GCGGCGGGAC TGGGGAGGCG GCAAGCTCAA CGCCACCTCG
ATGCTGCTGG CGCCGCTGAC CGAGGCGCGC TGCCAGCGCC TGATCCGCAC CCTGATGGGC
TCCGACGACC TCGACCCGGC CCTCGTCTCC CGGATCACTA CCTCCGCGGC CGGCAACCCA
CTGTTCGTCG AGCAGATGGT GGCGGCCCTC GTCGACGACG GCCTGCTGCG CCGGGAGGGC
AGCCGGTGGA TCGCCACCGG CAGCCTGCGC AACGTGACGG TGCCGCCGAC CATCTCGTCC
CTGCTCGCGG CCCGGCTCGA CCGGCTCGAC CCGCCCGAAC GGCGGGTCCT TGAACGCGCC
GCCATCGTGG GGGAACGCTT CTACCTCGAC GCTGTCATCG ACCTGTCGGA CCCGAGCGAA
CAGCCGATGG TCGCCGCCCA CTGTCTGAGC CTGGTCCGCA AGGAACTGGT GCATCCGGAC
CGGTCCGACC TGCCCGGCGT CGAGGCGTTT CGGTTCCTGC ACGTGCTGCT GCGCGACTGC
GCCTACCAGT CCACCGCCAA ACGGCAGCGC GCCGATCTGC ACCAGCGGTT CGCCCAGTGG
CTGCAGACCC GGCTGGAGGG TGGCCCCGGT GAGCACGACG AACTGGTCGG CTACCATCTG
GAGCAGGCGT TTCGTTACCG GGTGGAGATC GGGCAGCGCG ACACCGAGAC GATCGAGCTG
GGCCGATCCG CGGCGACCTG TCTGATCGAG GCGGCGGACC GGGTGCGCCA GGGCGACGAG
ACCGGCGCGG CCCAGCTGCT CAAGCGGGCC ATCACCCTGC TGCCCGAGAC GGACCCACTG
CGGCTACGGG CCGAGATCGA CCTCGGCTGG GCGCTGTACT CGTTCGGCCG GCTCTCCGAC
GCCGAGCGCA TCCTGCGCCA GGTGACCGAG CGGGCCCGGC GCGCCGGGGA GGAGGGCCTG
CGGGCCCACG CCCGGCTCGC GTACCTGCGG GTGCTGTTCT CCACCGATCC GGAGGGCCTG
GTCGCCAGGA CGCTGACCGA AGCCGCGGTG TGCCTGACCA ACTTCGTGCA GGCCGGCGAC
GAGGTCGGCG CGGCGCTCGC CTGCCGCAGC CAGGCCAATG CCTACCTGGC GGCCGGGCAG
TTCGCCGCGG CCGAGCGGGC GATGGAGAAC GCCGTCCGGC ACGCGGAGGA GTCCGGCGTG
CCCCGGGCCG CGCAGTCGCT GCGCCGCGAG CTCACCATGC TCATGAGTTG GGGGCCCCGG
CCGGTCGCCG TCGGGATCAC CCGGGCGAAC GAGGCCCTCG ACGCCGCCGG CGACGACCGG
GCTCTGCGTC GGGCCGTGCT CGCCCAGCTC GCGGTCCTCA CCGCCATGTC GGGGGATCTG
GAGGGCGCGC GGATCCATCT CAAGGCGACC GAGGAGATCG TTCACGACCT GCGCGCCCCG
CGGACCGACC CCTTTCATGG CGGGTTCGTC GTGGCCCGGG TGGCGCTGCT CGCCGACGAG
CTGGCAGTCG CCGAGCGGGA GCTGCGGCGA AGCTGCCGGC AGCTGTCCCG GATGGGGGAA
CGCGCGTTCC TGGCCAACCG GGCCGCGGCA CTCGCCGACG TTCTCGTCCG GCTCGGCAAG
GTTGACGACG CGGGCAAGTA TGTCACCCGG TGTCGGGACG CCGCCGCCGC CGACCAGCTA
CCGGCCCAGG CCGGATGGTG CGGGGTCCAC GCGAAGCTCC TGGCGCTGCG CGGCCGCGAT
GCCGAGGCCC TCCGGTTCGC CGACACCGCC GTCGATCTCG CCAGCCGCAC CGACGACGTC
GACGGCCAGG GACATGCGCT GCTGTCCCGC GCCGAGGTTC TCTACCGGGC CGGCCGCAAG
GACGACGCCG CCGAGAGCCT GGAGGCTGGC ATCACCCGGT ATCTGCACCG GGGCAACGTC
GCCGCGGCAA ACCTCGGCCG CCGGCTGTTC GACGCGCTCG ACGGTCCCCC GCCGGGAACC
GGAGGCCCCC CACCAGGGAC GGGAGACCCC TCAGCAGGGA CGGGAGACCC CTCGCCAGGA
ACTGGCTAG
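
The gene sequence is laid out in blocks of ten bases with line breaks, so it has to be cleaned before any computation. The following is a rough sketch of how the blocks could be joined and a GC percentage computed for comparison with the GC content field. The helper names clean_sequence and gc_percent are hypothetical, and the demo input is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a short demo input.
first_line = "ATGACCGCGC CAGGCTGGCC GCCGGCGGAT GAGCCGTCCG GGTCGCGCCG TCCCGGCCGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence block to clean_sequence should give a string whose length matches the Gene Length field, which is a quick check that nothing was lost when copying.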
 
Protein sequence
MTAPGWPPAD EPSGSRRPGR DPAVAPAWDG SPQARRVRAS PTASGRDRPT IRHMVPCPTC 
AEDNHERARF CSGCGNPLPT PGSGARKTVT IMFVDITGST DIGERIDSEP LQQVMWRFFT
TVREVIYIHG GSVEKFIGDA VFAVFGIPVL HEDDALRAVR AALDIRAAME ALNADLQREW
GLKLRVRIGI NTGEVTVAGG GVTGDPVNVA SRLEHAAPPD EILIGDTTYR FIRHSVTVSP
LGPLVVQGKR DPLRVHRLIG LVDPTAGPGS RAPTHGFTAP VIGRNRERRR LQDAFEAVVE
ERTCHLFTVL GPAGIGKSRM VGEFCAAMAN RATVLTGRCL SYGEGIAYWP LMEMVRQATG
LSADESAPDG RRRLRELLDG VTQAAEIVEA LAPLVGLGGA ELGPQESFWA VRSFFQALAA
RRPLILWFDD VHWAEPTLLD LLENIADWSR DAPIMLLCLA RPELLEERRD WGGGKLNATS
MLLAPLTEAR CQRLIRTLMG SDDLDPALVS RITTSAAGNP LFVEQMVAAL VDDGLLRREG
SRWIATGSLR NVTVPPTISS LLAARLDRLD PPERRVLERA AIVGERFYLD AVIDLSDPSE
QPMVAAHCLS LVRKELVHPD RSDLPGVEAF RFLHVLLRDC AYQSTAKRQR ADLHQRFAQW
LQTRLEGGPG EHDELVGYHL EQAFRYRVEI GQRDTETIEL GRSAATCLIE AADRVRQGDE
TGAAQLLKRA ITLLPETDPL RLRAEIDLGW ALYSFGRLSD AERILRQVTE RARRAGEEGL
RAHARLAYLR VLFSTDPEGL VARTLTEAAV CLTNFVQAGD EVGAALACRS QANAYLAAGQ
FAAAERAMEN AVRHAEESGV PRAAQSLRRE LTMLMSWGPR PVAVGITRAN EALDAAGDDR
ALRRAVLAQL AVLTAMSGDL EGARIHLKAT EEIVHDLRAP RTDPFHGGFV VARVALLADE
LAVAERELRR SCRQLSRMGE RAFLANRAAA LADVLVRLGK VDDAGKYVTR CRDAAAADQL
PAQAGWCGVH AKLLALRGRD AEALRFADTA VDLASRTDDV DGQGHALLSR AEVLYRAGRK
DDAAESLEAG ITRYLHRGNV AAANLGRRLF DALDGPPPGT GGPPPGTGDP SAGTGDPSPG
TG