Gene Francci3_1491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1491 
Symbol 
ID3903128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1785092 
End bp1786780 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content71% 
IMG OID637878828 
Productdiguanylate cyclase 
Protein accessionYP_480596 
Protein GI86740196 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.430715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACAAGA ACGAGGAAGG CCATCCACCA ATGCCGTTAC GCATACGACT GGCGCGGGGC 
GCGGGTTCCG TTCTGGTGGG CACTCTTCGG CGTGGCTGGC GGCGAGTCCG CGGGGTGACC
ACGCGGGGAG CGGGCACGAC TGCCCGGACC GGTCGCGGGG CGGCACGAGG CGAAGCGTCC
GCCCCACCCG CGAGCGCCGA ACCCGTCGGC CCCTGGATGA TCGGCCCTCC GCCGTTGGGC
CCTTATCTGG CCGGCCCTTA TCTGGCCGGC CCACGCCTGG CCGGCGCCCA CGTCACGGAA
CCTCGCCTGG CCGTGCTGCG CCTGCTCGGC CACGGCTGGA TGCTGTGGCG CGCGCCCGGG
GGGTTGATGC GGCGGGGGTT AATCGTAGTC TTCTGCTGGC TGAGCATGGC CGGGTTCGCG
GCGGCGGGTC TGCGTCCGCG ATGGACCGAC CTCGCGGCCT TCGCCACCTT CGTTGTTCTC
GGGGCGCTGG CCGTGGTGGC GTCCGGACGG CTGGGTGAGG ATCTGACCGT TCCCGGGGCC
CGACGTCATG ACCTGCTGTC CACCTGGACG GTGGCGGCGG CGGTCCTGCT GCCGCCGTTC
TACCCGGCGG TTGTCAGCCT GCCGCTGTGC TGGCTCGCCG GCCGTCGTGA TCCAGTACAG
CCGCCGCATC TGCGCGTGTT CCACGCCGCG GCGTTGGGCA TCGCGGGCTT CTGTGCCTCC
AGCGTCCATC TGCTGCTTAG CCCCGATCGA GGGCCGTTCA CCGTCGACAA CCTCGTCGGG
TCCGGCGTCG CGGTGCTGGC CTTCCTCGCC GCGGTTGCGG TTTACCCGGT TGTGGCGAGG
CTGACCGTGA CGGGGATGAG GCTGACCGTG ATGGGGATGG CGACGTCCGA TGGTCGGGAG
AGCCTCGATT CGTGCTCCGG TGTGGCGGCC GGTCCAGCCC GGCCGGGCTG GCCGGACCGG
CCGGACCGGA CCGGGGTCGT CCCGTCACCT GTCATCCCCG CCCGGACCCC GCTCCGGCAT
GCGTGCCGCG ACCTCGTGCC CGCGCAGGCC GCCGAGATCT GCTCGTCGAT CGTGGTCGCG
GTGCTGTGGG CGGCGAACCC GTTGCTCATG CTGGCGATCA CACCCCCCGT CCTGTTGCTG
CAACGCAGCC TGTTGCACGC CGAGCTCCTG CATGCCGCGC GGTCCGACGC GAAGACCCGG
CTGGCCAACC CGGCGTACTG GCGCCAGGTC GCGGAACGCG AGATCAACCG GGCCTGCCAG
GTGGGGCGTC CACTGTCGGT GCTCCTGGTC GACATCGACC ACTTCAAGCG GGTCAATGAC
AGGTTTGGTC ATCTCATCGG GGACGTCATT CTGATCGCGG TGGCGGACGC GCTGCGGGCG
GCGACCCGCC CCCGGGATCT CGTCGGACGG TTCGGGGGCG AGGAGTTCGT GGTGCTGCTC
ACCGAGGTCG AGCTGGAGAA CGCGGCCGAT GTCGCGGAAC GGATCCGCCG CCAAGTCGCC
GGAACCCACT GCCGGCTGGA GGGCCGGCCT CCCCTGTCGG TGACCGTGTC CGTGGGAGTC
GCGACACATC ACGGCCCCCG CGGCGACCTC GCCGGCCTCA TCGCCCGTGC GGACTCCGCG
CTCTACCGGG CCAAGGCCGA TGGTCGTAAC CGGGTACGGC TCGCCGGGCC GGTCTACAGC
TCCGTCTGA
 
Protein sequence
MDKNEEGHPP MPLRIRLARG AGSVLVGTLR RGWRRVRGVT TRGAGTTART GRGAARGEAS 
APPASAEPVG PWMIGPPPLG PYLAGPYLAG PRLAGAHVTE PRLAVLRLLG HGWMLWRAPG
GLMRRGLIVV FCWLSMAGFA AAGLRPRWTD LAAFATFVVL GALAVVASGR LGEDLTVPGA
RRHDLLSTWT VAAAVLLPPF YPAVVSLPLC WLAGRRDPVQ PPHLRVFHAA ALGIAGFCAS
SVHLLLSPDR GPFTVDNLVG SGVAVLAFLA AVAVYPVVAR LTVTGMRLTV MGMATSDGRE
SLDSCSGVAA GPARPGWPDR PDRTGVVPSP VIPARTPLRH ACRDLVPAQA AEICSSIVVA
VLWAANPLLM LAITPPVLLL QRSLLHAELL HAARSDAKTR LANPAYWRQV AEREINRACQ
VGRPLSVLLV DIDHFKRVND RFGHLIGDVI LIAVADALRA ATRPRDLVGR FGGEEFVVLL
TEVELENAAD VAERIRRQVA GTHCRLEGRP PLSVTVSVGV ATHHGPRGDL AGLIARADSA
LYRAKADGRN RVRLAGPVYS SV