Gene Francci3_4063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4063 
Symbol 
ID3907024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4861356 
End bp4863140 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content68% 
IMG OID637881392 
Productdiguanylate cyclase 
Protein accessionYP_483142 
Protein GI86742742 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCTC CATCGTGGCG GCGCGCTTGG CGTAGCCACG TCTGGCTTGT CTATCTCTGT 
CTCGGTCTGA TCGCCATCCT CGTCTACGGC CAGTTTCCCA CGACCGGCGG CCCACGTGCC
GTCCGGCTGG TCATCTACCT CTGCCTCAGC GCCTCGGCCG CGGGGGCCGT GTTCCTCGGC
CTGCGACGCC AGGAGTTGGC CGACCGACGT CCCTGGCTGC TGATCGGAAT CAGCCAGGTG
ATGTACGCCA TAGGCGATGG TGCCTTCTAC ATCTTTCACT ATGTGCTTGA CATCACAGAG
TATCCGGCCT TGCCGGATGT GTTATATCTG GGCCACTATC CGCTGTTCGT CGTCGGTCTC
GTTCTGCTGA CCCGGCGACG GACCCAGGAA CGGGACATCG CCGGCCTGCT CGACGGATCG
ATGTTCCTGC TCGCGGCGGT TCTGTTGTCC TGGCTCTACC TGATCGCGCC GCAGGTGCGT
GCGGACCATG ATCTGTTCGT CGGCGTCACC TCGGTGGCCT ATCCGGCGAT GGATCTGGCG
CTGCTGGGGG TGGCCATCCG TCTCGTGCTG GGGCACGGGA GCCGCCCACC CGCCTTCTTC
CTGTTGGCGG GCAATCTGCT CGCCAACCTG ACCGCCGACA CCATCTACGT CGTGCAGCAG
ACGAACGGGA CCTTCGAGGT CGGCAACTAC CTCGACGCGA TCTGGTTGAC AGGCAACCTC
GCCCTCGGCG CCGCCGGGTT GCATCCGACC ATGGTCGACC TCACGCGGCC GGCGTCCCCG
CGTGCGCAGG CGCGCAGCGG CCGGCGCCTG GTGGCACTGT CGAGTGCCGG GCTGGTGGCT
CCGACCGTGC TGGCCGTGCA GGTGATCCGT GGTGATCTAC GGGATCTGCT GGTCGCCGTG
GCCGTAAGTG CGATCATGAT GATTCTGATC ATCGTTCGGA TGGCGGGCCT GGAGGCCGAC
CAGCGGCGTC TGGCGATCAC GGACGGGCTG ACCGGCCTGC GTACCCGCCG CTATCTGGAG
ACCGAGATGA GGTTGGCGGC CGGTCGGGTA CGACGCACCG GATCCGGAAT GGGGCTGATC
CTCGTTGACG TCGACCACTT CAAGTCGGTG AACGACCGAT TCGGCCATCC GGCTGGTGAT
CAGGTGTTGA CGGAGGTGGC TCGGCGACTG CTGGTCGTGA CCCGTCCCGG TGACGTCGTC
GCCCGTTACG GGGGTGAGGA GTTCGCGCTG CTCACGCTGA ATGTCCGGGG AGACGCCCTC
GCCGATATCG CCGAACGTCT GCGCCGCGGC GTCGGACGAG AACCCGTCCG GATCGTCCTG
CCCGCGCCGC GAGTTCCGGA TCCGCCGGCC CGCACGGGGG AGTCCGGGGC TGCGGGGTCT
GCGGAGCCGA CGAGGGCGCG CGGCCAGGCC CGGTCGGGTG CCGGTGTCCG GGGAGACACC
GGCGAGGAGT CGCTGCTGAT CGCTGTGACG GTCTCGGTCG GAGCGGCGGC CCTCCCCGAT
CACGCGGACG ACACCTTCGC GCTCGTCGCA GCCGCGGATC GCGCGCTGTA CGCCGCCAAG
GCCGCGGGAC GGGACCGGGT CGCGGTCGGC TCCGCCAGAC CGGGCCATGA CGTCGGTGGG
CATGACGTCG GTGGGCATGA CGTCGGTGGG CATGACGTCG GTGGGCATGA CGTCGGTGGG
CATGACGTCG GTGGGCATGA CGTCGGCGAG AGGGTTGGTT CGAGTAGACA GCCCCCAGCC
CCGGATGCCG TATCCGCCGT GGTGTCCCGG CGTACGCGAT GGTGA
 
Protein sequence
MPAPSWRRAW RSHVWLVYLC LGLIAILVYG QFPTTGGPRA VRLVIYLCLS ASAAGAVFLG 
LRRQELADRR PWLLIGISQV MYAIGDGAFY IFHYVLDITE YPALPDVLYL GHYPLFVVGL
VLLTRRRTQE RDIAGLLDGS MFLLAAVLLS WLYLIAPQVR ADHDLFVGVT SVAYPAMDLA
LLGVAIRLVL GHGSRPPAFF LLAGNLLANL TADTIYVVQQ TNGTFEVGNY LDAIWLTGNL
ALGAAGLHPT MVDLTRPASP RAQARSGRRL VALSSAGLVA PTVLAVQVIR GDLRDLLVAV
AVSAIMMILI IVRMAGLEAD QRRLAITDGL TGLRTRRYLE TEMRLAAGRV RRTGSGMGLI
LVDVDHFKSV NDRFGHPAGD QVLTEVARRL LVVTRPGDVV ARYGGEEFAL LTLNVRGDAL
ADIAERLRRG VGREPVRIVL PAPRVPDPPA RTGESGAAGS AEPTRARGQA RSGAGVRGDT
GEESLLIAVT VSVGAAALPD HADDTFALVA AADRALYAAK AAGRDRVAVG SARPGHDVGG
HDVGGHDVGG HDVGGHDVGG HDVGGHDVGE RVGSSRQPPA PDAVSAVVSR RTRW