Gene Francci3_0368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0368 
Symbol 
ID3905427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp428922 
End bp432944 
Gene Length4023 bp 
Protein Length1340 aa 
Translation table11 
GC content68% 
IMG OID637877697 
Productdiguanylate cyclase 
Protein accessionYP_479484 
Protein GI86739084 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0370806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0163356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGAG GTGGCCGAAT CGGTGGGCGG CAAGCGCCCG GGCGAGTCAC GGCCGCAATC 
AGAACCTCGG CGGCCCGGGC CGGATCGCGA CTCCGGCGGC GATTCATCCA CCCGCTGATC
GTCGGACTAC TCACGGTTGC GGTCCTCGGT ACCATGGCAA TCTGGACGGG CTCCGATCTG
CTGAGCCGGG AGAAGATCGA CCGCCTCAAC GAACGTTCCC AGCTGGTCAA GCGGCTCGCG
CAGTGGGCGG CGATCGTCGA TAACCCGGCG GCGATGCAGT CCGCGACCGA TCGGACGCCC
TTCCAGCCCG GTGACCCCCT GGGAAACGAA GCCCTGCTCC CGCAGCTCCA GATCGCTCAC
TCCGGCGATG TCATCCGGGT GGTTGCGCTG CTCGACACCG CGGGCCATAC CACGGCCTCC
TATCCACGGG GGAACTCCAT AGTCCCCGCC GACCTGGGGC AGGCCTGGTC CGCTGCCTGG
TCCGGGGTTC CCGCGGTGTC CCCCGTCTTC GCCCTCCACG ATGGCCGCGT CAGAGCAACG
GTGGTTCCCG TCGGCCGTCC GCGGCCCTGG GCCGTCCTGG TCGCCGTGTC CACCGAAGAG
TCCAACCTCC AGTTCACCAA CTGGATCACG GCCCTGCTCG GGGTCGGCCG GGGCAGCATG
TCGACGGTCG GCCCGGACGG GGTACCGATC GCATCCACCG AGCCGGGCAT GCTGGGTCGG
CAGGTCCTCT CGACCGCCGA TCTCTCCCGC TCCCACGCCC AGCGTGGTGG TGCGCGGATC
TGGGAGACGA CCGAGAACGG CAGGAGGATC ACGAACATCA CGGCGGTGCA GCCGACGACG
GGGTACCTGA CCTACTTCCG GCAGGCCACG GACTCGCTGA ACGCCGACCT CCGGGCCCGA
CGGAACCAAC GGAATCTGAC GTTGCTGGCC TTCGCGCTGA CGTCGGTCCT GAGCATCGTC
TTCGTCGGCC TGCTGCGAGA GACGGCGGCT CGGCGCTCCC GGGCCGGGCT GTGCGCGTTG
TTCGCGGCGG CCCATGACAT CGTGCTGACA ACCGACCTCA CTGGACGACT CACGTTCATC
AGCCCCGCGA TCGTCCCGTT ACTCGACCAT TCCCAGGAAA CCTGGCTCGG GCGCAAGGTC
GCGGATCTGG TGCACCCGGA CGACGCCCAC CGGCTCGTAC GATTGATCGA GAACCCCGCT
GCCGGGTCCC TGCTCAACAT TCGGATGACG GCGGCGAACG GCACATCCCG GTGGTTCGAC
GTCGCGACCC GCCACCTGTC CACCCGGGAT GGATCCGGCG AAGTGCTGAT CACCTGCCAT
GCGGTCGGTA AACGGAAGCA GTTGTTGGAC CAGCTCGGCT ATCAGGCGCG CCATGACGTA
CTCACCGGAC TGTGTAACCG GACCGCCTTC GAGGAGCAGC TAGAGGATGC CCTCGCCGCG
GCTGGGGGCG TCGCGGTCCT GTTCATCGAC CTCGACAGCT TCAAGCCGGT CAACGACACC
TTCGGCCACG CGGCCGGCGA CCGGGTGCTG CAGGTCATCA GTGGACGGAT CCGAACACTG
CTGGGCCCCG ACGATGTCGC CGGGAGATTC GGCGGCGACG AGTTCGGGGT CCTGCTGCTG
CGCGCGAACG AAATCGCCGC TCGCACCATG GCCGGTCGGG TCATCCGCGC GGTACGGGAA
CCGATCCTCG TCGACGGCTA CGAGGTCTGC GTCGCCGCAA GCATCGGAGT AGCGCTGGCC
GTCAGCCGGC CCAGCCATTC CGGGCGGCTG CTGCGCGCGG CCGACGAGGC CATGTACAAG
GCCAAGCAGG CTGGTCCGGG ACGATATGCC ATGGCGGACC CCCCAGCGGC CGAGACCACC
ACGCTCAAGG CGCCCGCACC GTCCGCGTCT GTGCTCCTCG CCACCGACCG GATCATGCCG
CGATCCCCGG TCACCCGACC GCGCTCGATG TCCTTCCCAG CCTTGCCCAG ATCCCCAGGC
CCGCCCAGAT CCCCAGGCGC ATCCGGATCG TCACCCGCGG GCCGGACCGG GCATCCGCTG
TCTCCTCGAC GCTGGCGGAC AACCGTGCGC GAGCAGCTGG GGCGAGCCAT CCCACTGCTC
GTGCTGGGGA CCGTCATCCT GGCCGCCACG GCGGTCACCC TGGAGATCGA GAATGCCAAC
CGGCGGCGAG ACGAGGCCCA ACACACCGCC GAGACCCACT CCCTCGTCCT CCGTCTGGCC
GAGTATGTCG CGGACCTGGG TCAACCCGTG CGCCTGATCG ACCCGATCTC CAGACTTCCC
TGGTCGCTGA CCGAACCCGC TGTCGACCAG CGGGTGCTGG CGTCGGTCTC CCGATCGGCG
CTCGCGGGCC CCGACACCGT TCTGGCACTG GTCGATCTCG ACGGCAGGCC GACGGCGGTC
CAACCCACCG GCGCAACGAT CCCGTTCCCC CCACGGGACC GGATCTGGGC TCTCGCCCGC
ACCAGCGGAA TCAACATCCC TGTTTTCCAC CTGGGCGATC GGGTTCGGGC GTACAGCATC
GTGCCGATCG TGCGGGACGG ACGCAGCGCC GCGTTTCTGG TCATGGGTAG ACCGGGAGCC
GGGATGCCGG GAGCCGAGAT GGTCCGCGTT TTTGGCGGCG GGGGTACCGG GACGAGCGGC
ACCGGAGAGG ACACCCGGAT CGTCATCGTC GACGAGGCCG GGCGCATCGC CCTGTCCGAC
GATCCGTCCG CGGTCGGGAT CGCCCTGATT GACGGCACGG AGCTGCGGTC AATCAAGCCA
GGCCAGAGCC GGCAGGTGAC GGTGCGCAAC GGCGGCGGTC ACGTCGCCGT CGCCGCCGCG
ATCCCCGGCC ATCCCACGTG GGGGTACCTC ATCCTGCAAC AGAACGGCCC GCTGCCGCGC
GACCCGCGAG ACAACCATGC CGTCGGCGAT CTGCTGCTGC TCGGCATTGT GACCGTCACG
TTCTCAGGCC TGACAAAAAT GATCTTCCGG GGAGAGCAGG CGGTCCGGCG CGACCGGGCG
CGGCTGCAGA CCCTGCTGCA CGAGTCGCAC GACATCATCG TCAACCTGGA CCGTGCGGGC
CGGCCGACCT TCATCAGTTC GGCGGTCGAA AGCCTGCTCG GCTACCCGGT CAAGGCCATG
ATCGGGCTGC CGCTCATCGA CCTGGTCCAT CCCGAGGACC AGGCTGCGAC GCGGGCCTTC
CTCGCGGACC GGCAGCGTGG TGGACCGGGG TCCCTGCTGG ACGTCCGCAT GCAGACGGTT
AACGGCGACC ATCGCTGGTT CGACATCGAG GCCGGCGCCT GGCGGCCCGC GTCCGGGCCG
GGCTGCTTCG ACGGTGGTGT CCTGCTCACC TGCCACGAGA TCAGCGAACG TCGTCAGTTG
CAGGAGCAGC TACGGAAACG TGCCACCCAC GACCCGTTGA CCGGCCTGCC GAACAGGGCG
GCGCTTACCG AATTCCTCGA TCAACTGGCT CGGGAGCAGA CTCCGTTCGC GGTTCTACTC
ATCGATCTGG ATGACTTCAA ACCGATCAAC GACACGTTCG GTCACCAGGT GGGCGACGAT
GTGCTGTGCA CGGTCGCCAC TCGGTTGGCC GACGTCCTCG CCCCGGGCCC GTCGGTGGCC
AGCGCGGGAC CGGTCGAGCC GATCGGTCCC GCGCTGATAG ATCCCGAGAC GATCGATCCC
GAGATGATCG ATCCCGAGAT GATCGATCCC GAGATGATCG GTCCCGAGAT GATCCGACCG
GAAGCACATG CTGGGCAGGC GTTCCGACTC GGCGGTGACG AGTTCGTCAT TGTGCTGCCC
GGGGCCGACC CCCTGACGAT GCGCCAGACC GAGCAGCGGG TGCGCGAGTT CGTCGAGGCA
CCCCTCACCG TGGCCGGCAA CACCCTTGTG GTCAAAGCAA CGATCGGCCT GGCGTCCTCG
CAGGCCGTCG GGGGCGCCGG CTCGCACAGT CCCGAGAGCG TGGTCCGGCA CGCCGATACG
GACATGTACG AGGCGAAGAC GAGCGCCCGG GCCCGACGCT CCATCTCCCC CGCGGCGCGA
TGA
 
Protein sequence
MIRGGRIGGR QAPGRVTAAI RTSAARAGSR LRRRFIHPLI VGLLTVAVLG TMAIWTGSDL 
LSREKIDRLN ERSQLVKRLA QWAAIVDNPA AMQSATDRTP FQPGDPLGNE ALLPQLQIAH
SGDVIRVVAL LDTAGHTTAS YPRGNSIVPA DLGQAWSAAW SGVPAVSPVF ALHDGRVRAT
VVPVGRPRPW AVLVAVSTEE SNLQFTNWIT ALLGVGRGSM STVGPDGVPI ASTEPGMLGR
QVLSTADLSR SHAQRGGARI WETTENGRRI TNITAVQPTT GYLTYFRQAT DSLNADLRAR
RNQRNLTLLA FALTSVLSIV FVGLLRETAA RRSRAGLCAL FAAAHDIVLT TDLTGRLTFI
SPAIVPLLDH SQETWLGRKV ADLVHPDDAH RLVRLIENPA AGSLLNIRMT AANGTSRWFD
VATRHLSTRD GSGEVLITCH AVGKRKQLLD QLGYQARHDV LTGLCNRTAF EEQLEDALAA
AGGVAVLFID LDSFKPVNDT FGHAAGDRVL QVISGRIRTL LGPDDVAGRF GGDEFGVLLL
RANEIAARTM AGRVIRAVRE PILVDGYEVC VAASIGVALA VSRPSHSGRL LRAADEAMYK
AKQAGPGRYA MADPPAAETT TLKAPAPSAS VLLATDRIMP RSPVTRPRSM SFPALPRSPG
PPRSPGASGS SPAGRTGHPL SPRRWRTTVR EQLGRAIPLL VLGTVILAAT AVTLEIENAN
RRRDEAQHTA ETHSLVLRLA EYVADLGQPV RLIDPISRLP WSLTEPAVDQ RVLASVSRSA
LAGPDTVLAL VDLDGRPTAV QPTGATIPFP PRDRIWALAR TSGINIPVFH LGDRVRAYSI
VPIVRDGRSA AFLVMGRPGA GMPGAEMVRV FGGGGTGTSG TGEDTRIVIV DEAGRIALSD
DPSAVGIALI DGTELRSIKP GQSRQVTVRN GGGHVAVAAA IPGHPTWGYL ILQQNGPLPR
DPRDNHAVGD LLLLGIVTVT FSGLTKMIFR GEQAVRRDRA RLQTLLHESH DIIVNLDRAG
RPTFISSAVE SLLGYPVKAM IGLPLIDLVH PEDQAATRAF LADRQRGGPG SLLDVRMQTV
NGDHRWFDIE AGAWRPASGP GCFDGGVLLT CHEISERRQL QEQLRKRATH DPLTGLPNRA
ALTEFLDQLA REQTPFAVLL IDLDDFKPIN DTFGHQVGDD VLCTVATRLA DVLAPGPSVA
SAGPVEPIGP ALIDPETIDP EMIDPEMIDP EMIGPEMIRP EAHAGQAFRL GGDEFVIVLP
GADPLTMRQT EQRVREFVEA PLTVAGNTLV VKATIGLASS QAVGGAGSHS PESVVRHADT
DMYEAKTSAR ARRSISPAAR