Gene Franean1_0099 details

Gene Information

Locus tag: Franean1_0099
Symbol:
ID: 5668524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 117226
End bp: 118992
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 73%
IMG OID: 641239027
Product: diguanylate cyclase
Protein accession: YP_001504472
Protein GI: 158311964
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
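
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(117226, 118992)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed 1767 bp and 588 aa.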


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCACA GCCGTCGGCG CGACCGGCTC TTCCGCCGCG CGGCGGTCGG CGCCGCGGTG 
CTGACGGGCG CCTGCGTGGT TCTCGCCCAG GTGCTCCCCG ACAGCCTGGC GCTGTCCGCC
GTCCGACTGA CGGTGATCGG GTTCCTGCTG CTCGGCGGGA CGACCTGCGT GGTCGCCGGG
CTGCGTCGCC ATGGCGCCGA GCGGTGGTGG CGGCTTCTCA TGGCGGTAAT GGTTCTCGCC
GTGGCGGTGG CGTCGGCCGC GGTGTTCCGG GACACGGCGG CGGGGCGATC TCCCGTCCCC
CAGCTGACCC CGGCCTCGCT GGTGTATCTG ATTCCGCTCG CGATGGGTGT CGCCGGCGTG
CTGCTCTACC CGACCGACCC CGTCGAGCAC GACGACGCCG AGGACGGCGG GCCGCTGCAC
GCCTACCGCT GGTACGCGAT CACGGTCCTG GACGGCATGA TCGTGGTCGG CTCGGTGGCG
CTGCTGGTCT GGGCCACCGT GCTGGAGCGC ACGGTCGGGC ACGGCGAACC ACTCGGCCCC
GGCCCGCTCT ACTCGATCAT CCTCGCCGCG GTCTCGCTCG TGGTCTTCGT CGTGCTGATC
CTGGTGGCCG TGTTCCGCGA GCCGCGCGAC AGCCGCGGCC ACGCGCTCCT GCTCGCGGGC
ATGTGCGCCG CCTCGATCTC CGCCATGTGG GAGCTCGCGG TGCTCATCCA CGGCCTGGAC
GACGTCCCGC GGCTGACCGA CCTGCCCATT GGCATCGGTG CGCTGCTCAT CGGCCTGGCC
GCCATCTCCA CCGATCCCGA TACGGGTGCC ACTACCGGTG CCGACGCCGC TGCCGACGTC
GATGTCGATG CCGATGCCGA TGCCGTGGGA GTGGGACTGG CCGCCGTTCC CAGCCTGGGT
CGGCGGGCCG CCTCGGCCAG GCTCCGCCGG TGGCACGCGA TCCTGCCGTA CCTCCCGCTG
ACCGCGGCCG GGGCGGCGAC GGTGCTCCAG ATCGCCGGAG ACGGCATCGG GCACTGGGAG
GAGATCTGGG CGCTGCTCGC CCTGCTGCTA CTCGCGCTGG TGCGCCAGAT GATGACGATG
TCGGACAACA TCCGCCTGCT CGGCCAGGTG GAGGAGAAAC AGCGGCAGCT GCGGCACCAG
GCGTTCCACG ACCCGCTGAC CGGGCTGGCG AACCGCAGCC TGTTCATCGA CCGGCTCGAG
CGGGCGCTGC ACCGCCAGCC GGGCCCCGCC GAGCGCTTCG CCGTCCTGTT CTGCGACCTC
GACGACTTCA AGCGGGTCAA CGACGTCCTC GGCCACGCGG CGGGCGACGA CCTGCTGCGG
ATCACCGGCG CACGGCTCGC CGGCTGCGTC CGCGCGGCGG ACACCGTGGC CCGCCTCGGC
GGTGACGAGT TCGCGATCCT GCTCGTCTCG GCCAACATCG ACGATCCCGA GGCAGTCGGA
TGTCGGCTGG CGGCCGCGGT CCGGGCGCCG GTGCGGCTGG CGAGCCACAC CTTCACCGTC
GCGGCCAGCG TGGGCCTGGT GACCGTCGAC CCGGAGACCG GGACGGGTGC CCCGCACCAA
GGCGCGGACC CGGACCCGGA CCCGGACACG GACACGGAGC CGCGCGCGGA CGTTCCGGAC
ACCGCCGAGC AGCTGCTGCA CCGCGCCGAC CTGGCGATGT ACGCGGCCAA GGCCAGGCGC
AACGGGGAGC CGGCCGTCTA CACCCCTGAG CTGGTGGGCC CAGGGCGGGC GCGGGCACGG
CCCGCCCGGA ACGTCCCGCT GCCCTGA
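
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field of the record. The helper names are hypothetical, and the sketch assumes the input contains only A, C, G and T plus whitespace.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input, not the full gene.
print(gc_percent("ATGC GGCC"))
```

Passing the full gene sequence to `gc_percent` and rounding the result gives a figure that can be checked against the GC content listed under Gene Information.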
 
Protein sequence
MGHSRRRDRL FRRAAVGAAV LTGACVVLAQ VLPDSLALSA VRLTVIGFLL LGGTTCVVAG 
LRRHGAERWW RLLMAVMVLA VAVASAAVFR DTAAGRSPVP QLTPASLVYL IPLAMGVAGV
LLYPTDPVEH DDAEDGGPLH AYRWYAITVL DGMIVVGSVA LLVWATVLER TVGHGEPLGP
GPLYSIILAA VSLVVFVVLI LVAVFREPRD SRGHALLLAG MCAASISAMW ELAVLIHGLD
DVPRLTDLPI GIGALLIGLA AISTDPDTGA TTGADAAADV DVDADADAVG VGLAAVPSLG
RRAASARLRR WHAILPYLPL TAAGAATVLQ IAGDGIGHWE EIWALLALLL LALVRQMMTM
SDNIRLLGQV EEKQRQLRHQ AFHDPLTGLA NRSLFIDRLE RALHRQPGPA ERFAVLFCDL
DDFKRVNDVL GHAAGDDLLR ITGARLAGCV RAADTVARLG GDEFAILLVS ANIDDPEAVG
CRLAAAVRAP VRLASHTFTV AASVGLVTVD PETGTGAPHQ GADPDPDPDT DTEPRADVPD
TAEQLLHRAD LAMYAAKARR NGEPAVYTPE LVGPGRARAR PARNVPLP
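
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do this in plain Python, under the assumption that the codon assignments of translation table 11 match the standard code, so the two differ only in the permitted start codons. The function name and the table construction are illustrative. Alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGGGCACA GCCGTCGGCG CGACCGGCTC"))
```

For the first 30 bases of the gene this yields MGHSRRRDRL, which matches the first block of the protein sequence.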