Gene Franean1_2140 details

Gene Information

Locus tag: Franean1_2140
Symbol:
ID: 5670540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2568165
End bp: 2570354
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 73%
IMG OID: 641241061
Product: diguanylate cyclase/phosphodiesterase
Protein accession: YP_001506482
Protein GI: 158313974
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
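
The start, end, gene length, and protein length values in this record are mutually consistent. The following is a minimal sketch of that arithmetic in Python, assuming the coordinates are 1-based and inclusive (the record does not state this, but the reported length agrees with it) and that the final codon is a stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2568165, 2570354)
print(length)                  # 2190
print(protein_length(length))  # 729
```

Both results match the Gene Length and Protein Length fields above.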


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGGG CCGTCCGGCG TCTCACCACG CCGGTGGCCC GACGGTGCCA GGAATCACCC 
GGTTCGACCG CATACAGCGG TTCGGCAGTG CGTGGCGGCG CGGCTGCGCG CAGCAGTGCC
GTGGCGACGG CCGGCGGGGG GAAGACGGCG AGGGCGCAGG CCACGGTCGC GCACGCCGGA
GGGGAAACGT CGGATGCGCC GATGACCTCT CCTGCGATGG CGCACTCGCC GACCACCGGA
GGTTCTCCGA CGTTTCCGGA GCTGGAGGTC TCCAGCGCGG GGCGCTCGAG CGGCGGAGAC
CAGCACGGCC GGAATCGGCC CTGGGGCGGG CGGCAGCGCG CGGGCGGCCG TGTCGACGAG
ACCACGACAT CCGGTCGGCG GCCCGGCGTC CTCGCGTCTG CGTCCGTGGT TCTTCTCGTA
GGGGTCGTGT CCTGGTGGGC GGCGATCAAG GTGCTGGCCC CGTCCGGCGC GATCACCACA
TCGCTGACGG TGGCGGCGGT GGCGTGCCTG CCGCTGTCCG GACTGGGTGA CCGGATGGTC
GCGGCGGTCC GTCGCCGCTG GTACGGCTCG GACGATCTGA CTGGCCTGCC AGGTCGTCGC
CATTTCGTCG CGACCGCCTC CCGCTGGCTC GGGGGCGGTC ACCGAGCCCG GGGCTCGGTG
TCGGCGGCGC TCATCCTGAT CGATCTCGAC CGCCTCCGGG ACATCAACGG GACGCTCGGC
CACGAGCACG GTGACCACAT GCTGGCCACC GTCGGCGCCC GGCTGCGTTC CGTGCTGCGT
CCCGCCGACC TGCTCGCGCG GGTCGACGGC GACGAGTTCG CGGTACTGCT GCGGGACGTC
GACCTCGCCG GCGCCGAGGC GGTCGCCCGG CGGATCCGGG AGGCGCTGAG AATCCCTGTC
CGCCTGGATG ACCTGCGCGT CCAGGCCGAC GTCAGTGTCG GTATCGCCCA TGCTCCCGAA
CACGGGCGCG GCATCCTGGA GCTCATGCGG CGGGCGGAGG AGGCGATGTA CGCGGCCAAG
GGGACCCACA CCGGCCAGCG TGTCTACGAC CCTGCCTGCC AGCTCGGCAA CCGTGCCCAG
CTGGGGCTGC GGGCCGACCT GCGGGAGGCG CTGGACGGCG GCCAGATCGA ACTCCGCTAC
CAGCCCAAGG CCGAAATGCG CAGCGGCCGG ATCAGGGGTG TCGAGGCGCT GGTGCGATGG
CGCCATCCCA CCGGCGGGCT CCGTCCGCCG AACATGTTCC TGCCCGAGAT GGAACGCGCC
GGCCTGATGG GGCGCCTGAC CCAGCAGGTC CTCGACATCG CCCTCGCCGA CTGCGCCCGC
TGGCACGCGG CCGGCGCCGC GCTGGCCGTG TCGGTGAACG TGCCCGCCTC GGTCATCGTC
GACCGCGGAT TCGTGGACCT GGTGCGCGGT GCGCTGGAGC GCCACGGCCT GCCCGCGTCG
GCGCTGGTCG TCGAAGTCAC CGAGGACGGG CTCATCACGG TCCTGGAGCA GGCACAGCGG
ACCCTCTCCG GCCTGCGTGA CCACGGTGTG CGGGTCAGCC TCGACGACTA CGGCACAGGC
CTGTGCTCGC TCGCCTACCT GCGGGAGCTC CCCGCGGACG AGGTGAAGCT CGACCAGCGG
TTCCTGCGCG ACATCGACCG CGACTCCTCG GCGGCCGAGA TCGTCCGGTC CACGGTTTCC
CTCGCGCACG CGCTCCGGCT GCGCATCGTC GCCGAGGGCG TCGAGACGTC GCGCTCGTGG
GCGTCGCTTG CGGCCTGGCA GTGCGACGAG GTCCAGGGCT ACTTCGTCTC CCGCCCGCTG
GCGGGGGAGC GGGTGCTGAG CTGGCTGCGC GAATGGGGCG ACCGGCTCCG GTGGCTGCCC
TCCGGGGGGG AGCCTGCGCC GACGGGTCCG ATCAGGGTCA CCACCGCCTC GCGTGGCGCC
CGAGTGCACT CGGTGGCTAC CGCCGCGAAC GCCCAGCTCT CCTCGCCCGC AGCAGGAGCG
GCGTCCGCCG CGCCGGTCGC GACGGCGTCG ATGATGACTT CGGGTTCGGC GTGCGGCGCC
GCGGAGCCGC CCACGGCGGG CAAACCCTCG CTCGCCGTGC CGTCCAGTCG CCCGGGCCTG
CGGTCGATGG GATTCCGCAT GTCGCAGCAT GCTGAGGCGG GAAGCCGGCG GCCGTCCGGT
CAGCCTGCCC ATGATGGGTG GGGAGCCTGA
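
The GC content field in the gene information can be recomputed from the gene sequence. The following is a minimal Python sketch, assuming the sequence is supplied as text in the grouped layout used here (blocks of ten bases separated by spaces and line breaks). The function name gc_content is illustrative and not part of the source database.

```python
def gc_content(seq_text: str) -> float:
    """Return the G+C fraction of a nucleotide sequence, ignoring whitespace."""
    seq = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Demonstration on the first two blocks of the gene sequence only.
print(gc_content("ATGAGTCGGG CCGTCCGGCG"))
```

Passing the full 2190 bp sequence instead of the two-block demonstration gives the value to compare against the reported GC content.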
 
Protein sequence
MSRAVRRLTT PVARRCQESP GSTAYSGSAV RGGAAARSSA VATAGGGKTA RAQATVAHAG 
GETSDAPMTS PAMAHSPTTG GSPTFPELEV SSAGRSSGGD QHGRNRPWGG RQRAGGRVDE
TTTSGRRPGV LASASVVLLV GVVSWWAAIK VLAPSGAITT SLTVAAVACL PLSGLGDRMV
AAVRRRWYGS DDLTGLPGRR HFVATASRWL GGGHRARGSV SAALILIDLD RLRDINGTLG
HEHGDHMLAT VGARLRSVLR PADLLARVDG DEFAVLLRDV DLAGAEAVAR RIREALRIPV
RLDDLRVQAD VSVGIAHAPE HGRGILELMR RAEEAMYAAK GTHTGQRVYD PACQLGNRAQ
LGLRADLREA LDGGQIELRY QPKAEMRSGR IRGVEALVRW RHPTGGLRPP NMFLPEMERA
GLMGRLTQQV LDIALADCAR WHAAGAALAV SVNVPASVIV DRGFVDLVRG ALERHGLPAS
ALVVEVTEDG LITVLEQAQR TLSGLRDHGV RVSLDDYGTG LCSLAYLREL PADEVKLDQR
FLRDIDRDSS AAEIVRSTVS LAHALRLRIV AEGVETSRSW ASLAAWQCDE VQGYFVSRPL
AGERVLSWLR EWGDRLRWLP SGGEPAPTGP IRVTTASRGA RVHSVATAAN AQLSSPAAGA
ASAAPVATAS MMTSGSACGA AEPPTAGKPS LAVPSSRPGL RSMGFRMSQH AEAGSRRPSG
QPAHDGWGA
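
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, written under stated assumptions: the codon table is built from the standard amino acid assignments, which table 11 shares with table 1, and the special handling of alternative start codons in table 11 is not implemented (this gene begins with ATG, so it is not needed here). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGTCGGG CCGTCCGGCG TCTCACCACG"))  # MSRAVRRLTT
```

The output matches the first block of the protein sequence above.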