Gene Franean1_4759 details

Gene Information

Locus tag: Franean1_4759
Symbol:
ID: 5673101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5683830
End bp: 5685503
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 72%
IMG OID: 641243616
Product: diguanylate cyclase
Protein accession: YP_001509032
Protein GI: 158316524
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
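
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that the start and end positions are inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 5683830
end_bp = 5685503
gene_length_bp = 1674
protein_length_aa = 557

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein = codons - 1

print(span, span % 3, expected_protein)  # 1674 0 557
```

Under these assumptions the span equals the stated gene length, is a whole number of codons, and gives the stated protein length.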


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.805923
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCGAAC TCCGTGGCGT CACGGGCGCC ACACACGGTT TGCGGCGTTC GACCGTTTCG 
GGCCGTACGC CGCAGTTGTT GTGGTGCGCG GCCGTCGCCG CCGTGGTGCT TTTTGCGCTG
GAGGTCGCCC TCGCGGTGCT GTTGGGCCCG CGGCCGGGCG TCGTCGCCAC GATGTCGCTC
AACGCCGCCG CCCAGGTGGC CGCCGCGGCG GCCTGCTTCT GGACCGCTCG GCGCACCCGC
CCCGGTGACC GGCGCTGGCG GCGGCTCATC GGCGTCATGG CGGGCAGCTC GGCGCTCGCC
AGCGTCGCGA CGGCGGCAGC CGTGCTGGGT GGTGAATCTC CCCGGGTGTC CTCGGGGTGG
TACGCCGTCC TGCTCGTTTT CCACGGTGTG GCCCTGGCCG GGCTGCTGTC GCTGCCCACC
GACCCGGTGG ACGGCCGGGC GGGGAGGCAG GGCCGCGGCT CGTCCCGCTG GTACGTGATC
ACCATGCTGG ACTGTGTACT GATCGTCGGT TCGATCGTCC TGCTGGAATG GGGGACGGTG
CTCGCCGCGG TCGTCCGGAC AGGTGCGCCC AACCTCAGAG AGTTCCTGCT CTCCCTGGTC
CACGCGGTCT CCGTACTGAT CCTGGTCACG GCGGTGGTGC TGATCGCGAG CTTCCGCCGG
CCGCGCTCCC CGACGACGTT GGCACTGCTC GGCACAGGTC TGCTCGCCTA CGGTCTCACC
AGCAACGTCT TCGTCTACCG CGTCGCGCAG CACCAGCTCG ACATGCCGCC GTTGAGCGTG
GTCGCATTCG GCCTCGCCTT CCTGCTGGTC TTCCTTGCCG CGCTGCTGCC GTTCCCGGCT
CGCGCACCGC CGGACGGCCC GGCGCTGGAC GGCCCGGCGC CGGCCGGGCC GCAGGCGACG
TGGGCGTACG TGGTGCTGCC CTACGCGGTG CTCGGTGCGG CGGGCCTGCT GGTGGTCGGC
AAGCTGTCGA TCGGTACGCA GGTCGACCGG TTCGAGACGT ACGGCATCGC CGGGCTGCTG
CTGGTCGCGC TGGTCCGCCA GATGGTCACG CTGGTCGAGA ACGCCCGGCT ACTCACCGGA
ATCCGGGACC AGGAACGGGA GCTGCACCAC CAGGCCTTCC ACGACCCGCT GACCGGCCTG
GCGAACCGGG CCCTGTTCAC CCGCCGACTA CAGCGAGCAC TCACGCGCGC CACCGACGAC
GCCGGTGACG CCAGCGACGC CGGTGGCGGC ACGTCGCTGT CCGTGCTGTT TCTGGACCTT
GACGAGTTCA AGCACATCAA CGACACCTTC GGGCATGCCG CCGGGGACGA GCTCCTCCGG
ATCAGCGCGG AGCGGCTGCG GGCGGGAACA CGGACCGTGG ACACCGTCGC CCGGCTCGGC
GGCGACGAGT TCGCCGTCAT CCTCGACGGC GACGGACTCC GCAATCCCCG CCGTGTCGGG
GAGCGGCTCG CGGCAGCGAT ACAGGCACCG TGTCTGCTGG CGGGACGGCC TTACACCCCG
CGAGCCAGCC TCGGACTGGT CACCCTCGAC TCCCCCGCCC GCCCGGCCGA CCCCGACGTC
CTGCTCCACC AGGCCGACCT TGCGATGTAC GCGGCCAAAC GCGAACGGGC CGGCAGGCTG
AAGGTCTACC GGTCGGACAT GAGCCCTCCG ATCGCCGCAC CGCCGCCCGG CTGA
 
Protein sequence
MSELRGVTGA THGLRRSTVS GRTPQLLWCA AVAAVVLFAL EVALAVLLGP RPGVVATMSL 
NAAAQVAAAA ACFWTARRTR PGDRRWRRLI GVMAGSSALA SVATAAAVLG GESPRVSSGW
YAVLLVFHGV ALAGLLSLPT DPVDGRAGRQ GRGSSRWYVI TMLDCVLIVG SIVLLEWGTV
LAAVVRTGAP NLREFLLSLV HAVSVLILVT AVVLIASFRR PRSPTTLALL GTGLLAYGLT
SNVFVYRVAQ HQLDMPPLSV VAFGLAFLLV FLAALLPFPA RAPPDGPALD GPAPAGPQAT
WAYVVLPYAV LGAAGLLVVG KLSIGTQVDR FETYGIAGLL LVALVRQMVT LVENARLLTG
IRDQERELHH QAFHDPLTGL ANRALFTRRL QRALTRATDD AGDASDAGGG TSLSVLFLDL
DEFKHINDTF GHAAGDELLR ISAERLRAGT RTVDTVARLG GDEFAVILDG DGLRNPRRVG
ERLAAAIQAP CLLAGRPYTP RASLGLVTLD SPARPADPDV LLHQADLAMY AAKRERAGRL
KVYRSDMSPP IAAPPPG
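
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the tool that produced this record. It translates only the first and last lines of the gene sequence shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The function and variable names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid map in the conventional T, C, A, G ordering.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces are ignored, '*' marks a stop."""
    seq = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First and last lines of the gene sequence, copied from the record.
# Every full line holds 60 bases, so the last line also starts in frame.
first_line = "ATGTCCGAAC TCCGTGGCGT CACGGGCGCC ACACACGGTT TGCGGCGTTC GACCGTTTCG"
last_line = "AAGGTCTACC GGTCGGACAT GAGCCCTCCG ATCGCCGCAC CGCCGCCCGG CTGA"

print(translate(first_line))  # MSELRGVTGATHGLRRSTVS
print(translate(last_line))   # KVYRSDMSPPIAAPPPG*
```

The two outputs match the start and end of the protein sequence above, with the trailing `*` corresponding to the terminal TGA stop codon.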