Gene Franean1_3270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3270 
Symbol 
ID5671644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3872751 
End bp3874409 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content69% 
IMG OID641242162 
Productdiguanylate cyclase 
Protein accessionYP_001507582 
Protein GI158315074 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTAGAT TCCACGGTGC CATCGGCGCC GCGCCAGACC CGTGGCGTTC GACCAGTTCG 
GGCCGTATGC CACGGCTGTT TCTGTGTGCC GTTGTCGCCG CCTTGGTGCT GCTGGGGCTG
GATGTCGCCT TCGCTGTGCT ACTGGACCCG CGACCCAGCC GAATCGGTAC AACTTTCGTC
GACACTGCCA CCGAGCTCGC TGCCGCGGCT GCCTGTTTCT GGACCGCTCG GCGGATGCGC
GGGGCGGAGC GTCGCTGGCG GCGGCTGATC GGCTTCGCGG CCAGCGGTGC GATGCTCACT
GGCCTTGGGA CGGCGGCGAT GTTACTGGCC ACTGGCTCGG TTGGCCCGCC GCCGGTCTAT
GCTGTCCTCA TCCCGTGCTA CGGGCTGGCG CTGGCCGGGC TGCTCTCCCT GCCCACCGAC
CCGCTGGACG GCCGGGGCGG GAGCGCCGAG ACGGTGCGGC ACGGCGGGCT CCGTTGGTGC
GCGATCACCA TGCTGGATTG CCTGCTGCTC GTCGGTTCGA TCATCCTGCT GGAGTGGGTG
ACAGTGCTCA CCGCGCTCGT CCACGCGGGC GGGCACGACC CCGTACACTT CCTGTTCTCC
CTGGTCCACG CGGTCGCCGG TCTGATCCTG GCTACGGCGG TGCTGCTGAT CGCAAGCTTC
CGCCGGCCGC GCTCCCGCGG GACGTTGGCG TTACTTGGCA CCGGCCTGCT GACCTACGGG
CTCACCAACA ACATCCTCGT CTACCACACC GCGCAGGGCA GGATCTCCCT TCCACCGTGG
AGCCTGATCG GGTTTGCTCT GGCCTGGCTG CTGATACTCC TCGCCGCGCT CGTCCCGGTC
CCGGCTCACC CGCAGGCCGA CGGCCCGACG CCGGTTAGCC AGCGGGTACT GTGGGCGCAC
GCCCTGCTGC CCTACGTCGT GCTCGGCGCG GTAAGCCTGC TGGTGCTGGG CAAACTGACC
ACCGGGACGC GGCTCGACCG GTTCGAGACA TACGGCATGC TCGGCCTGCT GCTGGTGACG
CTTGTCCGGC AGATGGTCAC CTTGGCTGAG AACACCCGGC TGCTGGCTGC GGTGCGGGAA
CGGGAACAGC AGCTGCATTA CCAGGCGTTC CACGACCAGC TGACCGGCCT GCCGAACCGG
GCGTTGTTCG CCCGACGGCT AGAACAGGCA CTCGCACACA ACCCCGACAC CGACGACGTC
AGTGTCGGCA CCCCGCTGTC GGTGATGTTC CTGGATCTCG ACGAGTTCAA GGGGGTGAAC
GACGCATTCG GGCACGCCGC CGGGGATGAA CTCCTCAAGA TCAGCGCAGA GCGGTTGCGG
GCGGGGACGC GCGCCGCGGA CACCGTAGCT CGGCTCGGCG GTGACGAGTT CGCCGTCATC
CTCGACGGCG GCGGACCAGA CAATCCCCGC CAGATCGGCG AGCGGCTCGC CGCGGCAATC
CAGACCCCCT GCGTGCTCGC CGGACGGCCT TATACCCCGC GAGCCAGCCT TGGCCTGGTC
ATCCTGGACG GCTCCGCGGC TCAGCCGCCC GGTCCCGACA CCCTGCTGAA CCAGGCCGAC
CGGGCGATGT ACACAGCCAA GCGGGAACGG GCCGGCAGAC TGGTGATCTA CCGGCCAGAC
CTTGCCCCCA CCGGCCACCA CCGAACGACA CGAATGTGA
 
Protein sequence
MSRFHGAIGA APDPWRSTSS GRMPRLFLCA VVAALVLLGL DVAFAVLLDP RPSRIGTTFV 
DTATELAAAA ACFWTARRMR GAERRWRRLI GFAASGAMLT GLGTAAMLLA TGSVGPPPVY
AVLIPCYGLA LAGLLSLPTD PLDGRGGSAE TVRHGGLRWC AITMLDCLLL VGSIILLEWV
TVLTALVHAG GHDPVHFLFS LVHAVAGLIL ATAVLLIASF RRPRSRGTLA LLGTGLLTYG
LTNNILVYHT AQGRISLPPW SLIGFALAWL LILLAALVPV PAHPQADGPT PVSQRVLWAH
ALLPYVVLGA VSLLVLGKLT TGTRLDRFET YGMLGLLLVT LVRQMVTLAE NTRLLAAVRE
REQQLHYQAF HDQLTGLPNR ALFARRLEQA LAHNPDTDDV SVGTPLSVMF LDLDEFKGVN
DAFGHAAGDE LLKISAERLR AGTRAADTVA RLGGDEFAVI LDGGGPDNPR QIGERLAAAI
QTPCVLAGRP YTPRASLGLV ILDGSAAQPP GPDTLLNQAD RAMYTAKRER AGRLVIYRPD
LAPTGHHRTT RM