Gene Franean1_2936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2936 
Symbol 
ID5671322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3451359 
End bp3453044 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content68% 
IMG OID641241842 
Productdiguanylate cyclase with PAS/PAC sensor 
Protein accessionYP_001507262 
Protein GI158314754 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTTCA GAGAGAGCCT GGTCGACGAG CTACGGATCG GCGTCATCAT CACGGGCGCG 
CACAGCAGGA TCGTCAGGTA CGTCAACGCG ACTGCGTGCC AGATCCTGGG TCGCGAGCCG
GACGAGATGC TCGGCCGCTC CTGGCAAACC CTCGTCCCCC AGACGGACTA TGAGCTCTTC
ACCAGATACG AGCGCAGGGT CATCGGCCAG GGCCCCGGTG GCTCCGACAA GCGGTTCCCG
CCGTTCCGAA TGAGGTTCGC TCGGCCGGGA GGAGAGATCG TCCACGTTTG GGTTACCTCC
ACACTCGCCC ACAACTTCGT GGGGCTGTTC GGAGACGACG AGCCCCAGTT GGTGAGTCGC
CTGGAGGACG TGGCCGACCG TGAGCAGATC GCCAGCTATC TCGGGCTCGC CCTGGACAAC
AGCCCGATGT CGGTGGGTCT GGTTGACCCG TTCGGTCGCA TCGTGTTCGC TACACGCGGG
CGCTCCCCTC AAGAGACCGA GGACACGCTC AATGCCGAGG CAACATCTGT CTTCGAGGTC
TTCGCGGACT ACCAGGAACC GCTGTCGATG GTCGAGGCCG CCTTCAAAGG TGAGTCTGAC
TCGCTGGTCG TCCAGGCGTA CGGCCGCTAC TTCGACTTCC ATGTGGTTCC CATCACCGAC
GCGGCCGGCC AGGTCCGGCT GGCTGCCGCA CTCGCCACCG ATGTCACCGA ACGGGAACTC
GCCCGCGCCG GCCAAGCCCA GCTCGCCAGC CTCGCTGAGC AGGCGCTGGT TACGCTCGAA
CCGGTCGACC TCTGGCAACA CGCCACCACC GTGCTGGCCA ACCAGCTCGA CGCCGCGGCG
ACGCTGCACG AGATCGACGT CGCGGTGGCC ACCGCTCACG GCACGAAACT CACCGCCACC
GCCACCGCCA CCGCCGGCCC GCCCGTCCCG AGCGACGTCG CAGAGAACGT ACTGTCGACC
GTCATCCGCA CCGGACGCAC GACCCTCAGG ACAGCGGACC TGACGGAGGG CTGGCGGACC
CTGGCCGCAC CGATAGGCCG GCCCGGCGTC TGTAGGGCCG TGGTCGCCGT ACATCGCCCG
GGGCCGGACC GGGAGCCGTT CGGTGACCGG GACGAGGAGT TCCTCGTTGC GGTCGCGAGC
GTGCTGGGCT CGGCGGCGGC GCGGTTCGCC GCCGAACAGG AGATCCGCTA CCGCTCGACG
CACGACACCC TCACCGACCT GCCCAACCGC TCATGGCTGC TCGAGCGGCT CGCCCGCAGC
CTCAAGCTGC ACCGCACAGG CGTGGTCTTC ATCGATCTCG ACGGCTTCAA GACCGTCAAC
GACACCTACG GCCACCAGGC CGGCGACCGG CTGCTGTGCG AGATCGCCCG CCGGCTCGAG
GCGGCGGTGC GGCCAGACGA TCTGGTCGCC CGCCTCGCCG GGGACGAGTT CGCCGTGCTC
TGCGAGCGCG TCGACTCGCC TGCGGCCGTC GAACGGCTCG CCCGGCGAGT GCTCGCAGAG
ATCGAGGAGC CTGTGGTACT TGCGGAGGGA ACCGTACGCG TCTCGGCAAG CGCCGGAGTG
GCGATCTCCC GCGCGGACCT CTCCGATCCG GACCGCCTGC TCAACGCCTC CGACATCGCG
ATGTACGCAG CGAAGCGCGC CGGAGCTGGA CAGTGCATCG TCCACGAGTC CTGGATGCAC
CTCTGA
 
Protein sequence
MDFRESLVDE LRIGVIITGA HSRIVRYVNA TACQILGREP DEMLGRSWQT LVPQTDYELF 
TRYERRVIGQ GPGGSDKRFP PFRMRFARPG GEIVHVWVTS TLAHNFVGLF GDDEPQLVSR
LEDVADREQI ASYLGLALDN SPMSVGLVDP FGRIVFATRG RSPQETEDTL NAEATSVFEV
FADYQEPLSM VEAAFKGESD SLVVQAYGRY FDFHVVPITD AAGQVRLAAA LATDVTEREL
ARAGQAQLAS LAEQALVTLE PVDLWQHATT VLANQLDAAA TLHEIDVAVA TAHGTKLTAT
ATATAGPPVP SDVAENVLST VIRTGRTTLR TADLTEGWRT LAAPIGRPGV CRAVVAVHRP
GPDREPFGDR DEEFLVAVAS VLGSAAARFA AEQEIRYRST HDTLTDLPNR SWLLERLARS
LKLHRTGVVF IDLDGFKTVN DTYGHQAGDR LLCEIARRLE AAVRPDDLVA RLAGDEFAVL
CERVDSPAAV ERLARRVLAE IEEPVVLAEG TVRVSASAGV AISRADLSDP DRLLNASDIA
MYAAKRAGAG QCIVHESWMH L