Gene Franean1_7031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7031 
Symbol 
ID5675342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8575299 
End bp8579282 
Gene Length3984 bp 
Protein Length1327 aa 
Translation table11 
GC content68% 
IMG OID641245877 
Productdiguanylate cyclase with PAS/PAC sensor 
Protein accessionYP_001511268 
Protein GI158318760 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGG CCACTCGGAT CTTTCCGGTC CGGCGCGGAT CTGAAGATCG CAGGCCAAAG 
GATCGTAGAT TTGGCGATCC TCCTATATCG GGCGCGCTCG GCGCATCCGA CCTGCGTGAT
CTCACCCGGC GCGGCCGGCT TCGGCGAGCC ATGCCGCTGC TGGTCGGCCT GGCAGCCTTT
CTTCTTCTCG TCGCGGTCAT CTTCGCCATC AGCAGTGAAA GCCTGGCGAC CTCCGAGCGG
GAGCGGCTGA GTAACCGGGC ACATCTGACC GAACAGATGG CGGCCTACAC GTCGCGTGCC
GTCGACCCTC AGCTTCAGCA GTCCCGCGTC GACGCGGCCG GATTCTCCCT GACGGATCAA
GCGCGCAACG CGCACTTGTT GACATCGTTT CGGATCAATC CTGCAAGTGA CCCTAACTTC
GTCGTCGCTC TGTTGGACCG CGACGGGCAT CCGCTGGTGT CCCGACCGGT GAGTGCCACG
ATTCCAGTGA GCGCCCTCGG TGACGCCTGG CCGACTGCGC TTGCCGGGCA GGCCGCGAGG
TCGCCGGTGT TCGTCATCGA TGGTCTTGTC GGCCGGGCGA CAGTGGTGCC GGTCGGCGAA
GGACGTCCGT GGGCCGCCCT TGTTACCGTC GCTACGGACG ATCTGGCTCA ACGCTTCGAC
GAGCAGCTCG GTTCGCTCGA GCTCGGTCCC GGGGGCCTCA CCACTGTTGA CTCGGCCGGG
GTCGCGGCAT CCTCCTGGGA TCCGGCGCTG GTCGGCCGGC GCGTCATCGA CCCGGAAATG
CTGCGCGGGA TTCCGCCGCT GCCGGCGCGG GTGTGGACGT CCGGGCATGG CTCCGACGAG
GTGACGAACA TCGCCGCCCA GCAGGCGAGC AGCGGATACA TCAACCTGTT TCAGCAGCGA
ACCGACGCGT TCTACTCCGA CCTGCGTGCA CAGCAGTGGT CCCGGAACCT GACCGTGATC
GCCGTGGGAG CGGTGGCGAT GCTGGGGCTG GTCGTCTTCG GCCTGCGCCG GGAGGTGAGC
GCCCGCCGTG CTGGTAAGCG GCTCGACGAG CTGTTGCGCA ACACCCGGGA CCTGATCGCC
GTCGTCCGTC TGGACGGTGT GGTGAGCTTC CTGAGCCCGT CGATCGAAAG GCTGCTCGGC
AGCCGTGCCT CGGACTGGCT CGGTCGACCG CTGGCACAGC TCGTCCATCC GGCCGACCGG
GAGCGCCTCG ACCGATTGCT GACCGATGGC GGCGCGGTCC TCGACGTCCG ACTGTGTGGC
GCCGACGGCG CCATCCACTG GTTCGACCTG GAGGCCGCCG ACCTTCCTCC GCGGGCCGGC
CTGGCCGGTG TGCTGGTCAC CTGCCATGAG ATCGGGGAAC GCAAGAAGCT GCAGGACCGG
CTTGGTCATC AGGCACGCCA CGACTTGCTG ACCGGTCTGG CCAACCGGTC CGGGTTCGGT
GATCGGCTTG CCAGCGCGCT CGCCGTGAGC AGCGAGTCGA GCTGCGCCGT CCTTTATCTG
GACCTTGACG GGTTCAAGCC CATCAACGAC TCGCTCGGAC ACGCGGCCGG AGACCGGGTG
CTGGCGATCA TCGGCGCGCG GTTGGCGGCG CTGTTGCGTG AAGGCGACCT TGCCAGCCGG
CTCGGCGGTG ACGAGTTCGC GATGCTGCTT CGCGACAGCG ACCTCGCGAC TGCGAGTCTG
GTGGCGGATC GGGTGATCAG AGCCGTCGCC GAAAAGATCA TTCTTGAGGG CGAGTGCGTG
CAGGTAGGCG CGAGCGTGGG CATCGCGATG GGCAGACCAG GCCGCAGCCG TGCTGACGAA
CTGATCCACG CCGCGGACAC GGCGATGTAC GCCGCCAAGC GCGATGGCGG GAGTCGGGCC
GCCATCGCGC TGCCCACCCC GTCCACAGCC GCCGACGCGG ACGATATTCT TGGCCCGCGC
GACCTTGGCA GGCCCTCGGC CACCATTGCC GCCGTCGACG TAACAAACGT CGCGGACGAT
CACCCGGGTG CGCCGCAGCT GGCAAACCCA GCAAGCGCGA CCTCGGCAAG CCCGGCGAAA
CTGGTGGAAC CGGCGGGATC GGCGGACCTG GAGCCGGGAA GACCGCTCGG TTCGGCGGTA
CGAAAACCAC CGTCCGCGCG CCCATGGTGG TGGTCCCGTC GCTCCGAGGC CGTCCGCTCC
ACACACCGTG GCGAGGCCGC AGCCCCTCAA CGGCGCGCCC CCCAGATATG TTTCGGCCGT
ATCGTGCCGC TGTTGGTCGC CGCGCTGACC TTGCTCGGTA TCGCGTCGTT TGGTCTGTGG
CAGGAGTCCC AGGCGCTACG GGCAGCGGAA GCAAGCGCGA TGCGCGAACG CCTCGCGCAG
ACCGCACGCA TGGCCGACTA CATGTCGGCC ATGCAGGACC CGCAGCCATT GGTCTCGGCC
GCGTCCGCCG AGTCATGGTC GCCGACGGAT CTGACACACA ACCAGACCAT CCTGCAACGG
TTCGCCCAGT CGCCCGAGGG CGGACCGGAC ACCCTGGCCT CGCTGGTCGG CCTCGACGGA
CGAACGATCG CGGCCGAGCC GGCCGCTGCC CCGCCGATGC CGATCGCTAC GGACAGCCCG
CAGTGGCAGG CGGTGCTGGC CGGGCGGGCG ACCACCTCCC AGGTGGTCAT GGTGGATGGA
GTCGCGCGGC TGTACACCCT GGCGCCGGTA CTCGAGGACG GGCGCCCGGT CGCCGTCCTC
ACCCTCGGCC GCTCGGCGCG AGACGCCCTG CCCAGCTTCT TCCTCGCCGC GCTCGGCGGT
ATGGGGCTGC GCGACGGCGG CGGCGCCGTC GTCGACCGCT CCGGCCGGGC ATCCTTCTCC
TGGAACCCCG GCCTGGCCGG CCGGCCGCTG GTGAACCCGG CCGACCTGGC GGACCTGCGG
ATGGGGAAGG CCGAGCGGGT AACCCTCGTC GATGACTCGA ACGAGATTAT GCTCGCCGCG
CCGATCTGGT CCCTTGACGG TGCGTACTTC GTCTTCCAGC AACCCGCCAC GGCCCTGTTC
GCCGGGCTTC GGGAGGGCCA GGGCCTGCTT GCGGTAAGCC TGTTCGGCAC CATCGGCATC
GCTATCCTCG GCATCGCCGT CTCGAATGCC CGCCGGGAGA ACGCGCTGCG CCGCGAGGAA
TCCCAGCTGC ACGCGCTGCT ACACAATGCC CACGACATCG TCATCGTTGC CGATCGTGAC
TGTCGGATAA CTTTTGCCAG CTCCGACGCG ACGGGTCTAC TCGGCCAGCC CACGGCGGGC
TGGATCGGAA AAATCTTCCC TGACCTCAGC CATCCCGAGG ACGCCAGCCG CCTCCGGCAG
TTCCTCGGTG CCCGCGACCG CGAAGGGCAG GGCGCGATCC GCGAGATCCG GCTACGCGCC
TGCGATGGCT ATCGGTGGTT CGATATCCAC GCCTCGGAGC CGCACCACAC ACCGGTCCTG
TCCGGCATCG TCCTGACTTG TCATGAGATC GGGGGACGCC GACAGATGCA GGCCGAACTC
GCCTATCGGG CCAGCCACGA CCCTCTCACA GGCCTGCCCA ATCGGGCCGC GTTCGACCGC
CATCTGGCGG CTGTGGCCGA ACACGAGCAG GCCGGAACGA GCTTCGCCGT GCTTTTCATC
GACCTGGACA ACTTCAAGCC CGTCAACGAC AGGTTCGGCC ACGCTGTCGG CGACGACGTC
CTGCGCGTCA TTGGCGCCCG CATCGGGGCC GCATCCCGGA AGACGGACAT GGTCAGCCGC
CTCGGCGGAG ATGAGTTCGC GGTCCTCGTC GACGAGGCCG ACGACGATCT CGTCTGCGCC
ATCTCCGCGC GCATCCTCGA AACCAGCAGG CAGCCCATCG CGCTGGCCGG GACCACCATC
GCCCTCGGCG CCACTATCGG GATCGCGCTG TGCCGCGCCG GCATCGACGA CCCCGAAACC
GCCCTGCGCA ATGCCGATAA TGCTATGTAT CGGGCCAAAC GGGCGGGTCG TGACCGCTAC
GCCCTCTTCA CCGACCGGCC CTGA
 
Protein sequence
MTMATRIFPV RRGSEDRRPK DRRFGDPPIS GALGASDLRD LTRRGRLRRA MPLLVGLAAF 
LLLVAVIFAI SSESLATSER ERLSNRAHLT EQMAAYTSRA VDPQLQQSRV DAAGFSLTDQ
ARNAHLLTSF RINPASDPNF VVALLDRDGH PLVSRPVSAT IPVSALGDAW PTALAGQAAR
SPVFVIDGLV GRATVVPVGE GRPWAALVTV ATDDLAQRFD EQLGSLELGP GGLTTVDSAG
VAASSWDPAL VGRRVIDPEM LRGIPPLPAR VWTSGHGSDE VTNIAAQQAS SGYINLFQQR
TDAFYSDLRA QQWSRNLTVI AVGAVAMLGL VVFGLRREVS ARRAGKRLDE LLRNTRDLIA
VVRLDGVVSF LSPSIERLLG SRASDWLGRP LAQLVHPADR ERLDRLLTDG GAVLDVRLCG
ADGAIHWFDL EAADLPPRAG LAGVLVTCHE IGERKKLQDR LGHQARHDLL TGLANRSGFG
DRLASALAVS SESSCAVLYL DLDGFKPIND SLGHAAGDRV LAIIGARLAA LLREGDLASR
LGGDEFAMLL RDSDLATASL VADRVIRAVA EKIILEGECV QVGASVGIAM GRPGRSRADE
LIHAADTAMY AAKRDGGSRA AIALPTPSTA ADADDILGPR DLGRPSATIA AVDVTNVADD
HPGAPQLANP ASATSASPAK LVEPAGSADL EPGRPLGSAV RKPPSARPWW WSRRSEAVRS
THRGEAAAPQ RRAPQICFGR IVPLLVAALT LLGIASFGLW QESQALRAAE ASAMRERLAQ
TARMADYMSA MQDPQPLVSA ASAESWSPTD LTHNQTILQR FAQSPEGGPD TLASLVGLDG
RTIAAEPAAA PPMPIATDSP QWQAVLAGRA TTSQVVMVDG VARLYTLAPV LEDGRPVAVL
TLGRSARDAL PSFFLAALGG MGLRDGGGAV VDRSGRASFS WNPGLAGRPL VNPADLADLR
MGKAERVTLV DDSNEIMLAA PIWSLDGAYF VFQQPATALF AGLREGQGLL AVSLFGTIGI
AILGIAVSNA RRENALRREE SQLHALLHNA HDIVIVADRD CRITFASSDA TGLLGQPTAG
WIGKIFPDLS HPEDASRLRQ FLGARDREGQ GAIREIRLRA CDGYRWFDIH ASEPHHTPVL
SGIVLTCHEI GGRRQMQAEL AYRASHDPLT GLPNRAAFDR HLAAVAEHEQ AGTSFAVLFI
DLDNFKPVND RFGHAVGDDV LRVIGARIGA ASRKTDMVSR LGGDEFAVLV DEADDDLVCA
ISARILETSR QPIALAGTTI ALGATIGIAL CRAGIDDPET ALRNADNAMY RAKRAGRDRY
ALFTDRP