Gene Franean1_0666 details

Gene Information

Locus tag: Franean1_0666
Symbol:
ID: 5669083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 783662
End bp: 787864
Gene Length: 4203 bp
Protein Length: 1400 aa
Translation table: 11
GC content: 70%
IMG OID: 641239593
Product: GAF sensor hybrid histidine kinase
Protein accession: YP_001505031
Protein GI: 158312523
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
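
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(783662, 787864)
print(length)                     # 4203
print(protein_length_aa(length))  # 1400
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.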


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.982341
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCG ACGCGAGCCC CGACCGCCAG CAGTTCCCCG CGAGTGCCGG GGATTCCGGT 
GGCCCGCCCG CGGGCGCCAC GGCCACCCTG CCGGTGAACG GCACGGGCAC GGGCGCCGCC
ACAGGGCCCG GCGGTGGCAC GGACGCCCTG CTCTCCAGCC TGCTCACCGG GCTCGACCGG
CTCGTCGACG GCGAGTTCGG CGTCCGGCTG CCGATCGGTG ACGGCGTCGG CGGGGACGTC
GCCCGCCGGT TCAACGAGCT CGCCGGGATG CAGGAGCGGC ACGCCCGCGA GGTTTCCAGG
GTGAGCAAGG TAATTCGACG TGACGGTCGT CTCACCCTGC GAATGGACGA CCTTGGTGGC
AGTGGCGGCT GGGACGAGCT CACGCTCTCG GTGAACTCGC TCATCGACGA CCTGGCCCGG
CCCACCCACG AGGTCGCCAG GGTCATCGCC GCGGTGGCGG AGGGTGACCT CTCCCAGCAG
ATGGCGCTGG AGATCGCCGG GCAGCCGGTG CGTGGCGAGT TCCTCCGGAT CGGCACAACC
GTCAACACGA TGGTGGACCA GCTCTCGTCC TTCGCCGACG AGGTCACCCG GGTGGCGAAG
GAGGTCGGCA CCGAGGGCAA CCTCGGTGGC CAGGCCAAGG TCATGGGGGT CTCCGGGGTC
TGGCGGGACC TGACCGAGTC AGTGAACTCG ATGGCCGGCA ACCTGACCAG CCAGGTCCGC
AACATCGCCC AGGTGACTAC GGCGGTGGCG CAGGGTGACC TGTCGCAGAA GATCACGGTC
GACGCGCGCG GTGAGATCCA CGAGCTGAAG TCGACCGTCA ACACGATGGT GGACCAGCTC
TCCGCGTTCG CCGACGAGGT CACCCGAATG GCCAAGGAGG TCGGCACCGA GGGCAAGCTC
GGTGGCCAGG CCCAGGTCAA GGGTGTCTCC GGGGTGTGGC GTGATCTCAC CGACTCGGTG
AACGTCATGG CCGGCAACCT CACCACCCAG GTGCGCAGCA TCGCCGAGGT CGCCGCCGCG
GTGGCCCGCG GTGACCTGAC CCGCCAGATC ACCGTCGACG CCCGCGGCGA GGTGGCGGGC
CTCGCGCACA CCCTGAACAC GATGGTCGAC CAGCTCTCGT CGTTCGCCGA CGAGGTGACC
CGGGTCGCCT GGGAGGTCGG TACCGAGGGC AACCTGGGTG GTCAGGCGCA CGTTCGGGGG
GTCTCCGGGG TCTGGCGGGA CCTGACCGAG TCGGTGAACT CGATGGCCGG CAACCTGACC
AGCCAGGTCC GCAACATCGC CCTGGTCGCG ACGGCGGTGG CCCGTGGTGA CCTCTCGCAG
AAGATCACCG TCACCGCGCA GGGCGAGATC CTCGAGCTCA AGGAAACCCT GAACACGATG
GTCGACCAGC TCTCGTCGTT CGCCGACGAG GTGACCCGGG TGGCCAAGGA GGTCGGTACC
GAGGGCAACC TGGGTGGTCA GGCGCACGTT CGGGGGGTCT CCGGGGTCTG GCGGGACCTG
ACCGAGTCGG TGAACTCGAT GGCCGGCAAC CTGACCAGCC AGGTCCGCAA CATCTCCGCG
GTGACCAGGG CGGTGGCCCG CGGTGATCTC TCGCAGAAGA TCACGGTCAC CGCGCAGGGC
GAGATCGCCG AGCTGAAGGA CACCGTCAAC ACGATGGTCG ACCAGCTCTC GTCCTTCGCG
GCCGAGATCA CCCGGGTCGG CCGCGAGGTG GGCGTCGAGG GCAAGCTGGG CGGCCAGGCC
ACGGTCGCCG GTGTCGCGGG CACCTGGAAG GACCTGACCG ACAACGTCAA CCAGCTCGCG
TCGACGCTGA CGATCCAGCT CCGCGCGATC GGCGACGTGT CCACCGCGGT GACCCGCGGT
GACCTGACCC GCTCCATCAC GGTGGAGGCC GAGGGCGAGG TCGCCGAGCT CAAGGACAAC
ATCAACCAGA TGATCGCCCG GCTGCGGGAG ACCACCGAGG TCAACGCCCA GCAGGACTGG
CTCAAGTCGA ACCTGGCCCT CATCGGCAGC AAGATGCAGG GCCAGCGCGA CCTCTACACC
GTCTGCCAGA TGATCATCAG CGAGATGACG CCGGCAGTGA ACGCCCAGCA GGGCACGGTC
TACCTGCTCG ACTTCATCGA GGGCGACAAG CTGCGCTACG TCGCCGGCTA CGGCTCGGTG
CCGCGGCGCC GCTCGGACGG AACCTTCCTG TTCGGCGAGG GCCTCATCGG CCAGGCGGCG
CTGGAGAAGA AGCGCATCCG GGTCGAGGAC GTGCCCGCCG GCTACCTCAA CATCCGCAGC
GGCCTGGGCG AGGCGCCGCC GTGCGACCTG GTCGTCGTTC CGGTCATCTT CGAGAACCAG
GTTCTCGGTG TGATCGAACT GGCCTCTTTC ACCCCGTTCT CCGACCTGCA CCTCACGCTC
GTCGACCAGC TCGTCGACAC CATCGGCGTC GTCCTGAACA CGATCATGGC GAACGCCCGC
ACCGAGGAGC TGCTCGCCCA GTCGCAGCGG CTGACCCAGG AGCTGCGCTC GCAGTCCGTC
GAGCTCCAGC GCACGAACAA CGAGCTGGAG GAGAAGGCGG CGCTGCTCGA GGAGAAGAAC
CAGGAGATCG AGCTGGCCCG GATCGGGCTG GAGGAGAAGG CCGAGCAGCT CGCGCTGTCC
TCCCAGTACA AGTCGGAGTT CCTGGCGAAC ATGAGCCACG AGCTGCGCAC CCCGCTGAAC
AGCCTGCTCA TCCTGGCCAA GCTGCTGGCC GACAACCCCG ACCACAACCT GAGCCAGAAG
CAGATCGACT TCGCCGAGAC GATCCACTCC GCCGGCTCGG AGCTGCTCGG GCTGATCAAC
GACATCCTCG ACCTCTCCAA GGTCGAGGCC GGCAAGATGA ATGTCGACGC CGGCCCGGTG
CGCACCGCCG CGCTGTGCGA CGCGGTGGCG GGCGTGTTCG GGCCCACCGC CGAGGAGAAG
GGCCTCACCT TCGAGATCAG CGTGACCGAG GACGTGCCGG ACGAGTTCGT CACCGACGAG
CAGCGCATCC AGCAGGTGCT CAAGAACCTG CTGTCGAACG CGGTCAAGTT CACCGATGCC
GGCACCGTCC GGCTGGACGT CGCCGTCGCC TCGCCCGACA CCCCGTTCGT CGCGCCCAGC
CTGCGCTCGG CGCCGATGGT GCTGTCCTTC GCCGTCACCG ACACCGGCAT CGGGGTCGCG
GCCGAGAAAC TCAGGATGAT CTTCGAGGCA TTCCAGCAGG CGGACGGCAC GACCTCGCGC
CGCTACGGCG GTACCGGCCT GGGACTGTCG ATCAGCAAGG AGATCGCCCG CCTGCTGGGC
GGGTCGATCG CCGTCTCCAG TCAGATCGGG CAGGGCAGCA CGTTCACCCT GTTCGTCCCG
TCGGTGATGC CTCCGGAGGC ACCCGCGGGC CCCCACCCCG GCGAGCCCGA CGGTGTCCTC
ATGATCGTGG ACGGTGCCGT CCAGCCGCGG TCACGGACCG ACGGCGGGGA CGACGGCCGT
GGGCCGGTGC TGGCCGGCGT CGAGCGGGCC GCGCTCGTCC CGGCGGAGCG CAGGCCGGCG
GCGCCGGAGT CGCCCGGTGA CGCGGCGCAC CGGGAGAGCG CCGGCCCGGC TCCTGCCGGC
ACGTCCAACG GGGCGGGCAC CGGCAGGCAG CGTGACGGCG AACGGGTCTA CGGAGCGCCC
GGCCGGCCCG TCGAGCCCAA GGCGGCCCGC CGATTGGCGC GCACCTCGTG GTCGGGTCGC
GGCGGCACCG ACGCAGCCGG CGACCGGACG CCCGCCGAGG GCACCCCGTT CCTCACCGAC
CCACAGGCGC CCGCGCGGCC GCCGACGACG AACCTGGACG GCGGCGACCC GCTCGTGGGC
GCCAGCATCC TGGTGGTCGA CGACGACGTC CGCAACGTGT TCGCCCTGAC CAGCGCGCTG
GAGATGCACG GCATGCGGGT GCTCTACGCC GACAACGGCC ACGACGCCAT CCGGATGCTG
CAGCAGGACA ACGCGCCGGT GCACCTGGTG CTGATGGACG TCATGCTCCC GGGCATGGAC
GGCAACGAGA CCACGTCGGA GATCCGTGCG ATGCCGGCCT TCGCGGCGCT GCCGATCCTC
GTGCTGACCG CAAAGGCGAT GCCTGGTGAC CGGGAGAAGA GCATTTCCGC CGGTGCGAGC
GACTACATCA CCAAGCCGGT AGACCTTGAT CACCTCCTTG GAGTGATGCG GTCGTATCTG
TGA
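
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs to be cleaned before any computation. The sketch below shows one way to strip the whitespace and compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and the rounding used by the database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence (all lines) into one string.
example = "GGCC AATT"
print(gc_percent(example))  # 50.0
```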
 
Protein sequence
MSADASPDRQ QFPASAGDSG GPPAGATATL PVNGTGTGAA TGPGGGTDAL LSSLLTGLDR 
LVDGEFGVRL PIGDGVGGDV ARRFNELAGM QERHAREVSR VSKVIRRDGR LTLRMDDLGG
SGGWDELTLS VNSLIDDLAR PTHEVARVIA AVAEGDLSQQ MALEIAGQPV RGEFLRIGTT
VNTMVDQLSS FADEVTRVAK EVGTEGNLGG QAKVMGVSGV WRDLTESVNS MAGNLTSQVR
NIAQVTTAVA QGDLSQKITV DARGEIHELK STVNTMVDQL SAFADEVTRM AKEVGTEGKL
GGQAQVKGVS GVWRDLTDSV NVMAGNLTTQ VRSIAEVAAA VARGDLTRQI TVDARGEVAG
LAHTLNTMVD QLSSFADEVT RVAWEVGTEG NLGGQAHVRG VSGVWRDLTE SVNSMAGNLT
SQVRNIALVA TAVARGDLSQ KITVTAQGEI LELKETLNTM VDQLSSFADE VTRVAKEVGT
EGNLGGQAHV RGVSGVWRDL TESVNSMAGN LTSQVRNISA VTRAVARGDL SQKITVTAQG
EIAELKDTVN TMVDQLSSFA AEITRVGREV GVEGKLGGQA TVAGVAGTWK DLTDNVNQLA
STLTIQLRAI GDVSTAVTRG DLTRSITVEA EGEVAELKDN INQMIARLRE TTEVNAQQDW
LKSNLALIGS KMQGQRDLYT VCQMIISEMT PAVNAQQGTV YLLDFIEGDK LRYVAGYGSV
PRRRSDGTFL FGEGLIGQAA LEKKRIRVED VPAGYLNIRS GLGEAPPCDL VVVPVIFENQ
VLGVIELASF TPFSDLHLTL VDQLVDTIGV VLNTIMANAR TEELLAQSQR LTQELRSQSV
ELQRTNNELE EKAALLEEKN QEIELARIGL EEKAEQLALS SQYKSEFLAN MSHELRTPLN
SLLILAKLLA DNPDHNLSQK QIDFAETIHS AGSELLGLIN DILDLSKVEA GKMNVDAGPV
RTAALCDAVA GVFGPTAEEK GLTFEISVTE DVPDEFVTDE QRIQQVLKNL LSNAVKFTDA
GTVRLDVAVA SPDTPFVAPS LRSAPMVLSF AVTDTGIGVA AEKLRMIFEA FQQADGTTSR
RYGGTGLGLS ISKEIARLLG GSIAVSSQIG QGSTFTLFVP SVMPPEAPAG PHPGEPDGVL
MIVDGAVQPR SRTDGGDDGR GPVLAGVERA ALVPAERRPA APESPGDAAH RESAGPAPAG
TSNGAGTGRQ RDGERVYGAP GRPVEPKAAR RLARTSWSGR GGTDAAGDRT PAEGTPFLTD
PQAPARPPTT NLDGGDPLVG ASILVVDDDV RNVFALTSAL EMHGMRVLYA DNGHDAIRML
QQDNAPVHLV LMDVMLPGMD GNETTSEIRA MPAFAALPIL VLTAKAMPGD REKSISAGAS
DYITKPVDLD HLLGVMRSYL
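
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own method. It builds the codon table from the standard NCBI base ordering (T, C, A, G) and translates until the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. Alternative starts such as GTG or TTG are not handled, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGCCG ACGCGAGCCC CGACCGCCAG"))  # MSADASPDRQ
```

The output matches the first ten residues of the protein sequence listed above.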