Gene Francci3_4049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4049 
Symbol 
ID3907010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4834718 
End bp4838938 
Gene Length4221 bp 
Protein Length1406 aa 
Translation table11 
GC content68% 
IMG OID637881378 
ProductGAF sensor hybrid histidine kinase 
Protein accessionYP_483128 
Protein GI86742728 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCG CATACGACCC GCAACCCTCG GCTCGTCATG ATCTGGATGC ACCAGGCGGT 
GACGCTACCG CGACCATCGT GCCCCATGCA TCCCTTCCCG TGGACGACAC TCGGGCGTCG
CGGGCCGCCG CCGACGAGCT GATCGCGGAT CTCCTGCACG GGCTGACCCG CCTGTGCGAG
GGCGACTTCT CGACCCGCCT CGCGGCGCGG GACGGCTACG CCGGTGACGT GGTGCGCAAG
TTCGACGAGC TGGCCGCCAT CCAGGAGCGC CACGCGCGTG AGCTGGCCCG GGTCAGTAAG
GTGATCCGCC GTGACGGTCG GCTCACCCTG CGGATGGAGG ACCTGGGCAA CCCCGGCGGC
TGGTCCGACA TGACGCAGGC GGTGAACTCC CTCATCGACG ACCTGGCCCG GCCGACCCAC
GAGGTGGCGC GGGTCATAGC GGCGGTGGCC GAGGGGGACC TGTCCCAGCA CATGGCGTTG
GAGATCGCCG GCCAGCCGGT GCGCGGGGAG TTCCTGCGGA TCGGCACCAC GGTGAACACG
ATGGTGGACC AGTTGTCGTC GTTTGCGGAC GAGGTGACCC GGGTCGCCCG TGAGGTGGGC
ACCGAGGGCA ACCTGGGCGG GCAGGCCAAG GTCAAGGGCG TCTCGGGGGT GTGGCGGGAC
CTCACCGAGT CGGTGAACTC GATGGCGGGC AACCTGACCA GCCAGGTCCG TAACATCGCT
CAGGTCACCA CCGCGGTTGC GCAGGGTGAT CTGAGTCAGA AGATCACGGT GGACGCCCGC
GGTGAGATCC ACGAACTGAA GTCGACCGTC AACACGATGG TCGACCAGCT CTCCGCGTTC
GCCGACGAGG TCACCCGGAT GGCCAAGGAG GTCGGCACCG AGGGCAAGCT AGGCGGCCAG
GCGCAGGTCA AGGGCGTTTC CGGGGTCTGG CGCGACCTGA CCGACTCGGT CAACGTCATG
GCCGGCAACC TCACCACCCA GGTTCGCAGC ATCGCCGAGG TCGCGGCGGC CGTGGCCCGC
GGCGACCTCA CCCGGCAGAT CACCGTCGAC GCCCGGGGCG AGGTCGCGGG CCTCGCGCAC
ACCCTGAACA CGATGGTGGA CCAGTTGTCG TCGTTTGCGG ACGAGGTGAC CCGGGTCGCC
TGGGAGGTGG GCACGGAGGG CAACCTGGGC GGGCAGGCGC ACGTGCGGGC CGTGTCGGGG
GTGTGGCGGG ACCTGACGGA GTCGGTGAAC TCGATGGCGG GCAACCTGAC CAGTCAGGTC
CGCAACATCG CGCTCGTCGC CACCGCCGTC GCCCGCGGCG ACCTGTCGCA GAAGATCACC
GTGGCGGCCC AGGGTGAGAT CCTGGAGCTC AAGGACACCC TGAACACGAT GGTGGACCAG
TTGTCGTCGT TTGCGGACGA GGTGACCCGG GTCGCCCGCG AGGTGGGCAC CGAGGGCAAC
CTGGGCGGGC AGGCGCATGT GCGCGGGGTG TCGGGGGTGT GGCGGGATCT GACGGAGTCG
GTGAACTCGA TGGCGGGCAA CCTGACCAGT CAGGTCCGCA ACATCGCGCT GGTGACCACG
GCGGTGGCCC GCGGCGACCT GAGCCAGAAG ATCACCGTGA CCGCGCAGGG TGAGATCGCC
GAGCTGAAGG ACACCGTCAA CACGATGGTC GACCAGCTCT CCTCGTTCGC AGCGGAGATC
ACCCGGGTGG GCCGGGAGGT TGGCGTCGAG GGCAAGCTCG GCGGTCAGGC CACCGTAGCC
GGAGTCGCCG GAACGTGGAA GGACCTCACC GACAACGTCA ACCAGCTGGC GTCCACGCTG
ACCATCCAGC TGCGGGCGAT CGGCGACGTC TCCACCGCGG TGACCCGCGG TGACCTGACC
CGGTCCATCT CGGTGGAGGC CGAGGGCGAG GTCGCCGAGC TCAAGGACAA CATCAACCAG
ATGATCGCGC GACTGCGCGA GACCACCGAG GTCAACGCCC AGCAGGACTG GCTGAAGTCC
AACCTGGCCC GGATCGGCAG CAAGATGCAG GGCCAGCGCG ATCTCTACGC GGTCTGCCAG
ATGATCATCA GTGAGATGAC GCCAGCGGTC AACGCCCAGC AGGGCACGGT CTACCTGCTC
GACTTCATCG AGGGTGACAA GCTGCGCTAC GTCGCCGGCT ACGGCTCGGT GCCCCGGCGC
CGCTCGGACG GCACCTTCCT GTTCGGCGAG GGACTCATCG GGCAGGCGGC CCTGGAACGC
GACCGTATCC GAGTCGAGCA CGTACCAGCC GGCTACCTCA ACATCCGCAG CGGACTCGGT
GAGGCGCCGC CGTGCGACCT GGTCGTCGTG CCGGTCGTGT TCGAGAACCA GGTGCTCGGC
GTCATCGAGC TGGCCTCGTT CTCGCCGTTC TCCGAGCTGC ACCTCACCCT GGTCGACCAG
CTCGTCGACA CCATCGGGGT GGTGCTGAAC ACGATCATGG CGAACGCGCG GACGGAGGAG
CTGCTGGCCC AGTCCCAGCG GCTCACCCAG GAGCTGCGTT CGCAGTCGGT CGAGCTGCAG
CGGACGAATA ACGAGCTTGA GGAGAAGGCG GCGCTGCTCG AGGAGAAGAA CCACGAGATC
GAGCTGGCCC GCATCGGGTT GGAGGAGAAG GCCGAGCAGC TGGCGCTGTC GTCGCAGTAC
AAGTCGGAGT TCCTGGCGAA CATGAGCCAC GAGCTGCGCA CGCCGCTCAA CAGTCTGCTC
ATCCTCGCCA AGCTGCTGGC CGACAACCCG GACCGTAACC TCTCCCGCAA GCAGATCGAC
TTCGCCGAGA CGATCCACTC CGCCGGTTCC GAGCTGCTCG AATTGATCAA CGACATCCTG
GACCTGTCGA AGGTCGAGGC CGGCAAGATG AACGTCGATG CGACCACCGT GCGCACGGCG
GCGCTCTGCG ACGCGGTCGC CGGGGTCTTC GGCCCCGTGG CGGAGGAGAA GGGCCTGTCA
TTCCAGGTCA ACCTCGCTCC CGACGTGCCG GCCGAGTTCG TCACCGACGA GCAGCGCCTC
CAGCAGGTGC TGAAGAACCT CCTGTCGAAC GCGGTGAAAT TCACCGACAC CGGCACCGTC
CGGCTGGACG TCACCGTCGC CCGGCCGGAT CTGCCCTTCC TCTCGCCGAG CCTGTGTTCC
GCGGGCACGG TGTTGTCGTT CGCGGTCACC GACACCGGAA TCGGGGTCGC CGTCGAGAAG
CTTCGGATGA TCTTCGAGGC GTTCCAACAG GCGGACGGCA CGACGTCGCG TCGCTACGGT
GGCACCGGGC TCGGCCTGTC GATCAGCAAG GAGATCGCCC GCCTGCTGGG CGGGGCCATC
GCGGTCTCCA GCGTGGTCGG CCGTGGGAGC ACCTTCACCC TTTACCTGCC GTCGGCCCCG
CCGGCGCAGA CGCCGCCCTT TGGGGTCACG CCGGGCGAGC CAGACAACGT TCTCATGATC
GTCGATCCGT CGGGTCAGTA CCTCGGGCGG CGGTCGGGTG AGGGAGTGGA CGCGGAACCG
GGTGGCGGAC CCGCGCCGTC CGCCGGCCGC GATGCCGGGC CCGCGAACGG ATGGCAGAAC
CCCGACGCCG ACCGGTTCGG CACGGCGGCA CTCGCGCCCT CGTCCCCGTC TTCGTCTTCG
TCCCTGTCCC CGACCCGTTC CGGGCCCGGC GCGGGGGGAA CCGCCGGGCA CGGGGACGCG
GGTGGGATCG GCGGCGCCCA CTGGTCCGCC GCGGCCAGCG GACCGGCGGA GGGCTCGGCG
GCCTTCACGT CGGGCTGGCG GCCGAGCGAG CCCGTCACGG TGGGGACTCC GTTCCTCACC
GAGTCCACCG AGTCCACCGA GTCCGCCAGC CGCCCGCTGA CCGAGGGATC GGAAGGATCC
GATCCCCTGG TCGGCACCAC GGTGCTGGTC GTCGACGACG ACGTCCGCAA CGTCTTCGCG
CTCACCAGCG CGCTGGAGAT GTACGGAATG CGGGTCCTCT ACGCCGACAA CGGTCATGAT
GCGATCCGTA TGCTGCAGCA GGACACCCCG CCCGTGCACC TCGTGCTGAT GGATGTGATG
TTGCCGGGCA TGGACGGCAA CGAAACGACC TCGATGATCC GTGACATGCC GGCGTTCGCG
GACCTGCCGA TCCTGGTGCT GACCGCCAAG GCCATGCCGG GAGATCGGGA GAAGAGCATC
ACCGCCGGAG CCACCGACTA CATCACCAAG CCCGTGGATC TGGAGCACCT CCTCGGGGTG
ATGCGGTCGT ATCTGTATTG A
 
Protein sequence
MSPAYDPQPS ARHDLDAPGG DATATIVPHA SLPVDDTRAS RAAADELIAD LLHGLTRLCE 
GDFSTRLAAR DGYAGDVVRK FDELAAIQER HARELARVSK VIRRDGRLTL RMEDLGNPGG
WSDMTQAVNS LIDDLARPTH EVARVIAAVA EGDLSQHMAL EIAGQPVRGE FLRIGTTVNT
MVDQLSSFAD EVTRVAREVG TEGNLGGQAK VKGVSGVWRD LTESVNSMAG NLTSQVRNIA
QVTTAVAQGD LSQKITVDAR GEIHELKSTV NTMVDQLSAF ADEVTRMAKE VGTEGKLGGQ
AQVKGVSGVW RDLTDSVNVM AGNLTTQVRS IAEVAAAVAR GDLTRQITVD ARGEVAGLAH
TLNTMVDQLS SFADEVTRVA WEVGTEGNLG GQAHVRAVSG VWRDLTESVN SMAGNLTSQV
RNIALVATAV ARGDLSQKIT VAAQGEILEL KDTLNTMVDQ LSSFADEVTR VAREVGTEGN
LGGQAHVRGV SGVWRDLTES VNSMAGNLTS QVRNIALVTT AVARGDLSQK ITVTAQGEIA
ELKDTVNTMV DQLSSFAAEI TRVGREVGVE GKLGGQATVA GVAGTWKDLT DNVNQLASTL
TIQLRAIGDV STAVTRGDLT RSISVEAEGE VAELKDNINQ MIARLRETTE VNAQQDWLKS
NLARIGSKMQ GQRDLYAVCQ MIISEMTPAV NAQQGTVYLL DFIEGDKLRY VAGYGSVPRR
RSDGTFLFGE GLIGQAALER DRIRVEHVPA GYLNIRSGLG EAPPCDLVVV PVVFENQVLG
VIELASFSPF SELHLTLVDQ LVDTIGVVLN TIMANARTEE LLAQSQRLTQ ELRSQSVELQ
RTNNELEEKA ALLEEKNHEI ELARIGLEEK AEQLALSSQY KSEFLANMSH ELRTPLNSLL
ILAKLLADNP DRNLSRKQID FAETIHSAGS ELLELINDIL DLSKVEAGKM NVDATTVRTA
ALCDAVAGVF GPVAEEKGLS FQVNLAPDVP AEFVTDEQRL QQVLKNLLSN AVKFTDTGTV
RLDVTVARPD LPFLSPSLCS AGTVLSFAVT DTGIGVAVEK LRMIFEAFQQ ADGTTSRRYG
GTGLGLSISK EIARLLGGAI AVSSVVGRGS TFTLYLPSAP PAQTPPFGVT PGEPDNVLMI
VDPSGQYLGR RSGEGVDAEP GGGPAPSAGR DAGPANGWQN PDADRFGTAA LAPSSPSSSS
SLSPTRSGPG AGGTAGHGDA GGIGGAHWSA AASGPAEGSA AFTSGWRPSE PVTVGTPFLT
ESTESTESAS RPLTEGSEGS DPLVGTTVLV VDDDVRNVFA LTSALEMYGM RVLYADNGHD
AIRMLQQDTP PVHLVLMDVM LPGMDGNETT SMIRDMPAFA DLPILVLTAK AMPGDREKSI
TAGATDYITK PVDLEHLLGV MRSYLY