Gene Information |
Locus tag | Francci3_4049 |
Symbol | |
ID | 3907010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4834718 |
End bp | 4838938 |
Gene Length | 4221 bp |
Protein Length | 1406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637881378 |
Product | GAF sensor hybrid histidine kinase |
Protein accession | YP_483128 |
Protein GI | 86742728 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
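The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. Neither assumption is stated in the record, although both are consistent with its values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4834718, 4838938)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (4221 bp) and Protein Length (1406 aa) fields listed in the record.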
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCCG CATACGACCC GCAACCCTCG GCTCGTCATG ATCTGGATGC ACCAGGCGGT GACGCTACCG CGACCATCGT GCCCCATGCA TCCCTTCCCG TGGACGACAC TCGGGCGTCG CGGGCCGCCG CCGACGAGCT GATCGCGGAT CTCCTGCACG GGCTGACCCG CCTGTGCGAG GGCGACTTCT CGACCCGCCT CGCGGCGCGG GACGGCTACG CCGGTGACGT GGTGCGCAAG TTCGACGAGC TGGCCGCCAT CCAGGAGCGC CACGCGCGTG AGCTGGCCCG GGTCAGTAAG GTGATCCGCC GTGACGGTCG GCTCACCCTG CGGATGGAGG ACCTGGGCAA CCCCGGCGGC TGGTCCGACA TGACGCAGGC GGTGAACTCC CTCATCGACG ACCTGGCCCG GCCGACCCAC GAGGTGGCGC GGGTCATAGC GGCGGTGGCC GAGGGGGACC TGTCCCAGCA CATGGCGTTG GAGATCGCCG GCCAGCCGGT GCGCGGGGAG TTCCTGCGGA TCGGCACCAC GGTGAACACG ATGGTGGACC AGTTGTCGTC GTTTGCGGAC GAGGTGACCC GGGTCGCCCG TGAGGTGGGC ACCGAGGGCA ACCTGGGCGG GCAGGCCAAG GTCAAGGGCG TCTCGGGGGT GTGGCGGGAC CTCACCGAGT CGGTGAACTC GATGGCGGGC AACCTGACCA GCCAGGTCCG TAACATCGCT CAGGTCACCA CCGCGGTTGC GCAGGGTGAT CTGAGTCAGA AGATCACGGT GGACGCCCGC GGTGAGATCC ACGAACTGAA GTCGACCGTC AACACGATGG TCGACCAGCT CTCCGCGTTC GCCGACGAGG TCACCCGGAT GGCCAAGGAG GTCGGCACCG AGGGCAAGCT AGGCGGCCAG GCGCAGGTCA AGGGCGTTTC CGGGGTCTGG CGCGACCTGA CCGACTCGGT CAACGTCATG GCCGGCAACC TCACCACCCA GGTTCGCAGC ATCGCCGAGG TCGCGGCGGC CGTGGCCCGC GGCGACCTCA CCCGGCAGAT CACCGTCGAC GCCCGGGGCG AGGTCGCGGG CCTCGCGCAC ACCCTGAACA CGATGGTGGA CCAGTTGTCG TCGTTTGCGG ACGAGGTGAC CCGGGTCGCC TGGGAGGTGG GCACGGAGGG CAACCTGGGC GGGCAGGCGC ACGTGCGGGC CGTGTCGGGG GTGTGGCGGG ACCTGACGGA GTCGGTGAAC TCGATGGCGG GCAACCTGAC CAGTCAGGTC CGCAACATCG CGCTCGTCGC CACCGCCGTC GCCCGCGGCG ACCTGTCGCA GAAGATCACC GTGGCGGCCC AGGGTGAGAT CCTGGAGCTC AAGGACACCC TGAACACGAT GGTGGACCAG TTGTCGTCGT TTGCGGACGA GGTGACCCGG GTCGCCCGCG AGGTGGGCAC CGAGGGCAAC CTGGGCGGGC AGGCGCATGT GCGCGGGGTG TCGGGGGTGT GGCGGGATCT GACGGAGTCG GTGAACTCGA TGGCGGGCAA CCTGACCAGT CAGGTCCGCA ACATCGCGCT GGTGACCACG GCGGTGGCCC GCGGCGACCT GAGCCAGAAG ATCACCGTGA CCGCGCAGGG TGAGATCGCC GAGCTGAAGG ACACCGTCAA CACGATGGTC GACCAGCTCT CCTCGTTCGC AGCGGAGATC ACCCGGGTGG GCCGGGAGGT TGGCGTCGAG GGCAAGCTCG GCGGTCAGGC CACCGTAGCC GGAGTCGCCG GAACGTGGAA GGACCTCACC GACAACGTCA ACCAGCTGGC GTCCACGCTG 
ACCATCCAGC TGCGGGCGAT CGGCGACGTC TCCACCGCGG TGACCCGCGG TGACCTGACC CGGTCCATCT CGGTGGAGGC CGAGGGCGAG GTCGCCGAGC TCAAGGACAA CATCAACCAG ATGATCGCGC GACTGCGCGA GACCACCGAG GTCAACGCCC AGCAGGACTG GCTGAAGTCC AACCTGGCCC GGATCGGCAG CAAGATGCAG GGCCAGCGCG ATCTCTACGC GGTCTGCCAG ATGATCATCA GTGAGATGAC GCCAGCGGTC AACGCCCAGC AGGGCACGGT CTACCTGCTC GACTTCATCG AGGGTGACAA GCTGCGCTAC GTCGCCGGCT ACGGCTCGGT GCCCCGGCGC CGCTCGGACG GCACCTTCCT GTTCGGCGAG GGACTCATCG GGCAGGCGGC CCTGGAACGC GACCGTATCC GAGTCGAGCA CGTACCAGCC GGCTACCTCA ACATCCGCAG CGGACTCGGT GAGGCGCCGC CGTGCGACCT GGTCGTCGTG CCGGTCGTGT TCGAGAACCA GGTGCTCGGC GTCATCGAGC TGGCCTCGTT CTCGCCGTTC TCCGAGCTGC ACCTCACCCT GGTCGACCAG CTCGTCGACA CCATCGGGGT GGTGCTGAAC ACGATCATGG CGAACGCGCG GACGGAGGAG CTGCTGGCCC AGTCCCAGCG GCTCACCCAG GAGCTGCGTT CGCAGTCGGT CGAGCTGCAG CGGACGAATA ACGAGCTTGA GGAGAAGGCG GCGCTGCTCG AGGAGAAGAA CCACGAGATC GAGCTGGCCC GCATCGGGTT GGAGGAGAAG GCCGAGCAGC TGGCGCTGTC GTCGCAGTAC AAGTCGGAGT TCCTGGCGAA CATGAGCCAC GAGCTGCGCA CGCCGCTCAA CAGTCTGCTC ATCCTCGCCA AGCTGCTGGC CGACAACCCG GACCGTAACC TCTCCCGCAA GCAGATCGAC TTCGCCGAGA CGATCCACTC CGCCGGTTCC GAGCTGCTCG AATTGATCAA CGACATCCTG GACCTGTCGA AGGTCGAGGC CGGCAAGATG AACGTCGATG CGACCACCGT GCGCACGGCG GCGCTCTGCG ACGCGGTCGC CGGGGTCTTC GGCCCCGTGG CGGAGGAGAA GGGCCTGTCA TTCCAGGTCA ACCTCGCTCC CGACGTGCCG GCCGAGTTCG TCACCGACGA GCAGCGCCTC CAGCAGGTGC TGAAGAACCT CCTGTCGAAC GCGGTGAAAT TCACCGACAC CGGCACCGTC CGGCTGGACG TCACCGTCGC CCGGCCGGAT CTGCCCTTCC TCTCGCCGAG CCTGTGTTCC GCGGGCACGG TGTTGTCGTT CGCGGTCACC GACACCGGAA TCGGGGTCGC CGTCGAGAAG CTTCGGATGA TCTTCGAGGC GTTCCAACAG GCGGACGGCA CGACGTCGCG TCGCTACGGT GGCACCGGGC TCGGCCTGTC GATCAGCAAG GAGATCGCCC GCCTGCTGGG CGGGGCCATC GCGGTCTCCA GCGTGGTCGG CCGTGGGAGC ACCTTCACCC TTTACCTGCC GTCGGCCCCG CCGGCGCAGA CGCCGCCCTT TGGGGTCACG CCGGGCGAGC CAGACAACGT TCTCATGATC GTCGATCCGT CGGGTCAGTA CCTCGGGCGG CGGTCGGGTG AGGGAGTGGA CGCGGAACCG GGTGGCGGAC CCGCGCCGTC CGCCGGCCGC GATGCCGGGC CCGCGAACGG ATGGCAGAAC CCCGACGCCG ACCGGTTCGG CACGGCGGCA CTCGCGCCCT CGTCCCCGTC TTCGTCTTCG TCCCTGTCCC 
CGACCCGTTC CGGGCCCGGC GCGGGGGGAA CCGCCGGGCA CGGGGACGCG GGTGGGATCG GCGGCGCCCA CTGGTCCGCC GCGGCCAGCG GACCGGCGGA GGGCTCGGCG GCCTTCACGT CGGGCTGGCG GCCGAGCGAG CCCGTCACGG TGGGGACTCC GTTCCTCACC GAGTCCACCG AGTCCACCGA GTCCGCCAGC CGCCCGCTGA CCGAGGGATC GGAAGGATCC GATCCCCTGG TCGGCACCAC GGTGCTGGTC GTCGACGACG ACGTCCGCAA CGTCTTCGCG CTCACCAGCG CGCTGGAGAT GTACGGAATG CGGGTCCTCT ACGCCGACAA CGGTCATGAT GCGATCCGTA TGCTGCAGCA GGACACCCCG CCCGTGCACC TCGTGCTGAT GGATGTGATG TTGCCGGGCA TGGACGGCAA CGAAACGACC TCGATGATCC GTGACATGCC GGCGTTCGCG GACCTGCCGA TCCTGGTGCT GACCGCCAAG GCCATGCCGG GAGATCGGGA GAAGAGCATC ACCGCCGGAG CCACCGACTA CATCACCAAG CCCGTGGATC TGGAGCACCT CCTCGGGGTG ATGCGGTCGT ATCTGTATTG A
|
Protein sequence | MSPAYDPQPS ARHDLDAPGG DATATIVPHA SLPVDDTRAS RAAADELIAD LLHGLTRLCE GDFSTRLAAR DGYAGDVVRK FDELAAIQER HARELARVSK VIRRDGRLTL RMEDLGNPGG WSDMTQAVNS LIDDLARPTH EVARVIAAVA EGDLSQHMAL EIAGQPVRGE FLRIGTTVNT MVDQLSSFAD EVTRVAREVG TEGNLGGQAK VKGVSGVWRD LTESVNSMAG NLTSQVRNIA QVTTAVAQGD LSQKITVDAR GEIHELKSTV NTMVDQLSAF ADEVTRMAKE VGTEGKLGGQ AQVKGVSGVW RDLTDSVNVM AGNLTTQVRS IAEVAAAVAR GDLTRQITVD ARGEVAGLAH TLNTMVDQLS SFADEVTRVA WEVGTEGNLG GQAHVRAVSG VWRDLTESVN SMAGNLTSQV RNIALVATAV ARGDLSQKIT VAAQGEILEL KDTLNTMVDQ LSSFADEVTR VAREVGTEGN LGGQAHVRGV SGVWRDLTES VNSMAGNLTS QVRNIALVTT AVARGDLSQK ITVTAQGEIA ELKDTVNTMV DQLSSFAAEI TRVGREVGVE GKLGGQATVA GVAGTWKDLT DNVNQLASTL TIQLRAIGDV STAVTRGDLT RSISVEAEGE VAELKDNINQ MIARLRETTE VNAQQDWLKS NLARIGSKMQ GQRDLYAVCQ MIISEMTPAV NAQQGTVYLL DFIEGDKLRY VAGYGSVPRR RSDGTFLFGE GLIGQAALER DRIRVEHVPA GYLNIRSGLG EAPPCDLVVV PVVFENQVLG VIELASFSPF SELHLTLVDQ LVDTIGVVLN TIMANARTEE LLAQSQRLTQ ELRSQSVELQ RTNNELEEKA ALLEEKNHEI ELARIGLEEK AEQLALSSQY KSEFLANMSH ELRTPLNSLL ILAKLLADNP DRNLSRKQID FAETIHSAGS ELLELINDIL DLSKVEAGKM NVDATTVRTA ALCDAVAGVF GPVAEEKGLS FQVNLAPDVP AEFVTDEQRL QQVLKNLLSN AVKFTDTGTV RLDVTVARPD LPFLSPSLCS AGTVLSFAVT DTGIGVAVEK LRMIFEAFQQ ADGTTSRRYG GTGLGLSISK EIARLLGGAI AVSSVVGRGS TFTLYLPSAP PAQTPPFGVT PGEPDNVLMI VDPSGQYLGR RSGEGVDAEP GGGPAPSAGR DAGPANGWQN PDADRFGTAA LAPSSPSSSS SLSPTRSGPG AGGTAGHGDA GGIGGAHWSA AASGPAEGSA AFTSGWRPSE PVTVGTPFLT ESTESTESAS RPLTEGSEGS DPLVGTTVLV VDDDVRNVFA LTSALEMYGM RVLYADNGHD AIRMLQQDTP PVHLVLMDVM LPGMDGNETT SMIRDMPAFA DLPILVLTAK AMPGDREKSI TAGATDYITK PVDLEHLLGV MRSYLY
|