Gene Information |
Locus tag | Francci3_0644 |
Symbol | |
ID | 3903322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 729800 |
End bp | 733648 |
Gene Length | 3849 bp |
Protein Length | 1282 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637877977 |
Product | periplasmic sensor signal transduction histidine kinase |
Protein accession | YP_479757 |
Protein GI | 86739357 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCCGG GTTCGCCCGT GGACGCCGAA GACGGCGGGC AGCGCGATGG TCGATCCGCC GCGCCACCGA ACCCAGCCGG GTCGGGACTG GCGCGCCTTC GTCGCGCCGC CGCCAACCCC ACCAACTGGC GGGTACGCAC CAAGCTGATC GCGATTCTGG CCGTGCCTGT CATCGCGATC CTCGCCTACT CGTCCATCGA GGCGGGTACG GTGCTGACGA GCACGCGGGA CGTCAACCGG GTGTCCGATC TGACCGAGAT CAGCCGCTCC GCGGCCGGGC TGGTCCACGC CCTGCAGGCC GAGCGCACCT ACAGCACCGG ATTCGTCGCC AGCGGTCCCG TCGGCGCCCG GGACGGGAGC ACGATTCCAA CCCGCCGGGC GGCGGTCGAT GAAGCCTACA GGGCGTACAA GTCAGCTGCG AACGAGCGGC GAGGGTCCTT CGGCAGCGAG GTACGCCGCG CTCTCGACGA CGTCAGCAGC CGGATGGACA CGCTGCCCGC CAAGCGGCAG ACGATCGAAA GGCTGATCGA CAGGAAGGCG GTCGAGACCG CCTTTCAGCC GATCATTGAC TCACTGGTCA ACCTGATCCG GCTGATCCCA CAGGGAAATG ACGACCAGGA GCTCAACGCC GGGGTCAACA CGGCCTACTT CCTGATGTCC GGGACGGAAC TTCTCGCCGA GGAGCAGACA CTCATCCACG GCCTGTTTAC CCGCAACGGG AACCAGGTCT TCACGGCAGA CGACTATCGC GAGTTCCTGT CGGCCGAGAC CCAGCGGGGC CGCCAGCGCA ACGATCTCGA GAACGCCGCT CTACCGAACC AGCTCGCGGT CTACGAACCG ACGATCGACA CCGAGACCAA CGGGGACATC CGCAACACCC AGAATTACGT CGAAGAAATT CTGAACGCGC CCATCGGTCA GCCGCTGACC GCGCTGGAAG CCGAGAAATG GGACCAGGCG ACCCAGCGAA CCATCGACAC CGGCACGGAA GCCGTTGACA AGCTCCTGGG CCTCGTCGAC GCCCGCGCGG GCACCCTGCG GGGCAACATC CAGCGCCACG CGCTGCTGTC CGGTCTGCTG ATCCTGCTCA TCCTGGCCCT CGCCCTGCTC GCCGCACTGA TCGTCGCGCG ATCGCTGGTC CGCCCCCTGC TGGCGCTGCG GACTGCCGCG CTGGACGTCG CCGACCATCG GCTTCCCGAG GCCGTCCGTC GGTTGCGCGA TTCCACCGAG CTGGACATCG ACGACTCGAT CGAACCGGTC GGTATCGACA CCGACGAGGA GGTCGGCGAA GTCGCGCGGG CTTTCGACGA GGTTCACCGG GAGGCGATCC GGCTGGCCTC CGAACAGGCG TCGCTGCGGA ACAACGTCAA CGCGATGTTC GTGAACCTCT CCCGGCGCAG TCAGGGACTG GTCGAACGGC AGCTGCGTCT CATCGACGAG CTGGAGAACC GCGAGCAGGA TCCGGACCAG TTGTCCAACC TGTTCAAGCT GGATCACCTC GCCACTCGTA TGCGGCGGAA CAACGAAAGC CTGCTCGTCC TCGCCGGCAC GGACACCGCT CGGCGCTGGA CACACCCCGT CCCACTGAAC GAGGTCGTAC TGGCGGCCAT CTCCGAGGTG GAGCAGTACA CCCGCGTCAA GCAGACCTCG GCCGCCGCGG TGTCCATCGC CGGCAACGGT GTCAGCGACG TCGTCCATCT GGTCGCCGAG CTCCTCGAGA ACGCCACCTC GTACTCGCCG CCGGCCACCA GCGTCCTCGT CACCAGCCGC TCGCTCGGTC CGGGGGCCGG GGCGATGATC 
GAGATCGAGG ACCAGGGCAT CGGCATGCCG GCCAAGGAAC TCGAACGGGT CAACGACCGA CTGGCGAACC CGCCGGTGGT CGACGTGTCG GTGTCACGGA CCATGGGCCT GTTCGCGGTC GGTCGGCTGG CCAGCCGCCA CGGCATTCAC GTCCAGCTGC GTGAGTCCGC CTCCGGAGGT ATCACCGCTG TCGTCCGGCT GCCTGCCAAG CTGGTGACCG GAGACGGCGG GGCCGGCACG GGACCGGCCC GGCCCGCCGC GCCGGCGTTG GGAACCCGCG TCGGCGACTC GCCGGCCATC CCCGGTTCCC CGACCGGACC GCGTGCCCCG CAGCGGCCGG AACGCACCGG CTCGGGAACT CCCGAACCGG TGGACGGCAG GGTCGCGTCG ACGAACGGCA CCGGGGCCAA CGGGCACCGG ACCGCGAGTG AATCCGACGG ACCAGAAAGC CTCTTCGACG ACTCCAGCGG GCCCCGGCGG TTCCCCAGCA CCGGGCCGTT CCCTCTCACC GGGCCGCTGG CCGCCGCTCA GCTCGGATCC GACCGGACGG GTTCGCCGCC CACCGCGGAT GATCGCGACT TCGGGCCGGA CGACTTCGCT CCCGGCCTCG ACGGCGGCAC CGAGCGCGAC AGCGAGACGG ATGGCATCAG CGCCGCTCCG CCGACCCTGC CGCACGATCT ACTCGGGCGG GGAGGAAACG ACGAGGACCA CCCGGTCCCG GGAGCCCGGG AGCGGCCCGG CTCCAGTGGG CAGGACGATC CCCGTGACCC ACGGCAGCTC GACGACCGGG GGCTCGACCT CGATGCCCTT CGCGGCCGCG GCCGCACCGC TCCCGGTGGC CGGCTCGAAC CGCCCCCCGA CGTCGTACCG GACACCCAGA CCATCGACCG GCGCGGATTC GGACGACGGG ACGATCCGGT CACGGACGAG ACGAGACCGG TGTCGTTCGG GGACAACCGC TGGGGGACCA AACCACGTTC CGGCGGCGCC ACCGGGCCCG GTCGGACCGG GCAGCCCGAC ACCACCCCGA CACCGCCCAC CCCGACACCG CCCGGCCCGA CACCGCCCGG CCCGACACCG CCCGGCCCGA CACCGCCCGG CCAGAGCCGG CCCGATGCCG GTGAGCGGCC CCGGTCGGCC GAGGAGCCGC CCCCCGGGGC CGGCAGCGTG TTCGCCCGCT CCGAGAACCG GCCCGTTCCG CAGGAACGAT CGGAGTCCAC GGAGTCCACG GAGTCCACGG AGTCCACCGG TCCCCGCGAC CGGCTGGCGT CGGCGCTCGC GTCCCGGCGG TTGCGCGACA CCGGGACGAC AGAGGCGGTC TCCGGCGAGG ACACCGGCGA ACAGACCACG GCAGACCGGG ACACCGACGA CATCGAGACG TCGCCGATCT TCGACTCCGT GTCGGCGTGG TTCCAGCGAC GTTCTCCGTC CGATCGGGCG CCCGCCCAGC ATTCGACGGC GGCCGCGGCG GCCCGCCCCA CCGGGCCGGG TGCCGGTGGG GCGGCGCCAC CGCCCCGTTC CCCGGTCCGT CCGGGGCTCG GTGGGGCCTT CGGGCAACGC GGCCCCGCCC GCTCCGGAGT CGATCCGCTC ACGGCTACCC GGACACCGGC ACGGGAGGAG CCCGCGACGG ACGCGCCGTC AGCGGCTCCG CCCGTCCCGG CCTTCGCGGC CCAGCATCAG ACGGCCCCAC ACCATCAACC AGCAGGGCAA TACCAGCCAG CACAGCACCA GGCAACGCCC CAGCAGCCCC CGTCCCGGGA GAACTGGACA TCTCCCGGGG ACGCCGGCTG GCAGGCGGCC GAGTTGTTGC 
GTCAGCCGAG CACAGGCGGC GTGACGCGGT CGGGCCTACC GGTGCGAGTT CCCATGACCC ATCTCGTGCC CGGCAGCGCC GAACCCGCGC CCCGCCGGCG TCCCACCGAC ACCTCGGCCA GATCTCCTGA GGCTGTAGGA GGCCGGCTTG CCAGCTTCTA CCAGGGCGTA CGGCAGGGAC GTGATGTCGG AGTCGACACC GCGAGGAATC CTCGACGTGA CGCGCAGGAG GAACGGTGA
|
Protein sequence | MAPGSPVDAE DGGQRDGRSA APPNPAGSGL ARLRRAAANP TNWRVRTKLI AILAVPVIAI LAYSSIEAGT VLTSTRDVNR VSDLTEISRS AAGLVHALQA ERTYSTGFVA SGPVGARDGS TIPTRRAAVD EAYRAYKSAA NERRGSFGSE VRRALDDVSS RMDTLPAKRQ TIERLIDRKA VETAFQPIID SLVNLIRLIP QGNDDQELNA GVNTAYFLMS GTELLAEEQT LIHGLFTRNG NQVFTADDYR EFLSAETQRG RQRNDLENAA LPNQLAVYEP TIDTETNGDI RNTQNYVEEI LNAPIGQPLT ALEAEKWDQA TQRTIDTGTE AVDKLLGLVD ARAGTLRGNI QRHALLSGLL ILLILALALL AALIVARSLV RPLLALRTAA LDVADHRLPE AVRRLRDSTE LDIDDSIEPV GIDTDEEVGE VARAFDEVHR EAIRLASEQA SLRNNVNAMF VNLSRRSQGL VERQLRLIDE LENREQDPDQ LSNLFKLDHL ATRMRRNNES LLVLAGTDTA RRWTHPVPLN EVVLAAISEV EQYTRVKQTS AAAVSIAGNG VSDVVHLVAE LLENATSYSP PATSVLVTSR SLGPGAGAMI EIEDQGIGMP AKELERVNDR LANPPVVDVS VSRTMGLFAV GRLASRHGIH VQLRESASGG ITAVVRLPAK LVTGDGGAGT GPARPAAPAL GTRVGDSPAI PGSPTGPRAP QRPERTGSGT PEPVDGRVAS TNGTGANGHR TASESDGPES LFDDSSGPRR FPSTGPFPLT GPLAAAQLGS DRTGSPPTAD DRDFGPDDFA PGLDGGTERD SETDGISAAP PTLPHDLLGR GGNDEDHPVP GARERPGSSG QDDPRDPRQL DDRGLDLDAL RGRGRTAPGG RLEPPPDVVP DTQTIDRRGF GRRDDPVTDE TRPVSFGDNR WGTKPRSGGA TGPGRTGQPD TTPTPPTPTP PGPTPPGPTP PGPTPPGQSR PDAGERPRSA EEPPPGAGSV FARSENRPVP QERSESTEST ESTESTGPRD RLASALASRR LRDTGTTEAV SGEDTGEQTT ADRDTDDIET SPIFDSVSAW FQRRSPSDRA PAQHSTAAAA ARPTGPGAGG AAPPPRSPVR PGLGGAFGQR GPARSGVDPL TATRTPAREE PATDAPSAAP PVPAFAAQHQ TAPHHQPAGQ YQPAQHQATP QQPPSRENWT SPGDAGWQAA ELLRQPSTGG VTRSGLPVRV PMTHLVPGSA EPAPRRRPTD TSARSPEAVG GRLASFYQGV RQGRDVGVDT ARNPRRDAQE ER
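The length fields in this record follow from the coordinates by simple arithmetic, and the sketch below illustrates that relationship. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers written for this illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases; whitespace (10-base blocks, line breaks) is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values copied from the Gene Information table of this record.
start_bp, end_bp = 729800, 733648

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                   # 3849
print(protein_length(length_bp))   # 1282

# First 10-base block of the gene sequence, as a small demonstration only.
print(gc_fraction("GTGGCGCCGG"))   # 0.9
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content. The spaces between 10-base blocks do not need to be removed first.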