Gene Francci3_0644 details


Gene Information

Locus tag: Francci3_0644
Symbol:
ID: 3903322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 729800
End bp: 733648
Gene Length: 3849 bp
Protein Length: 1282 aa
Translation table: 11
GC content: 71%
IMG OID: 637877977
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_479757
Protein GI: 86739357
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
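
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. Neither convention is stated in the record, although both are consistent with its numbers. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 729800
end_bp = 733648
gene_length_bp = 3849
protein_length_aa = 1282


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_cds_length(protein_len):
    """Nucleotides needed for the protein plus one stop codon (assumed included)."""
    return (protein_len + 1) * 3


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_cds_length(protein_length_aa) == gene_length_bp
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the reported gene length of 3849 bp.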


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCCGG GTTCGCCCGT GGACGCCGAA GACGGCGGGC AGCGCGATGG TCGATCCGCC 
GCGCCACCGA ACCCAGCCGG GTCGGGACTG GCGCGCCTTC GTCGCGCCGC CGCCAACCCC
ACCAACTGGC GGGTACGCAC CAAGCTGATC GCGATTCTGG CCGTGCCTGT CATCGCGATC
CTCGCCTACT CGTCCATCGA GGCGGGTACG GTGCTGACGA GCACGCGGGA CGTCAACCGG
GTGTCCGATC TGACCGAGAT CAGCCGCTCC GCGGCCGGGC TGGTCCACGC CCTGCAGGCC
GAGCGCACCT ACAGCACCGG ATTCGTCGCC AGCGGTCCCG TCGGCGCCCG GGACGGGAGC
ACGATTCCAA CCCGCCGGGC GGCGGTCGAT GAAGCCTACA GGGCGTACAA GTCAGCTGCG
AACGAGCGGC GAGGGTCCTT CGGCAGCGAG GTACGCCGCG CTCTCGACGA CGTCAGCAGC
CGGATGGACA CGCTGCCCGC CAAGCGGCAG ACGATCGAAA GGCTGATCGA CAGGAAGGCG
GTCGAGACCG CCTTTCAGCC GATCATTGAC TCACTGGTCA ACCTGATCCG GCTGATCCCA
CAGGGAAATG ACGACCAGGA GCTCAACGCC GGGGTCAACA CGGCCTACTT CCTGATGTCC
GGGACGGAAC TTCTCGCCGA GGAGCAGACA CTCATCCACG GCCTGTTTAC CCGCAACGGG
AACCAGGTCT TCACGGCAGA CGACTATCGC GAGTTCCTGT CGGCCGAGAC CCAGCGGGGC
CGCCAGCGCA ACGATCTCGA GAACGCCGCT CTACCGAACC AGCTCGCGGT CTACGAACCG
ACGATCGACA CCGAGACCAA CGGGGACATC CGCAACACCC AGAATTACGT CGAAGAAATT
CTGAACGCGC CCATCGGTCA GCCGCTGACC GCGCTGGAAG CCGAGAAATG GGACCAGGCG
ACCCAGCGAA CCATCGACAC CGGCACGGAA GCCGTTGACA AGCTCCTGGG CCTCGTCGAC
GCCCGCGCGG GCACCCTGCG GGGCAACATC CAGCGCCACG CGCTGCTGTC CGGTCTGCTG
ATCCTGCTCA TCCTGGCCCT CGCCCTGCTC GCCGCACTGA TCGTCGCGCG ATCGCTGGTC
CGCCCCCTGC TGGCGCTGCG GACTGCCGCG CTGGACGTCG CCGACCATCG GCTTCCCGAG
GCCGTCCGTC GGTTGCGCGA TTCCACCGAG CTGGACATCG ACGACTCGAT CGAACCGGTC
GGTATCGACA CCGACGAGGA GGTCGGCGAA GTCGCGCGGG CTTTCGACGA GGTTCACCGG
GAGGCGATCC GGCTGGCCTC CGAACAGGCG TCGCTGCGGA ACAACGTCAA CGCGATGTTC
GTGAACCTCT CCCGGCGCAG TCAGGGACTG GTCGAACGGC AGCTGCGTCT CATCGACGAG
CTGGAGAACC GCGAGCAGGA TCCGGACCAG TTGTCCAACC TGTTCAAGCT GGATCACCTC
GCCACTCGTA TGCGGCGGAA CAACGAAAGC CTGCTCGTCC TCGCCGGCAC GGACACCGCT
CGGCGCTGGA CACACCCCGT CCCACTGAAC GAGGTCGTAC TGGCGGCCAT CTCCGAGGTG
GAGCAGTACA CCCGCGTCAA GCAGACCTCG GCCGCCGCGG TGTCCATCGC CGGCAACGGT
GTCAGCGACG TCGTCCATCT GGTCGCCGAG CTCCTCGAGA ACGCCACCTC GTACTCGCCG
CCGGCCACCA GCGTCCTCGT CACCAGCCGC TCGCTCGGTC CGGGGGCCGG GGCGATGATC
GAGATCGAGG ACCAGGGCAT CGGCATGCCG GCCAAGGAAC TCGAACGGGT CAACGACCGA
CTGGCGAACC CGCCGGTGGT CGACGTGTCG GTGTCACGGA CCATGGGCCT GTTCGCGGTC
GGTCGGCTGG CCAGCCGCCA CGGCATTCAC GTCCAGCTGC GTGAGTCCGC CTCCGGAGGT
ATCACCGCTG TCGTCCGGCT GCCTGCCAAG CTGGTGACCG GAGACGGCGG GGCCGGCACG
GGACCGGCCC GGCCCGCCGC GCCGGCGTTG GGAACCCGCG TCGGCGACTC GCCGGCCATC
CCCGGTTCCC CGACCGGACC GCGTGCCCCG CAGCGGCCGG AACGCACCGG CTCGGGAACT
CCCGAACCGG TGGACGGCAG GGTCGCGTCG ACGAACGGCA CCGGGGCCAA CGGGCACCGG
ACCGCGAGTG AATCCGACGG ACCAGAAAGC CTCTTCGACG ACTCCAGCGG GCCCCGGCGG
TTCCCCAGCA CCGGGCCGTT CCCTCTCACC GGGCCGCTGG CCGCCGCTCA GCTCGGATCC
GACCGGACGG GTTCGCCGCC CACCGCGGAT GATCGCGACT TCGGGCCGGA CGACTTCGCT
CCCGGCCTCG ACGGCGGCAC CGAGCGCGAC AGCGAGACGG ATGGCATCAG CGCCGCTCCG
CCGACCCTGC CGCACGATCT ACTCGGGCGG GGAGGAAACG ACGAGGACCA CCCGGTCCCG
GGAGCCCGGG AGCGGCCCGG CTCCAGTGGG CAGGACGATC CCCGTGACCC ACGGCAGCTC
GACGACCGGG GGCTCGACCT CGATGCCCTT CGCGGCCGCG GCCGCACCGC TCCCGGTGGC
CGGCTCGAAC CGCCCCCCGA CGTCGTACCG GACACCCAGA CCATCGACCG GCGCGGATTC
GGACGACGGG ACGATCCGGT CACGGACGAG ACGAGACCGG TGTCGTTCGG GGACAACCGC
TGGGGGACCA AACCACGTTC CGGCGGCGCC ACCGGGCCCG GTCGGACCGG GCAGCCCGAC
ACCACCCCGA CACCGCCCAC CCCGACACCG CCCGGCCCGA CACCGCCCGG CCCGACACCG
CCCGGCCCGA CACCGCCCGG CCAGAGCCGG CCCGATGCCG GTGAGCGGCC CCGGTCGGCC
GAGGAGCCGC CCCCCGGGGC CGGCAGCGTG TTCGCCCGCT CCGAGAACCG GCCCGTTCCG
CAGGAACGAT CGGAGTCCAC GGAGTCCACG GAGTCCACGG AGTCCACCGG TCCCCGCGAC
CGGCTGGCGT CGGCGCTCGC GTCCCGGCGG TTGCGCGACA CCGGGACGAC AGAGGCGGTC
TCCGGCGAGG ACACCGGCGA ACAGACCACG GCAGACCGGG ACACCGACGA CATCGAGACG
TCGCCGATCT TCGACTCCGT GTCGGCGTGG TTCCAGCGAC GTTCTCCGTC CGATCGGGCG
CCCGCCCAGC ATTCGACGGC GGCCGCGGCG GCCCGCCCCA CCGGGCCGGG TGCCGGTGGG
GCGGCGCCAC CGCCCCGTTC CCCGGTCCGT CCGGGGCTCG GTGGGGCCTT CGGGCAACGC
GGCCCCGCCC GCTCCGGAGT CGATCCGCTC ACGGCTACCC GGACACCGGC ACGGGAGGAG
CCCGCGACGG ACGCGCCGTC AGCGGCTCCG CCCGTCCCGG CCTTCGCGGC CCAGCATCAG
ACGGCCCCAC ACCATCAACC AGCAGGGCAA TACCAGCCAG CACAGCACCA GGCAACGCCC
CAGCAGCCCC CGTCCCGGGA GAACTGGACA TCTCCCGGGG ACGCCGGCTG GCAGGCGGCC
GAGTTGTTGC GTCAGCCGAG CACAGGCGGC GTGACGCGGT CGGGCCTACC GGTGCGAGTT
CCCATGACCC ATCTCGTGCC CGGCAGCGCC GAACCCGCGC CCCGCCGGCG TCCCACCGAC
ACCTCGGCCA GATCTCCTGA GGCTGTAGGA GGCCGGCTTG CCAGCTTCTA CCAGGGCGTA
CGGCAGGGAC GTGATGTCGG AGTCGACACC GCGAGGAATC CTCGACGTGA CGCGCAGGAG
GAACGGTGA
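
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch with a hypothetical helper, `gc_percent`, which ignores the spaces and line breaks used in the listing. How the database rounds the value to a whole percent is not stated, so the result is left unrounded here.

```python
def gc_percent(sequence):
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative inputs; paste the full gene sequence to check the record.
print(gc_percent("GGCC"))
print(gc_percent("ATGC"))
```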
 
Protein sequence
MAPGSPVDAE DGGQRDGRSA APPNPAGSGL ARLRRAAANP TNWRVRTKLI AILAVPVIAI 
LAYSSIEAGT VLTSTRDVNR VSDLTEISRS AAGLVHALQA ERTYSTGFVA SGPVGARDGS
TIPTRRAAVD EAYRAYKSAA NERRGSFGSE VRRALDDVSS RMDTLPAKRQ TIERLIDRKA
VETAFQPIID SLVNLIRLIP QGNDDQELNA GVNTAYFLMS GTELLAEEQT LIHGLFTRNG
NQVFTADDYR EFLSAETQRG RQRNDLENAA LPNQLAVYEP TIDTETNGDI RNTQNYVEEI
LNAPIGQPLT ALEAEKWDQA TQRTIDTGTE AVDKLLGLVD ARAGTLRGNI QRHALLSGLL
ILLILALALL AALIVARSLV RPLLALRTAA LDVADHRLPE AVRRLRDSTE LDIDDSIEPV
GIDTDEEVGE VARAFDEVHR EAIRLASEQA SLRNNVNAMF VNLSRRSQGL VERQLRLIDE
LENREQDPDQ LSNLFKLDHL ATRMRRNNES LLVLAGTDTA RRWTHPVPLN EVVLAAISEV
EQYTRVKQTS AAAVSIAGNG VSDVVHLVAE LLENATSYSP PATSVLVTSR SLGPGAGAMI
EIEDQGIGMP AKELERVNDR LANPPVVDVS VSRTMGLFAV GRLASRHGIH VQLRESASGG
ITAVVRLPAK LVTGDGGAGT GPARPAAPAL GTRVGDSPAI PGSPTGPRAP QRPERTGSGT
PEPVDGRVAS TNGTGANGHR TASESDGPES LFDDSSGPRR FPSTGPFPLT GPLAAAQLGS
DRTGSPPTAD DRDFGPDDFA PGLDGGTERD SETDGISAAP PTLPHDLLGR GGNDEDHPVP
GARERPGSSG QDDPRDPRQL DDRGLDLDAL RGRGRTAPGG RLEPPPDVVP DTQTIDRRGF
GRRDDPVTDE TRPVSFGDNR WGTKPRSGGA TGPGRTGQPD TTPTPPTPTP PGPTPPGPTP
PGPTPPGQSR PDAGERPRSA EEPPPGAGSV FARSENRPVP QERSESTEST ESTESTGPRD
RLASALASRR LRDTGTTEAV SGEDTGEQTT ADRDTDDIET SPIFDSVSAW FQRRSPSDRA
PAQHSTAAAA ARPTGPGAGG AAPPPRSPVR PGLGGAFGQR GPARSGVDPL TATRTPAREE
PATDAPSAAP PVPAFAAQHQ TAPHHQPAGQ YQPAQHQATP QQPPSRENWT SPGDAGWQAA
ELLRQPSTGG VTRSGLPVRV PMTHLVPGSA EPAPRRRPTD TSARSPEAVG GRLASFYQGV
RQGRDVGVDT ARNPRRDAQE ER
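
The gene sequence begins with GTG, yet the protein begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first and last codons of the record. The codon table is built from the standard compact encoding (table 11 uses the same codon-to-amino-acid assignments as the standard code), and `translate_cds` is a hypothetical helper, not a tool used by the database.

```python
from itertools import product

# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading a leading start codon as M."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":  # stop codon ends translation
            break
        protein.append(amino)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate_cds("GTGGCGCCGG GTTCGCCCGT GGACGCCGAA"))
print(translate_cds("GAACGGTGA"))
```

The two fragments translate to MAPGSPVDAE and ER, matching the first ten and last two residues of the protein sequence listed above.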