Gene Francci3_2152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2152 
Symbol 
ID3905542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2519963 
End bp2521975 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content71% 
IMG OID637879487 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_481253 
Protein GI86740853 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.381885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGA GACAGGTGCG CATGATGCCA CGCATTCACG TCCTCATGAT CGAGGACAGC 
GACGACGATG CCGTGCTCGT CATCGAACGA CTGCGACGCG GCGGTTTCGA GGTGACCTCC
GATCGGGTGG AGGACTCCGA GGCGGCCGCG TCGGTTCTCC GTACCCGGCC TCCGGACCTG
GTGATCTGCG ATTACCACAT GCCGGCGTTC GGCGCCGAGG CGGCCCTGAT GCAGCTGCGC
GCCAGCGGCC TCGATATCCC GTTCATCGTC GTCTCCGGCC AGGTCGGCGA GGAGACGGCG
GCGGCACTCA TGCGGGCCGG CGCGCACGAC TTCCTGCTCA AGGATCGCAT GAGCCGACTC
GTCCCCGCGG TGCAGCGCGA ACTGCGGGAG GCCGACGACC GGCAGAGACG ACGGCAGGCC
GAGGCCGCCC TACGACGCAG CGAGGAGCGG TTCCGGCTGC TCGCCGAGCA GGCGACGGAC
GTCCTCTTCC GCTGCCGGCT GTGCCCGCGG GCCGAGGTCG AGTACGTCAG CCCGGCGATC
AGGATGATCA CCGGCTACCC GCCCGAGGAC CTGTGCGGCC GCCCCGACAC CCTGTTCTCG
CTGGTCGACG AGGAGGACCG GCCCACCCTC AAGGGCTCGT GGCGCACGGC GAACCCCGAA
CCGCTGCGGG TCCGTTGGCA CCCGCGAGAG GGCGTCGAGG TCTGGACGGA GCAACGCGTG
GTCGGCGTCC ACGACGACCA GGGCCACCTC GTCGCGGTGC AGGGCATCCT GCGTGACGTG
ACCGAGCAGG TCCATGCCGA CGAGGAACGG GAGCGGCTGC GACGCCAGCT GGATCAGACC
GAGCGGCTGG AGTCCCTCGG CCGGCTCGCG GGCGGCGTGG CGCACGACTT CAACAACCTG
CTCGGCGTGA TCACCGGCTA TGCGGAGATC GTCCTCGACA CCCTTCCCGA CGACGACCCG
TGCCGGGCCG ACATGGACGA GATCAGCCGT GCCGCCGACC AGGCCGCCGG ACTGACCCGT
CAACTGCTCA CCTTCAGCCG GCAGGAGTCG TCGAAGCCGG AGACCCTCGA CCTCAACGCG
GTGGTCGAGG GGACGCGGAA CCTGCTGCGA CGCACCATCG GCGAGGACAT CGAGATCGTC
ACCCTCCTCG ATCCCGATCT CCACCCCGTC ACGATCGACC CGAGCAAGAT CCAACAGGTG
GTGATGAACC TGGTCGTCAA CTCCCGTGCG GCCATGCCCG ACGGCGGCCG CCTCACCCTC
AGCACGGCGA ACCTGGACCG GGACCGGGGC CCGTCTCCCG CGGACCGCCC GGCCGCGCAC
GGGCGGCCGT CGCCGGAAGA ACGCCCGAAG CGCGGCTGGG CCTGCCTGGC GGTCGCCGAC
ACCGGGTGCG GCATGAGCCC GGAGGTGATC CGTCGCGCCG TCGAACCCTT CTTCACCACC
AAGGGACCCG GTGAGGGGAC GGGACTCGGC CTCGCCACCG TCTACGGGGT GGTCAAGGCG
GCCGGCGGCG AGGTCGACAT CGAGTCCGAG GTCGGGAAGG GGACGACCAT CCGGATCCTG
CTGCCCGCCA CGGCGAGGAG CGGGCCCGCT CCCGTCGACA CCACCCGGGA CGACACCACC
CGGGACGACG CCACCCGCGG GCAGGGGAAG ACGATCCTCG TCGTCGAGGA CGACGCGGCC
GTGCGCGCCG TCACCACGCG CATCCTCGTA CGGTCCGGCT ACCTCGTCCA CGAGGCGGGT
TCCCCGGCCG AGGCCCTCGA CCGGTTCGGT CCCGGGACGA CGCGGATCGA CGCCCTGCTC
ACCGACGCGA TCATGCCCGG CATGACGGGC TACCAGCTCA TCGAACAGTT CCGTCGGACC
CGCCCCGAGC TGCCGGTCAT GCTGATGTCG GGCTACGCCG CCGGCGCCGA GAAGTCCGTG
GCCGGCTTCC CGCCCGGCGT CGCCCACCTC CAGAAGCCGT TCACCGCCCG CGCCCTTCTC
GACAGCCTGG AACGAACCCT GCACCCACGC TGA
 
Protein sequence
MTARQVRMMP RIHVLMIEDS DDDAVLVIER LRRGGFEVTS DRVEDSEAAA SVLRTRPPDL 
VICDYHMPAF GAEAALMQLR ASGLDIPFIV VSGQVGEETA AALMRAGAHD FLLKDRMSRL
VPAVQRELRE ADDRQRRRQA EAALRRSEER FRLLAEQATD VLFRCRLCPR AEVEYVSPAI
RMITGYPPED LCGRPDTLFS LVDEEDRPTL KGSWRTANPE PLRVRWHPRE GVEVWTEQRV
VGVHDDQGHL VAVQGILRDV TEQVHADEER ERLRRQLDQT ERLESLGRLA GGVAHDFNNL
LGVITGYAEI VLDTLPDDDP CRADMDEISR AADQAAGLTR QLLTFSRQES SKPETLDLNA
VVEGTRNLLR RTIGEDIEIV TLLDPDLHPV TIDPSKIQQV VMNLVVNSRA AMPDGGRLTL
STANLDRDRG PSPADRPAAH GRPSPEERPK RGWACLAVAD TGCGMSPEVI RRAVEPFFTT
KGPGEGTGLG LATVYGVVKA AGGEVDIESE VGKGTTIRIL LPATARSGPA PVDTTRDDTT
RDDATRGQGK TILVVEDDAA VRAVTTRILV RSGYLVHEAG SPAEALDRFG PGTTRIDALL
TDAIMPGMTG YQLIEQFRRT RPELPVMLMS GYAAGAEKSV AGFPPGVAHL QKPFTARALL
DSLERTLHPR