Gene Francci3_2445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2445 
Symbol 
ID3905057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2839467 
End bp2840888 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content73% 
IMG OID637879775 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_481541 
Protein GI86741141 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.333745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGACC TGGCCGAGCG GGCGAGGACG GCTGCCGCGG CGTCCGGTAC ACCCGCCGAG 
CCGCCCGAGC CGGCTCGGCG GGCGGTGTCA GCCGCCGCTG CCGCCCCGCG TTCCAGCGGG
CTGGTCGCCC GCCTCCGTCG GTGGCCGTGG GCGAGGCTGC TCCCGCGCAC GGTCCGATTG
CGCCTGACGC TGCTCTACGG CAGCTTGTTC GTCCTGTCCG GCGCCGCGCT GCTGGCGATC
ACGTATCTTC TGGTGCTGCA CGTCACCGCC ACCATCCAGA TCACCACCAC CCCGACCGGC
ATGCCGGAAA CCCAGGGCGG CCCACTCAAC GGGCAACCGG CCCCGCCACG GGCCTCGTCG
GCGGTCGCCT CGCAGGTCCA CCACAGCTAC CTGCACCAGT TGCTCGTCGA GTCCGGGATC
GCACTGGCGA TCATGGCGGT TGTGTCGATC TGGCTCGGCT GGCTGGTGGC GGGCCGGGTG
CTGCGCCCGT TGCGCACGAT GACCGCGACC ACCCTGCGGA TCTCCCAGGA GAACCTGCAC
GAGCGCCTGG ACCTACCCGG CCCGCAGGAT GAACTCACCG ACCTTGGTGA CACCATAGAC
GGGCTGCTGG CCCGCCTGGA GACCGCGTTC GAGGCGCAGC GACGCTTCGT CGCCAATGCC
TCCCACGAGC TGCGCACCCC GCTCACGATG ATGCGGACCT CGCTCGATGT CGCCGAGGGC
AAACCCCAGC CCGTGCCGCG GGAGGTCACC GTGCTCGCCG GCAAGCTTCG CGAGGGACTC
GACCAGGCCG ACCGGCTGAT CGAGAATTTC CTCACCCTGG CGCGGGCCCA TCAGGGGGCA
CCCGGCAATG ATGCCGCCGT CTCGCTCACC GGCCTCGTCG TCGCCGCGCT GGCAGCCCGC
GAACCTCACG CCGCCGAGCT GGGCGTGCAC ATCCACCGTC AGCTCGCGGC CGTCGAGATC
ATCGGAAACC CGACGCTGCT GCGGCGCCTG GTCGACAACC TCCTCGACAA CGCCCTGCGC
TACAACCATC CCGGCGGCTT CGTCCACGTC CAGGTCCAGG TCCGCTGCCA CCCCGCCACC
GGCGGCACGG ATGACGCCAG GATTGCCCAC CTGACGATCG AGAACGCCGG GCCGCCGCTC
GACGACGCCG ACGTCCAGCA GCTCGGGCAG CCGTTTCGCC GCCTGGCCGC CGACCGCACG
ACCGCCGGCA GCGTCGGGCT CGGGCTGTCG ATCGTCGCCG CGATCGCCGC GGCCCATGAC
GGCGCTCTGC ACCTGAGCGC CCGACCCGAG GGCGGTCTGC GGGCCGTCAT CACCATGCCG
CTCGTCGGCC GCGCTCCGGT GTCGGGGGTG CCGAGGGTGC CGAGGGTGCC GGGGGGTCGC
GGTGAGGCCG GGGTCGCGGT GAGGCCCGGG GGGTCGCGGT GA
 
Protein sequence
MSDLAERART AAAASGTPAE PPEPARRAVS AAAAAPRSSG LVARLRRWPW ARLLPRTVRL 
RLTLLYGSLF VLSGAALLAI TYLLVLHVTA TIQITTTPTG MPETQGGPLN GQPAPPRASS
AVASQVHHSY LHQLLVESGI ALAIMAVVSI WLGWLVAGRV LRPLRTMTAT TLRISQENLH
ERLDLPGPQD ELTDLGDTID GLLARLETAF EAQRRFVANA SHELRTPLTM MRTSLDVAEG
KPQPVPREVT VLAGKLREGL DQADRLIENF LTLARAHQGA PGNDAAVSLT GLVVAALAAR
EPHAAELGVH IHRQLAAVEI IGNPTLLRRL VDNLLDNALR YNHPGGFVHV QVQVRCHPAT
GGTDDARIAH LTIENAGPPL DDADVQQLGQ PFRRLAADRT TAGSVGLGLS IVAAIAAAHD
GALHLSARPE GGLRAVITMP LVGRAPVSGV PRVPRVPGGR GEAGVAVRPG GSR