Gene Francci3_1156 details


Gene Information

Locus tag: Francci3_1156
Symbol:
ID: 3903584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1376036
End bp: 1377616
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 72%
IMG OID: 637878488
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_480264
Protein GI: 86739864
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
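
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Both are assumptions about the record's conventions, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1376036
end_bp = 1377616

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1581 bp
print(protein_length, "aa")  # 526 aa
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.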


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.331298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGCAGC CGGTAGCGGG GCATGCGGCG GTGAACGGGC GCGCCGGCCG GTTGGACGGG 
TTCGCGCGGC GGGTCCACCC GGGACGTCAC CCCAGGATCC GGTTTTCCTC CCCGCTCGGT
CAGCTGACGT TGCGGGCCCG GCTCGCCCTG CTCGTCGGGA TGGCGGTGGC TGCCGCCGTG
ACGGTCGTCG CCGGGGTGTC GCTGGGGGCG ACCAGCATCG TGCTGAACCA GGCCATCGAC
GACCAGCTCG TAGAGCAGGC TGAGGCGTCG GCGCGCACCA TCCAGACCAG CCCGCTGGGG
CTGGAACTGC AGGCGTTCAG CTTCGGGCTG GAGGGCCAGT TCCTCGACGC CGCCGGCAAT
CCGCTGGAGA ACGCCGTCTC GTCCTGGAAC AGCGTGCGGA TCCCGGTCGA CTCCGCCGAC
GCCGAGGTGG CCCGGCGGGA GCGGGCCCAG AACCTGCGCA CGATCGCCGT TGACGGGCAG
TCCTACCGGC TTGTCACGGT TCCGCTGCAA CGTCCCGCGT CCGGCGGTGC ATTGCAGCTC
GCCCGGCCGA CCACCGACGT CGACCGGACC CTGCGCGACC TCGCGCTGGT GCTGCTTGTC
GTCGGGATCG TCGGGGTGGT CGGCTCGGTG CTCGCCGGAC AGATCGTGGC ACGTGCCGCG
CTCAAACCCG TTGACGCCGC GGCCGCGGCC GCCGAGGAGG TCGCTCGTAC CCAGAATTTG
TCGGCGCTCA TCCCGGTGAC CGGATCCGAC GAGATCACCC GGCTGGCGGA GAGCCTCAAC
AGCATGCTGC GCGCGCTGGA GGCATCTCGG GCCCGCCAGC GCCAGCTCGT CGACGATGCC
AGCCACGAGC TGCGCACCCC GTTGACGAGC CTGCGTACGA ACATCGAACT GTTGCTGCGC
GCCGAGGCGA ACCCGCATCG CGCGCTGCCC GCGGCCGACC ATGAGGCACT GCTACGCGAC
GTGGACGCGC AGATGCGCGA ACTGTCCGGC CTCGTCAGCG AACTGGTTGA GCTCGCCCGG
GACGAGGCGC CCACCGAGGA GGTGGAACGG CTCGATCTCG CGGAGATCGT GCGAGCCGCG
GCCGAACGGG CCCGGCGCCG CGCCACCGGC AAGAGCATCA GCATCGAGCT CGACGCGACT
CCCTCGACGG TCGACGGGCG GGCGAACATG CTCGAACGGG CGATCACCAA CCTGCTCGAC
AACGCCGTGA AGTTCTCGCC GCCCGCCTCC GTCGTCCGGG TCGGCTCCCG GGACGGCGAG
GTAACCGTCG CGGATGACGG ACCGGGCATC GCGCCGGAGG ACCGCGTGCA GGTGTTCGAC
CGCTTCTACC GGGCCACCTC CGCGCGGGGC CTGCCCGGAT CCGGGCTCGG CCTCGCGATC
GTCGCGGATG CCGTGCACAC CCACCGGGGT ACCGTCACCG CGGAGAGTTC CCCGAGTGGC
GGGGCGTTGC TGCGGATGCG GCTGCCGGTC GTTGACGACC CCGGTCCGCT GGGCCCGGCC
ACCACACCGG GTCCGGCGCC GAACCCGCCG GACGGCGCTG CTTCGCCTCC GGACCGCGCG
GGCGAGAATC CCAGGCCGTG A
 
Protein sequence
MGQPVAGHAA VNGRAGRLDG FARRVHPGRH PRIRFSSPLG QLTLRARLAL LVGMAVAAAV 
TVVAGVSLGA TSIVLNQAID DQLVEQAEAS ARTIQTSPLG LELQAFSFGL EGQFLDAAGN
PLENAVSSWN SVRIPVDSAD AEVARRERAQ NLRTIAVDGQ SYRLVTVPLQ RPASGGALQL
ARPTTDVDRT LRDLALVLLV VGIVGVVGSV LAGQIVARAA LKPVDAAAAA AEEVARTQNL
SALIPVTGSD EITRLAESLN SMLRALEASR ARQRQLVDDA SHELRTPLTS LRTNIELLLR
AEANPHRALP AADHEALLRD VDAQMRELSG LVSELVELAR DEAPTEEVER LDLAEIVRAA
AERARRRATG KSISIELDAT PSTVDGRANM LERAITNLLD NAVKFSPPAS VVRVGSRDGE
VTVADDGPGI APEDRVQVFD RFYRATSARG LPGSGLGLAI VADAVHTHRG TVTAESSPSG
GALLRMRLPV VDDPGPLGPA TTPGPAPNPP DGAASPPDRA GENPRP
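
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It strips the 10-base block spacing, computes the GC fraction, and translates codons using the amino-acid assignments of NCBI translation table 11, in which an initiating GTG is read as methionine. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, not part of any database API. The example runs on the first 30 bases only, so its GC value does not represent the whole gene.

```python
# Minimal sketch; all helper names are illustrative assumptions.
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... (NCBI table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))
# Codons that table 11 allows as initiators; as a start they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(seq_text):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq_text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons initiate with Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_blocks = "GTGGGGCAGC CGGTAGCGGG GCATGCGGCG"  # first 30 bases of the gene
print(translate(first_blocks))  # MGQPVAGHAA
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way should reproduce the whole protein, with the terminal TGA ending translation.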