Gene Francci3_3685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3685 
Symbol 
ID3905369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4420977 
End bp4422509 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content73% 
IMG OID637881011 
Producthistidine kinase 
Protein accessionYP_482766 
Protein GI86742366 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGTCAC GCATCTTGCT CGACGCGCTG ACGGACCTGG CTGTGGTCGT CGGCACGGAT 
GGTCGTGTCA CCGACATGGG TGCCGCCGCC CGCGCCTTCC TCGGTGACCG GGTTCCCCGC
GTCCCCCATC TGCGCGATCT TGTCGACGTC GTCGATCCCG CGTTCCGCCT CGCTCTCGGT
GAGGCGGTGA CGGCCGCGCT GGGGCGGACC GGGCAGTGGC GTGGCTCGGT GGGACTCGTC
GACCTGGCCG GGGCCACCGT GCCCCATCAC GTGACGGTCC GGGTCGACCC GGAGGGCGGG
CTGGTCGTCG TCGCTCGGGA CACCAGCGAC CAGCGCGCCC GCGAGGTCGC CGAGAACGAG
TCACGGGCCA AGGACGAACT GATCGCCCGG CTCGGCCACG AACTGCGGAC CCCGCTGAAC
GCCATGCTGG GCTTCGCGCA GCTGCTGGAA CTCGAACCGC TCACCCCCGA CCTCCACGAC
GACGTCGAGC GGATCATCAC CGGTGGCCGC CACATGCAGG CGTTGATCGA TGACGTGCTG
GACCTGGCCA GGCTCCGCGC GGGCCGGGGG GACATCAACA AGGGGCCGGT CAACGTCCTC
GACATCGTCC AGGGCGTGGT GGAGCTCGTC GAACCGCTCG CGGCGCAGCG CAGCATCCGG
CGGGTGATCC ATCCGGCGCT GCCGCTCGTC GCCGACGCGG ACCGCCGCCG CCTGTGGCAG
GTCCTGCTCA ACCTTGTCGG TAACGCGCTG AAGTACGGCC GGGAGGGCGG CAGCGTGCGC
GTCGGCATCG TGCCCATCAC CGGCTCGCGG ATCCGTATCG AGGTCGAGGA CGACGGCGCG
GGGCTCTCCC CCCGGGCGAT CGACCGACTG TTCCGGCCCT TCGAGCGCCT CGGCGCCGAG
CGCAGCGGAA TCGAGGGCAG CGGGCTGGGC CTCGCGCTGT CCCACGCCCT GGTCACGGCG
ATGGGCGGGG TGCTTACCGT CGCCAGCCGG TACGGCGTGG GAAGCGTGTT CGCCGTCGTG
CTCGACGCGG TCGACCTGCG TTTCGAGGAT CTCGATGATG ACAGCGGGTT CGTGGGCAGT
TACGGTGGCC TCCCAGGTCG CTCCGGCGAG TCCGGCACGG TCACCCTCGG CCGTCCTCCC
ACACCCGGCG GCCTGCGGGT CGTGCACGTC AGTGGCGATC CCGCCCTGCG TTCCCTGGTC
GCCGAGACCC TCACCGATTT CCTGGCTGCG GACACCGTGA CCGTTCCCCG CGCCGGTCTC
GCCCTCGACG CGGTGCGGGG GGCCCGGCCG GCGTTCATCC TGCTCGACCG GGACCTGCCC
GACACGACCG CGGCCGAGCT GCTCGCCCAG CTCGCGGCGG ACCCGACCGC CGGCGTCATC
CCGGCCCTGG TCCTCAGCGA GGACACCGAC CCCCGCGAGC GGGCGTACCT GCGCCGAGCC
GGCGCCGTGG ACGTCCTGAA GATCCCGCTG GATCCCGGCG CCCTAGTCGC CGCGGCAGCC
GCGCTCACCA CCGCGATGAC CGCCCCGAGC TGA
 
Protein sequence
MGSRILLDAL TDLAVVVGTD GRVTDMGAAA RAFLGDRVPR VPHLRDLVDV VDPAFRLALG 
EAVTAALGRT GQWRGSVGLV DLAGATVPHH VTVRVDPEGG LVVVARDTSD QRAREVAENE
SRAKDELIAR LGHELRTPLN AMLGFAQLLE LEPLTPDLHD DVERIITGGR HMQALIDDVL
DLARLRAGRG DINKGPVNVL DIVQGVVELV EPLAAQRSIR RVIHPALPLV ADADRRRLWQ
VLLNLVGNAL KYGREGGSVR VGIVPITGSR IRIEVEDDGA GLSPRAIDRL FRPFERLGAE
RSGIEGSGLG LALSHALVTA MGGVLTVASR YGVGSVFAVV LDAVDLRFED LDDDSGFVGS
YGGLPGRSGE SGTVTLGRPP TPGGLRVVHV SGDPALRSLV AETLTDFLAA DTVTVPRAGL
ALDAVRGARP AFILLDRDLP DTTAAELLAQ LAADPTAGVI PALVLSEDTD PRERAYLRRA
GAVDVLKIPL DPGALVAAAA ALTTAMTAPS