Gene Francci3_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0235 
Symbol 
ID3906539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp274454 
End bp275956 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content74% 
IMG OID637877564 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_479353 
Protein GI86738953 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCCGGC CGGTCCTGTC CCTGCGGGCC CGGCTGCTGC TGGCCCTGGT GGGCCTGCTG 
GCCGTCGGGA TGGCTCTCGG CGCCGTCGGC ACGCGAGGGG CTCTGGGGGC CTACCTGCGT
GGCCGGCTGG ACCAGCAGGT CCGGGACGCC CATCCGCTGA TGGAGCAGTT GCTCCTGCGG
GCCGGCGGGG ACGGGGACGG GGATGACCAC TACCCAGGTT TCGGCACCAC GTTCCTGGTG
GGCACCTACG GCGCGTTGTA CGACGGCGCC GGCCGCCGGC TCGCCGAGGC CAGCCCCGGC
CCGGGTGGCC CGGGTGGTCC GGGTGGCCCG GCGGCGCCGA CGCAGCGTCC GGCGGTGTCG
GCACGCCTGC TCGCCACCGC CCGGCTGCAT CCGGACCTCG CGACCGTCCC GCTGTGCACG
GTGGGCGCCG AGGACCATGG CGGCCGGTTC AGGATGCTCG CGGAGCGCTT CGACAACGGG
TATGTGCTGA TCGTCGCCGT GCCCTTCGCG GAGGTGAACG CCACCCTTGA CCGGGTCACC
AGGATCGAGG TCGTGGCGAC CTCGGTGGTC ACGGCCGCGC TGGCCGTGCT GGCATACGTC
ATCATCCGGC TTGGGCTGCG TCCCCTGACC CGGATCGAGC AGACCGCGGA CCTGATCGCC
CACGGCGACC TGACCAGGCG GGTCGCCGAC GCGGACCGCC GGACCGAGGT GGGTCGGCTC
GGGCTCGCCT TCAACGCGAT GCTCACCCGG ATCGAGGCGG CGTTCCGGGC GCGTGAGGTC
TCCGAAGGGC GTCTGCGCCG CTTCGTCGGG GACGCGAGCC ACGAGCTGCG GACCCCGTTG
ACATCGATCC GCGGGTACGC CGAGATGTTC CACCGCGGCG CCGCGGAGCG GCCCGAGGAC
CTGGCGATGG TCATGCGCCG CATCGAGGAG GAGTCCGCGC GCATGAGCGA GCTGGTGGAT
GACCTGCTGC TGCTCGCCCG GCTGGACCAG CGACCGGTGC TGGAGCGCCA GCCGGTCGAC
GTCGCGGCGA TGGTCCGTGA CATCGTCACC GACGCCCGTG TGGTGAGCCC GGGACGGACG
ATCGAGGTCG ACGTGCCGCC GATCCTCGAG GTGCTCGGCG ATGAGGGCCG GCTGCGGCAG
GCCGTCGGCA ACCTCGTCCG CAACGCCGTC GTCCACACCC CGCCCGACGC CGGGATCTCC
GTCTCCGTGG GCCCCCTCGA AGCCGGCTCC CCGGGGCCGA CGGGCGATGG TCCCACCGAC
GGGATCGTGG TCTCGGTGGT CGACCACGGC CCGGGCGTCC CCGAGGATGC CGTGGCCCAC
CTCTTCGAAC GCTTCTTCCG GGCCGATGCC GGGCGGTCCC GGGACGCCGG CGGGACCGGT
CTGGGCCTGT CCATCGTCGA CGCCGTCGCC ACCGCGCACG GCGGGCGGGT CGAGTACCGG
CCGACCCCGG GCGGCGGGGC GACGTTCCGT CTCGTCCTGC CCGGTCCGTC GCAGCCCGAC
TGA
 
Protein sequence
MSRPVLSLRA RLLLALVGLL AVGMALGAVG TRGALGAYLR GRLDQQVRDA HPLMEQLLLR 
AGGDGDGDDH YPGFGTTFLV GTYGALYDGA GRRLAEASPG PGGPGGPGGP AAPTQRPAVS
ARLLATARLH PDLATVPLCT VGAEDHGGRF RMLAERFDNG YVLIVAVPFA EVNATLDRVT
RIEVVATSVV TAALAVLAYV IIRLGLRPLT RIEQTADLIA HGDLTRRVAD ADRRTEVGRL
GLAFNAMLTR IEAAFRAREV SEGRLRRFVG DASHELRTPL TSIRGYAEMF HRGAAERPED
LAMVMRRIEE ESARMSELVD DLLLLARLDQ RPVLERQPVD VAAMVRDIVT DARVVSPGRT
IEVDVPPILE VLGDEGRLRQ AVGNLVRNAV VHTPPDAGIS VSVGPLEAGS PGPTGDGPTD
GIVVSVVDHG PGVPEDAVAH LFERFFRADA GRSRDAGGTG LGLSIVDAVA TAHGGRVEYR
PTPGGGATFR LVLPGPSQPD