Gene Franean1_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1046 
Symbol 
ID5669460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1226212 
End bp1227681 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content74% 
IMG OID641239975 
Producthistidine kinase 
Protein accessionYP_001505408 
Protein GI158312900 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00874969 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGCTGA GTACCCTGCT CGACACGCTG ACCGACCTGG TGGTGGTCGT CGGCGGTGAC 
GGGCGTGTCC TCGAACTCGG CGCGGCCGCG CGCGCGTTCG TGGGAGACCG GATCGGCACG
GCGCCGCACC TGCACGACCT GTCCGACGTG GTCGATCCCG CGTGCCGGGT CGAGCTCGCC
GGCCCGGTGT CGCAGGCGCT GGCCCGGACG GGCGTGTGGC GGGGCACCGT CTCGCTGATC
GACCTGGCCG GCACGACCGC GCCGTACACG GTCACGGCGC GGCTGGACCC GGCCGGCGGG
ATGGTCATCG TCGCCCGCGA CACCGTCGAC GCGCGTGCCC GCCGGGCCGC CGAGACCGAG
TCGCGGTCCA AGGACGAGTT CGTCGCGCGG CTCGGCCACG AGCTGCGCAC ACCGCTGAAC
GCGATGCTCG GGTTCGCCCA GCTCCTCGAG CTCGAACCGC TCACGCCCGA CCTGCACGAG
GACGTCGAGC GCATCATCAC CGGCGGCCGC CACATGCAGG CCCTGATCGA CGACGTCCTC
GACCTGGCGC GGCTGCGCGC CGGGCGCGGT GACATCAACA AGGGGCCGGT GAACGTGCTG
GACATCGTCC AGGGCGTGGT CGAGCTGGTC GAGCCGCTGG CCGAGCGCCG CGGGATCCGG
CGGATCATCC ATCCGGCCCG GCCGCTCATC GCCGACGCGG ACCGCCGCCG GCTGTGGCAG
GTCCTGCTGA ACCTGGTCGG CAACGCGCTG AAGTACGGCC GGGAGGGCGG CAACGTCCGG
GTCGGCGTGG TCCCGGTCAC CGGTTCGCGG ATCCGCATCG AGGTGGAGGA CGACGGGATC
GGGCTGTCCC CGTCCGCGCT GTCCCGGCTG TTCCGGCCGT TCGAGCGCCT CGGGGCCGAG
GACACCGGAG TCGAGGGCAG CGGCCTCGGC CTGGCCCTCT CCCACGCCCT TGTGACGGCG
ATGGGCGGTG TTCTGACGGT TGCCAGCAGG CCCGGGGTGG GCAGCGTCTT CGCGGTCGTC
CTGGACGCCG TGGACGTCCG TTACGACCAC ACCGAGGAGG ACCCGGACGA CTTCGTGGGC
GAAGGGCAGC TCGAGCCGGG TGGCCTTCGC GTGGTCCATG TCAGCGGCGA CCCGGGGCTG
CGCGCCCTGG TCAGCGACAC CCTGGCGAGC GCGCTCGGGG CGGACACCAT CAGCGTGCCG
CGCGGGGCCC TCGCGGTCGA GGCCATCCGG CGGGCCCGCC CGTCGCTGGT CCTCCTCGAC
CGCGACCTGC CGGATGTGAC CGCGGTCGAC CTCCTCCCCC GGCTGGCCGC GGACCCGGTG
GCGGGCACCG TCCCCGCCCT GGTGCTGACC ACCGAGGTGG ACCCGGACGA GTGGACCCGC
CTGCGCCAGG CGGGCGCCGT CCAGGTCCTC GGCATGCCGC TGGACGTCAC CTCGCTGATC
GCCGCCACCT CGCAGCTCAC CACCGCCTGA
 
Protein sequence
MGLSTLLDTL TDLVVVVGGD GRVLELGAAA RAFVGDRIGT APHLHDLSDV VDPACRVELA 
GPVSQALART GVWRGTVSLI DLAGTTAPYT VTARLDPAGG MVIVARDTVD ARARRAAETE
SRSKDEFVAR LGHELRTPLN AMLGFAQLLE LEPLTPDLHE DVERIITGGR HMQALIDDVL
DLARLRAGRG DINKGPVNVL DIVQGVVELV EPLAERRGIR RIIHPARPLI ADADRRRLWQ
VLLNLVGNAL KYGREGGNVR VGVVPVTGSR IRIEVEDDGI GLSPSALSRL FRPFERLGAE
DTGVEGSGLG LALSHALVTA MGGVLTVASR PGVGSVFAVV LDAVDVRYDH TEEDPDDFVG
EGQLEPGGLR VVHVSGDPGL RALVSDTLAS ALGADTISVP RGALAVEAIR RARPSLVLLD
RDLPDVTAVD LLPRLAADPV AGTVPALVLT TEVDPDEWTR LRQAGAVQVL GMPLDVTSLI
AATSQLTTA