Gene Franean1_3845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3845 
Symbol 
ID5672208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4568087 
End bp4569265 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content76% 
IMG OID641242723 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001508143 
Protein GI158315635 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.5258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.79155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCC GGCATGATGC CCGGGTTGGT TACCGTGGCG GGGTGCGCGA TCTCGTCCGT 
CCGCTGTGGG AGGAGCCGCG TCCGGCGCGC CCGTCGACAC CGGGCCGGCG GGACTGGCCG
CTGGCCGCCG CGCTCGCGGT CGCCGCGCTC ACCGAGGGGT TGCTGCGGGC GGACCTTCCG
TGGCGGACCT GGTCGGTGCT GCTGGCACTC GCCCTCGTCC CGACCCTGCT GTGGCGCCGG
GCGAGGCCGC TGGCAGCGGT CACGATCGCC TTCGGTGCCT CGGCTGTGGC CTCGGTGCTC
ACCGGCGGCG GCGTCCCCCA GCTGAACAGC ATGATCTTCA TGCTGCTGTT GCCGTACTCG
CTGCTGCGGT GGGGATCGGG CCGGGAGGCC GTGACCGGCG CGGCGGTCGT GCTCGCCGCC
GCCGTCTGCA TGATCTCCGC CGCCACCGGT GCCGCCGACG CCGTCGGCGG CGCGGCCGTA
CTGCTCGCCT CGTTCGCGCT GGGCGGGGCG TTCCGGTACC GGGCCGGGGC CCGGCTGCGC
GAGCTGGAGC AGGCCAAGCT GCTCGAACGG GAGCGGCTGG CCCGGGATCT CCACGACACC
GTCGCCCACC ACGTCTCGGC GATCGCGATC CGGGCCCAGG CGGGCATCGC CACCGCGCCG
TCGAGCCCGG CCGCCGCCGC CGAGGCACTG CGGGTGATCG AGCTCGAGGC GTCGCGCACC
CTGGCCGAGA TGCGGGCCAT GGTCCGGGTA CTGCGCCGTG ACGAGCCGGC GGAGCTGGCG
CCGAACCCGA CCGTCGCCGA TCTCGAACGG CTCGCCGGGC AGGCCCGCTC CGGCCCGGCG
GTGCGGGTGC GGATCGTCGG CGAGGTGGGG GATCTCCCGC CGTCGGTCGG GTCCGCGATC
TACCGGCTCG CGCAGGAGTC GATCACCAAC GCCCGCCGGC ACGCGCGGCA CGCGAACCAC
GTCGAGGTCG TGGTGTCCGC CGATGACGCG TGCGTGCGGC TGTCCGTGCG CGACGACGGC
GACACCGCCG CCCTGCACCC GCCGCCGTCG CCGGGCTACG GGCTCACCGG GATGATCGAG
CGTGCGCGCC TGCTCGGCGG CACCTGCGAG GCCGGCCCCG CCCTCGACCG GGGCTGGACG
GTGACCGCCA CCCTGCCCCG GGCCGGGTGG GCGACGTGA
 
Protein sequence
MTARHDARVG YRGGVRDLVR PLWEEPRPAR PSTPGRRDWP LAAALAVAAL TEGLLRADLP 
WRTWSVLLAL ALVPTLLWRR ARPLAAVTIA FGASAVASVL TGGGVPQLNS MIFMLLLPYS
LLRWGSGREA VTGAAVVLAA AVCMISAATG AADAVGGAAV LLASFALGGA FRYRAGARLR
ELEQAKLLER ERLARDLHDT VAHHVSAIAI RAQAGIATAP SSPAAAAEAL RVIELEASRT
LAEMRAMVRV LRRDEPAELA PNPTVADLER LAGQARSGPA VRVRIVGEVG DLPPSVGSAI
YRLAQESITN ARRHARHANH VEVVVSADDA CVRLSVRDDG DTAALHPPPS PGYGLTGMIE
RARLLGGTCE AGPALDRGWT VTATLPRAGW AT