Gene Franean1_4386 details


Gene Information

Locus tag: Franean1_4386
Symbol:
ID: 5672739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5233400
End bp: 5234635
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 65%
IMG OID: 641243255
Product: histidine kinase
Protein accession: YP_001508672
Protein GI: 158316164
COG category: [K] Transcription
              [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID:
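
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count of an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(5233400, 5234635)
print(length_bp, protein_length(length_bp))  # prints: 1236 411
```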


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAGCG ACCCGAGGCA GCCGAGCGGA AATCTCGACG CGGCTGTTCT CGAAAACGGG 
GGGTTCGGTG GCGGTAGCCG CCGCCGGTAC CGGCCCCGCA CTCGCGCCGA CACCGCGCTC
GGCGGGCAGC GGGTCATCGT CGAGCTGCTG GTCCGCCAGT TCCGCTGTGA GCAGCCGGGC
TGTGCGCGGA TGTTGTCGCA GAACCCGGCC CGCAACCTCA CAGCAAAACA GGTCGAGTAC
GCGCATGTCA TCCATTCCGC CGGCGCCGAT CTGCTGCAGC TAGTCAACGA CATCCTGGAC
CTGTCCAAAG TCGAGGCCGG CAAGATGGAA ATTCACATGG AGCACGTCTC CCTGCGTGCT
CTGCTGGAGG ACCTCCGGGC TACCTTCCAC CCCATCACGG AGGAGAATGG TCTCGAGTTC
ACCGTGGACG TCGCGCCCGA CGCGCCGACC GAACTGTTCA CCGACTCCCA ACGCGTGTCA
CAGGTACTGC GCAACCTGCT GTCGAACGCG GTGAAGTTCA CCGAGCAGGG CTGCGTGGAG
CTGCGGATAC GGACGACCGA AGGCCCGGAC GGGGCCGCGG GGCCCCGCAA GACGGTCGCG
TTCTCGGTCG TCGACACCGG CGTCGGAATC GCGGACGACG ACCTGGACCG GCTCTTCGAG
GCCTTCCAGC AAGGGGAAGG CCCCACCAAC CGCAGATACG GCGGCACCGG TCTGGGTCTG
TCCATATCCC GCGAGGTCGC GGCGCTGCTC GATGGCGAGC TCCACCTGTC CACCGCCAAC
CTCGACGAGG AACCGGCCCT GACGAGAGAC GTGACTGAGC CGGTGGAGCG ATCACCCATC
CCCCAGGCCC CTGCGCATGA AGAGCTACAC GGCCGCAAGA CCCTGGTGAT CGACGATGAC
GTGCGCAACA TCTTCGCCAT CACCAGCGTC CTCGAGCTCT ACGGCATCAC CGTGATCTAC
GCATCCGACG GGCGGGAAGG CATCGACACC CTGCTCGCCA CCGCTGACGT AGACATCGTC
CTCGTCGACG TGATGATGCC GGAAATGGAC GGGTACGCCA CCATGACGGC CATCCGCCAG
ATCCCCCAGT TCGCCACGAT TCCGGTCATC GCGGTAACCG CCAAGGCCAT GCCGCATGAC
CGGGAGAAAT GCCTCGCCGC AGGTGCCACC GACTACGTCA CGAAACCCGT CGACACCGAA
GAACTCCTCA TCCGGATGGA ACGACAGATC ACCTGA
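
The GC content reported for this gene is the fraction of G and C bases in the sequence above. A minimal sketch of that calculation follows, assuming the sequence is pasted as text with the 10-base blocks and line breaks shown here; `gc_content` is an illustrative helper, not the tool the database uses.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces and newlines in the listing."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Toy input for illustration only (not the gene above): 6 of 8 bases are G or C.
print(f"{gc_content('ATGC GGCC'):.0%}")  # prints: 75%
```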
 
Protein sequence
MSSDPRQPSG NLDAAVLENG GFGGGSRRRY RPRTRADTAL GGQRVIVELL VRQFRCEQPG 
CARMLSQNPA RNLTAKQVEY AHVIHSAGAD LLQLVNDILD LSKVEAGKME IHMEHVSLRA
LLEDLRATFH PITEENGLEF TVDVAPDAPT ELFTDSQRVS QVLRNLLSNA VKFTEQGCVE
LRIRTTEGPD GAAGPRKTVA FSVVDTGVGI ADDDLDRLFE AFQQGEGPTN RRYGGTGLGL
SISREVAALL DGELHLSTAN LDEEPALTRD VTEPVERSPI PQAPAHEELH GRKTLVIDDD
VRNIFAITSV LELYGITVIY ASDGREGIDT LLATADVDIV LVDVMMPEMD GYATMTAIRQ
IPQFATIPVI AVTAKAMPHD REKCLAAGAT DYVTKPVDTE ELLIRMERQI T
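
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce it, under the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may act as start codons). The helper names are hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace in the input."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X for unknown codons
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCAAGCG ACCCGAGGCA GCCGAGCGGA"))  # prints: MSSDPRQPSG
```

Passing the full gene sequence to `translate` should give the 411-residue protein, since the final codon TGA is a stop codon and is dropped when `to_stop` is true.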