Gene Franean1_6146 details

Gene Information

Locus tag: Franean1_6146
Symbol:
ID: 5674467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7477811
End bp: 7479208
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 73%
IMG OID: 641244998
Product: histidine kinase
Protein accession: YP_001510396
Protein GI: 158317888
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID:
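
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record itself. The function names are hypothetical and exist only for this illustration.

```python
# Values copied from the record above.
START_BP = 7477811
END_BP = 7479208
GENE_LENGTH_BP = 1398
PROTEIN_LENGTH_AA = 465


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of residue-encoding codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1398, matches the Gene Length field
print(coding_codons(GENE_LENGTH_BP))   # 465, matches the Protein Length field
```

Under those assumptions both derived values agree with the reported gene length (1398 bp) and protein length (465 aa).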


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.854948
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGGC GGCCTGACTC GGCTGATCGG GGCGGTGCGA CGGACGCCGG CCCCCGCGGC 
GCGGCGGCGG CCGCCACCGG CCCGCACGCG CTGCCCGCCC ACCTCGTCCG ACAGGTGATC
GCCGGCCTGC CCTCCGGGCT GGTGGTGGTG AACGCCGAGG ACGTGGTCGT GCTGGTCAAC
TCCGCCGCCC GCCGGATGGG TGTCGTCGAC GGTGACGAAC TCAGCGTGGG CGAGGTGGCG
GATCTGGTGA AGGCCTGCCG GTCCGCCGCG GCCGAGCGCA ACCGCCAGCT GGACCTGCCG
CCGGTGCCCG AGCCACCGCT GACCAGGGCC CGTCCGGACC AGACCGCGCT GGCCGTCTGG
GCATACGCGC GCCCGCTCGG TGACAGCGGG TACGTCTCGG TCCTGCTCGA CGACATCACC
GACTCGCGGC GGGTCGAGGC CGTCCGGCGC GATTTCGTCG CCAACGTCAG CCACGAGCTC
AAGACGCCCG TCGGGGCGCT GCACGTGCTC GCGGAGGCGG TCAGCGAGGC CAGCGAGGAC
CCGGTGGCCG TGCGCCGTTT CGCGTCGCGG ATGACGCACG AGTCCACCCG GCTGGCCCGG
CTGGTCCAGG AGATCATCGA CCTCTCCCGG CTGCAGGGCG CCGAGCCGAT GCCCGACCTG
GCCCCGGTGC CGGTCTCGGT GGTGCTCGCG GAGGCCGTCG ACCGGAACCG GTTCAGCGCG
CAGGCGCAGG AGATCTCGGT CGCCGTCATC GGCGCTGGCG GGCTGCGGGT CCGCGGCGAC
GAGAACCAGC TGGTCACCGC CGTGGCGAAC CTGCTGGACA ACGCCATCAG CTACTCGCCG
CGCGGCACAC GGGTCGTGCT GGGAGTCCGC CAGGTCGGCG GAACAATAGA GATATCAGTC
GCGGACGAGG GAATCGGCAT CGCCGAGAAG GACCTGGAAC GGGTCTTCGA GCGTTTCTAC
CGCGCCGACC CGGCGCGTTC CCGGGCGACC GGGGGAACTG GTCTCGGCCT GGCTATCGTG
AAGCACATCG CGACCAACCA CGGCGGTTCG GTGAGCGTGT GGAGCGCGGA GGGTCGGGGC
TCGACGTTCA CGCTGCGCCT GCCCTCCGGT ACCGACGATC CCAAGGCGGC CGGGTCCGAA
ACAGGCCGCT CCGAAACAGG CCGTGCGGGG AAGTCCGGCG GCACGGTCGA GCCTGGTGGC
GCGGCGCCGT CGCGGCCGAG CGGTGTCGTT CCGTTGCGGC CGCCAGTCGA TGCACCCGAT
CTCGCCGATT CCGCGTCGCG GGGCCCGGTC CTGGCGGAAA CTGTGCACGA ATCAACTGGT
CGCACCCGGC CGGCGGCGGC CGCTCCGGTC GCGTCCGAGA ATTCCTCCGG AACCGGCCCC
GGCGGCCCCG GAGCATGA
 
Protein sequence
MARRPDSADR GGATDAGPRG AAAAATGPHA LPAHLVRQVI AGLPSGLVVV NAEDVVVLVN 
SAARRMGVVD GDELSVGEVA DLVKACRSAA AERNRQLDLP PVPEPPLTRA RPDQTALAVW
AYARPLGDSG YVSVLLDDIT DSRRVEAVRR DFVANVSHEL KTPVGALHVL AEAVSEASED
PVAVRRFASR MTHESTRLAR LVQEIIDLSR LQGAEPMPDL APVPVSVVLA EAVDRNRFSA
QAQEISVAVI GAGGLRVRGD ENQLVTAVAN LLDNAISYSP RGTRVVLGVR QVGGTIEISV
ADEGIGIAEK DLERVFERFY RADPARSRAT GGTGLGLAIV KHIATNHGGS VSVWSAEGRG
STFTLRLPSG TDDPKAAGSE TGRSETGRAG KSGGTVEPGG AAPSRPSGVV PLRPPVDAPD
LADSASRGPV LAETVHESTG RTRPAAAAPV ASENSSGTGP GGPGA
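
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a minimal sketch, not the database's own pipeline. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for non-initiator codons, and it ignores table 11's alternative start codons, which is safe here because the gene begins with ATG. The `translate` helper is hypothetical; the two test fragments are the first 30 and last 18 nucleotides of the gene sequence above.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the gene sequence.
print(translate("ATGGCACGGC GGCCTGACTC GGCTGATCGG"))  # MARRPDSADR
# Last 18 nucleotides, ending in the TGA stop codon.
print(translate("GGCGGCCCCG GAGCATGA"))  # GGPGA
```

Both fragments agree with the corresponding ends of the protein sequence: it starts with MARRPDSADR and ends with GGPGA.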