Gene Franean1_5470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5470 
Symbol 
ID5673801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6615410 
End bp6617137 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content76% 
IMG OID641244325 
Producthistidine kinase 
Protein accessionYP_001509731 
Protein GI158317223 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC GGGCACGAGA CCTGCTCAAC CGATCCCGCG CGCTCCTGGG TGGCTCCCGC 
GGCATAGCGG CCCAGCTGCG GGGTGGTGAT CTCCTGACTC CCCTGCCCCG GCCGGAGCAG
CCGGCCCCCG CCGGCCTCGT CAGGCGCCCC GCGGCCCATG GCGGAAGGTC CGCGAACGAG
CCGTGGGTTC GGCCGGGAAT CCCCGCGGCG GACGCCGGGG TGAGGCGACC GGCCCGCCGC
GGCGGCACCG GCATCAGCAC CAGCGCCGCG GACCAGTCGG CCCACGCCAC GCGGGCCGAG
ACGTCCCCGG CAGCATCCCC GGGCAACGGG TCGGCCGTGA CGGCGCCGCC CGCGGCCGAG
GCGCTCTCCG CGATCTGCGC GGACGTGGCG CTGCGCGACC TCAACCTGGT CGACTCCCTG
CTCTCGCAGC TTGAGGACAT GGAGGCCAAG GAGGAGGACA CCGACCGCCT CGCCGAGCTG
TACCGGCTGG ACCACCTCGC CACCCGGCTG CGCCGCAACG CCGAGAACCT GCGGGTGCTC
GCGGGCCGTG ACGCCGGCGA CGCCTCGTCC GACACGGCCG CGCTGGTCGA CGTCGTGCGC
GCGGCGATGT CGTCCATCGA CCACTACTCG CGGATCACGA TCGGCCGGGT GGTCCCGCTC
GGTGTGGTCG GCTTCGCCGC CGAGGACGTC GGACGGCTGC TGGCAGAGCT GCTGGACAAC
GCGACCAAGT CGTCTCCCCC GACCGCGCCG GTACGGGTCG GCGTGCACCT CACGGAGCTG
GGCAGCGCGC TGCTGCGGGT CGAGGACGAG GGCATCGGGC TCCCGCCGGA CCGGCTGCGG
CAGCTCAACG AGCGGCTGGC CGGCGATCCC GTCCTGGACG ACGACGCGGT GCGCCACATG
GGCCTGGCCG TGGTGCGCCG GCTCGCGATT CGCCACGACC TGCGGATCTG GCTCGATCGC
CGCCACCCCC ACGGGACGAC CGCCTCGGTG CTCATCCCCT CCCCCCTGAT CTGTGAACTG
CCCGAGGGCA GCTGGTCGGG TACGCAGACC GTGGCGATCC GCGGCGGCGA CCCAGCCGCG
CCGGGTCCGT CCGCGCGTCC CGCCACGCGC GACGGTCAGC CGCGGGCGGA CGTCCAGGGC
CGGGCGCCGG GCGTGAAACG GTCCGGCAAC GGAGCGGGCC ACGTACGCGA ATCCACGCTG
TTCACCCGCG CGGGCAGCGC GTCGGCCAGG CCGGTGCCAG CGCCGAGCAA GGATGCGGCG
ACTCCGGCCA CACCCCCCAC GGCGGACTCC CTGATCGGCG GCACGACCGC GAGCGGTCTC
CCCCGCCGGG TGTCGCGCAG CCTCAAGAAC CCCGCCGGCG ACGGCGCGCG CCCACCCGCT
GCCACGGCAC CGGCAGCCAC CATGCCCGCG GCAGCCACCG CGCCCGCGGC ACCCCCGCCC
GTGGCACCCC CGCCCGTGGC ACCCCCGCCC GTGGCACCCG CCTCCACCGC GGCTGCGGTG
CCCGCAACAC CCGCGGCGGC TCCCGCGCCG TCGACACCCG CCGCGGCGTC CACGGAAGAC
CAGGAAGACG CGACAGGCAC GGCAGGCGTC CCTCCCGCGG AGGAGGGCGG ATCGACGGCC
ACACAGGCGC TTTCGGCCCG GGCCACGGAC CACGCGAGAC TACTTGCGGA TCTCGACGCC
TTCAGCGAAG GCGAACGGAT CGCCCACGAA CACCAGCGAG GGGACTGA
 
Protein sequence
MTTRARDLLN RSRALLGGSR GIAAQLRGGD LLTPLPRPEQ PAPAGLVRRP AAHGGRSANE 
PWVRPGIPAA DAGVRRPARR GGTGISTSAA DQSAHATRAE TSPAASPGNG SAVTAPPAAE
ALSAICADVA LRDLNLVDSL LSQLEDMEAK EEDTDRLAEL YRLDHLATRL RRNAENLRVL
AGRDAGDASS DTAALVDVVR AAMSSIDHYS RITIGRVVPL GVVGFAAEDV GRLLAELLDN
ATKSSPPTAP VRVGVHLTEL GSALLRVEDE GIGLPPDRLR QLNERLAGDP VLDDDAVRHM
GLAVVRRLAI RHDLRIWLDR RHPHGTTASV LIPSPLICEL PEGSWSGTQT VAIRGGDPAA
PGPSARPATR DGQPRADVQG RAPGVKRSGN GAGHVRESTL FTRAGSASAR PVPAPSKDAA
TPATPPTADS LIGGTTASGL PRRVSRSLKN PAGDGARPPA ATAPAATMPA AATAPAAPPP
VAPPPVAPPP VAPASTAAAV PATPAAAPAP STPAAASTED QEDATGTAGV PPAEEGGSTA
TQALSARATD HARLLADLDA FSEGERIAHE HQRGD