Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5470 |
Symbol | |
ID | 5673801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6615410 |
End bp | 6617137 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244325 |
Product | histidine kinase |
Protein accession | YP_001509731 |
Protein GI | 158317223 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC GGGCACGAGA CCTGCTCAAC CGATCCCGCG CGCTCCTGGG TGGCTCCCGC GGCATAGCGG CCCAGCTGCG GGGTGGTGAT CTCCTGACTC CCCTGCCCCG GCCGGAGCAG CCGGCCCCCG CCGGCCTCGT CAGGCGCCCC GCGGCCCATG GCGGAAGGTC CGCGAACGAG CCGTGGGTTC GGCCGGGAAT CCCCGCGGCG GACGCCGGGG TGAGGCGACC GGCCCGCCGC GGCGGCACCG GCATCAGCAC CAGCGCCGCG GACCAGTCGG CCCACGCCAC GCGGGCCGAG ACGTCCCCGG CAGCATCCCC GGGCAACGGG TCGGCCGTGA CGGCGCCGCC CGCGGCCGAG GCGCTCTCCG CGATCTGCGC GGACGTGGCG CTGCGCGACC TCAACCTGGT CGACTCCCTG CTCTCGCAGC TTGAGGACAT GGAGGCCAAG GAGGAGGACA CCGACCGCCT CGCCGAGCTG TACCGGCTGG ACCACCTCGC CACCCGGCTG CGCCGCAACG CCGAGAACCT GCGGGTGCTC GCGGGCCGTG ACGCCGGCGA CGCCTCGTCC GACACGGCCG CGCTGGTCGA CGTCGTGCGC GCGGCGATGT CGTCCATCGA CCACTACTCG CGGATCACGA TCGGCCGGGT GGTCCCGCTC GGTGTGGTCG GCTTCGCCGC CGAGGACGTC GGACGGCTGC TGGCAGAGCT GCTGGACAAC GCGACCAAGT CGTCTCCCCC GACCGCGCCG GTACGGGTCG GCGTGCACCT CACGGAGCTG GGCAGCGCGC TGCTGCGGGT CGAGGACGAG GGCATCGGGC TCCCGCCGGA CCGGCTGCGG CAGCTCAACG AGCGGCTGGC CGGCGATCCC GTCCTGGACG ACGACGCGGT GCGCCACATG GGCCTGGCCG TGGTGCGCCG GCTCGCGATT CGCCACGACC TGCGGATCTG GCTCGATCGC CGCCACCCCC ACGGGACGAC CGCCTCGGTG CTCATCCCCT CCCCCCTGAT CTGTGAACTG CCCGAGGGCA GCTGGTCGGG TACGCAGACC GTGGCGATCC GCGGCGGCGA CCCAGCCGCG CCGGGTCCGT CCGCGCGTCC CGCCACGCGC GACGGTCAGC CGCGGGCGGA CGTCCAGGGC CGGGCGCCGG GCGTGAAACG GTCCGGCAAC GGAGCGGGCC ACGTACGCGA ATCCACGCTG TTCACCCGCG CGGGCAGCGC GTCGGCCAGG CCGGTGCCAG CGCCGAGCAA GGATGCGGCG ACTCCGGCCA CACCCCCCAC GGCGGACTCC CTGATCGGCG GCACGACCGC GAGCGGTCTC CCCCGCCGGG TGTCGCGCAG CCTCAAGAAC CCCGCCGGCG ACGGCGCGCG CCCACCCGCT GCCACGGCAC CGGCAGCCAC CATGCCCGCG GCAGCCACCG CGCCCGCGGC ACCCCCGCCC GTGGCACCCC CGCCCGTGGC ACCCCCGCCC GTGGCACCCG CCTCCACCGC GGCTGCGGTG CCCGCAACAC CCGCGGCGGC TCCCGCGCCG TCGACACCCG CCGCGGCGTC CACGGAAGAC CAGGAAGACG CGACAGGCAC GGCAGGCGTC CCTCCCGCGG AGGAGGGCGG ATCGACGGCC ACACAGGCGC TTTCGGCCCG GGCCACGGAC CACGCGAGAC TACTTGCGGA TCTCGACGCC TTCAGCGAAG GCGAACGGAT CGCCCACGAA CACCAGCGAG GGGACTGA
|
Protein sequence | MTTRARDLLN RSRALLGGSR GIAAQLRGGD LLTPLPRPEQ PAPAGLVRRP AAHGGRSANE PWVRPGIPAA DAGVRRPARR GGTGISTSAA DQSAHATRAE TSPAASPGNG SAVTAPPAAE ALSAICADVA LRDLNLVDSL LSQLEDMEAK EEDTDRLAEL YRLDHLATRL RRNAENLRVL AGRDAGDASS DTAALVDVVR AAMSSIDHYS RITIGRVVPL GVVGFAAEDV GRLLAELLDN ATKSSPPTAP VRVGVHLTEL GSALLRVEDE GIGLPPDRLR QLNERLAGDP VLDDDAVRHM GLAVVRRLAI RHDLRIWLDR RHPHGTTASV LIPSPLICEL PEGSWSGTQT VAIRGGDPAA PGPSARPATR DGQPRADVQG RAPGVKRSGN GAGHVRESTL FTRAGSASAR PVPAPSKDAA TPATPPTADS LIGGTTASGL PRRVSRSLKN PAGDGARPPA ATAPAATMPA AATAPAAPPP VAPPPVAPPP VAPASTAAAV PATPAAAPAP STPAAASTED QEDATGTAGV PPAEEGGSTA TQALSARATD HARLLADLDA FSEGERIAHE HQRGD
|
| |