Gene Information |
Locus tag | Franean1_4386 |
Symbol | |
ID | 5672739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5233400 |
End bp | 5234635 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641243255 |
Product | histidine kinase |
Protein accession | YP_001508672 |
Protein GI | 158316164 |
COG category | [K] Transcription; [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
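The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length does not count the stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 5233400
end_bp = 5234635

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# as an amino acid in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields.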
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCAAGCG ACCCGAGGCA GCCGAGCGGA AATCTCGACG CGGCTGTTCT CGAAAACGGG GGGTTCGGTG GCGGTAGCCG CCGCCGGTAC CGGCCCCGCA CTCGCGCCGA CACCGCGCTC GGCGGGCAGC GGGTCATCGT CGAGCTGCTG GTCCGCCAGT TCCGCTGTGA GCAGCCGGGC TGTGCGCGGA TGTTGTCGCA GAACCCGGCC CGCAACCTCA CAGCAAAACA GGTCGAGTAC GCGCATGTCA TCCATTCCGC CGGCGCCGAT CTGCTGCAGC TAGTCAACGA CATCCTGGAC CTGTCCAAAG TCGAGGCCGG CAAGATGGAA ATTCACATGG AGCACGTCTC CCTGCGTGCT CTGCTGGAGG ACCTCCGGGC TACCTTCCAC CCCATCACGG AGGAGAATGG TCTCGAGTTC ACCGTGGACG TCGCGCCCGA CGCGCCGACC GAACTGTTCA CCGACTCCCA ACGCGTGTCA CAGGTACTGC GCAACCTGCT GTCGAACGCG GTGAAGTTCA CCGAGCAGGG CTGCGTGGAG CTGCGGATAC GGACGACCGA AGGCCCGGAC GGGGCCGCGG GGCCCCGCAA GACGGTCGCG TTCTCGGTCG TCGACACCGG CGTCGGAATC GCGGACGACG ACCTGGACCG GCTCTTCGAG GCCTTCCAGC AAGGGGAAGG CCCCACCAAC CGCAGATACG GCGGCACCGG TCTGGGTCTG TCCATATCCC GCGAGGTCGC GGCGCTGCTC GATGGCGAGC TCCACCTGTC CACCGCCAAC CTCGACGAGG AACCGGCCCT GACGAGAGAC GTGACTGAGC CGGTGGAGCG ATCACCCATC CCCCAGGCCC CTGCGCATGA AGAGCTACAC GGCCGCAAGA CCCTGGTGAT CGACGATGAC GTGCGCAACA TCTTCGCCAT CACCAGCGTC CTCGAGCTCT ACGGCATCAC CGTGATCTAC GCATCCGACG GGCGGGAAGG CATCGACACC CTGCTCGCCA CCGCTGACGT AGACATCGTC CTCGTCGACG TGATGATGCC GGAAATGGAC GGGTACGCCA CCATGACGGC CATCCGCCAG ATCCCCCAGT TCGCCACGAT TCCGGTCATC GCGGTAACCG CCAAGGCCAT GCCGCATGAC CGGGAGAAAT GCCTCGCCGC AGGTGCCACC GACTACGTCA CGAAACCCGT CGACACCGAA GAACTCCTCA TCCGGATGGA ACGACAGATC ACCTGA
Protein sequence | MSSDPRQPSG NLDAAVLENG GFGGGSRRRY RPRTRADTAL GGQRVIVELL VRQFRCEQPG CARMLSQNPA RNLTAKQVEY AHVIHSAGAD LLQLVNDILD LSKVEAGKME IHMEHVSLRA LLEDLRATFH PITEENGLEF TVDVAPDAPT ELFTDSQRVS QVLRNLLSNA VKFTEQGCVE LRIRTTEGPD GAAGPRKTVA FSVVDTGVGI ADDDLDRLFE AFQQGEGPTN RRYGGTGLGL SISREVAALL DGELHLSTAN LDEEPALTRD VTEPVERSPI PQAPAHEELH GRKTLVIDDD VRNIFAITSV LELYGITVIY ASDGREGIDT LLATADVDIV LVDVMMPEMD GYATMTAIRQ IPQFATIPVI AVTAKAMPHD REKCLAAGAT DYVTKPVDTE ELLIRMERQI T
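The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids to codons as the standard genetic code, so the standard codon table is used here. The function names `clean`, `gc_content`, and `translate` are hypothetical helpers, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. The amino acid
# assignments are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
assert len(AMINO_ACIDS) == 64
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field.
prefix = "ATGTCAAGCG ACCCGAGGCA GCCGAGCGGA"
print(translate(prefix))  # MSSDPRQPSG
```

The output matches the first block of the Protein sequence field. Passing the complete Gene sequence field to `translate` and `gc_content` would allow the full protein and the GC content field to be checked in the same way.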