Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_4838 |
Symbol | |
ID | 6131023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 5314447 |
End bp | 5317254 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641644975 |
Product | CheA signal transduction histidine kinase |
Protein accession | YP_001771602 |
Protein GI | 170742947 |
COG category | [K] Transcription [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.552774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00614515 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACGATT TGCTGCGTGA ATTCCTGACC GAGACCGGCG AACACCTGGA TACGGTCGAC CTGGAGCTCG TCCGCTTCGA GCAGGATCCG AACAACCAGA CGATCCTCCG GAACATCTTC CGGCTCGTGC ATACCATCAA GGGCACCTGC GGCTTCCTCG GACTGCCGCG CCTCGAGGCG CTGGCCCACG CGGCCGAGAC GCTGATGGGC AAGTTCCGCG ACGGGATGGG CGTCACGAGC GAGGCGGTGA CGCTGATCCT GCAGACGCTC GACCGCATCA AGGTGATCGT CGCGGAGCTG GAGCGCACGG CGGCCGAGCC CGTGGGCGCC GACGAGGACC TGATCGGCGC CCTGGAGCGC ATGGCCGAGG GCGAGGCCGC TCCCGCCCCG GCGCCCGCCC CGCCGCCGCC GCCCGTCCCG ATGACCACCG GCTCCCTGGT GTCCCAGACG CTGGAGCGGG CGCTCAAGGC CGACGAGGTC TCCCTGGACG ACCTGGAGCG CGCCTTCCGC GACACGCCCG GGCCCGAGGC CTCGCCCGCG CCCATGACCA CGGGCTCGCT GGTGCCGCAG ACGCTGGAGC GGGCCCTCAA GCCGGGCGAG GTCTCCCTGG ACGACCTGGA GCGCGCCTTC CGCGACACGC CCGGGCCCTC CGCGGCGCCC GCCAAGGCGG TGAGCGCGCC GAAGGCCGCG CCCGCTCCGG CCGAGGCGCC CCGCGCCGAC GCCGCGCCCG AGACGGACGG CGCCGCCATC AACAAGGTGC AGACGATCCG GGTGAACGTC GACACCCTCG AGCACCTGAT GACGATGGTC TCGGAACTGG TCCTGACCCG CAATCAGCTC CTCGAGATCG CCCGCCGGCA GGAGGACAAC AGCTACAAGG TGCCGCTGCA GCGGCTCTCG AACGTCACCG CCGAGCTGCA GGAAGGGGTC ATGAAGACCC GCATGCAGCC GATCGGCTCG GCGTGGCAGA AGCTGCCGCG GGTGGTGCGC GACCTCTCGT CGGAACTCGG CAAGAAGATC GACCTGGTGA TGCAGGGCGC CGAGACGGAA CTCGACCGTC AGGTGCTGGA GGTCATCAAG GACCCGCTCA CCCACATGGT GCGCAACTCC GCCGACCACG GCATCGAGTC GGCGCTGGAG CGCAAGGCCG CGGGCAAGCC CGAGAAGGGG ACGATCCGCC TCAACGCCTT CCACGAGGGC GGCACGATCA CGATCGAGAT CTCGGACGAC GGCAAGGGGC TCGACCTCGC CACGATCCGC CGCAAGGCCG TGGAGCGGGG CATCGCCACC GAGGCCGAGG TCGAGCGGAT GACCGACGCG CAGGTCGCGA AGTTCATCTT CCACGCCGGC TTCTCGACCG CCAAGGCCGT CACCTCGGTC TCGGGCCGCG GCGTCGGCAT GGACGTGGTC AAGACCAACA TCGAGCTGAT CGGCGGCACC GTCGACATCC GCACCCAGTT CGGCCAGGGC ACCACCTTCA CGATCAAGAT CCCGCTGACG CTCGCCATCG TCGCGGCGCT GATCGTCTCG GCCCGCGACC ACCGCTTCGC GATCCCGCAG GTCTCGGTGC TCGAACTCGT GCGGGTGCAG CCGGGCAGCG ACCACCAGGT CGAGCGCATC AACGGCTCGC CGGTGCTGCG CCTGCGCGAC CGCCTGCTGC CGATCGTGCC GATCGCCGCC ATGCTCGGCC TGGACAAGGA TCCCACCACC GCCTCCTCCG ACGAGGGCTT CGTGGTGGTG AGCCAGGTCG GGCGCCAGCG CTTCGGCATC CTGGTGGACG GCGTCTTCCA CACGGAAGAG ATCGTCGTGA AGCCGATGTC GACGAAGCTG CGGCACATCC CGCTCTTCTC CGGCAACACG ATCCTGGGCG ACGGCGCGGT CGTGCTGATC ATCGACCCGA ACGGGGTCGC CCGCATGGTC GGTTCCGGCA CGGCGACCGG GCAGCCCGCG GAGGCGGATG CCGAGGCCGA GGAGGCGGCG GCGTCGGCCG ATCAGACCGT CACGCTGCTG GTGTTCAAGG GCGGCGGCGA CGCGCTGAAG GCGGTGCCCC TGTCGCTGGT CACGCGCCTG GAGGAGGTCG ACGGCGCCAA GGTCGAGTGG GTGGGCGGGC GGCCGCTCAT CCAGTACCGG GGCCGGCTCA TGCCCCTGGT GCCCGTCGAT CCCGATCAGG TGCTGCGGCG CGAGGGCGCG CAGGCCCTGG TGGTGTTCTC GGACGGCGAG CGCTCGATGG GCCTCGCGGT GGACGAGATC GTCGACATCG TCGACGAGGT GCTCGACGTC GAACTCACGG CGGACCGGTC CGACCTGATC GGCTCGGCGG TGGTGCGCGG GCGCGCGACC GAGATCGTCA ACATCGCCCA TTACCTGCCG CTCGCGCACG ACGACTGGGC GCGCGCGCCG CGCCGCAAGG AGGAGCAGGC GCCGCGGCGG CTGCTCCTGG TCGACGACTC GGCCTTCTTC CGCGAGATGC TGACGCCCGT GCTGAAGGCC GCCGGCTACC GGGTGATCCC GGCCGCCAGC GCCGAGGAGG CTCTCACGGT GCTCACCGGC GAGACGCCGA TCGACGTCGT CGTCGCCGAT CTGGAGATGC CGGGCCGCTC CGGCTTCGAC CTGATCGAGC AGATGCGCCG GACCGGGCCG CGCCTCGCCG AGATGCCGGT GATCGCGCTC GCCTCGAGCG TCGCCCCGGA GGGGATCGAG CGGGCGCGCG CCCTCGGGAT CGCCGACTTC GTCGCGAAGT TCGACCGCAG CGGCCTCGTC GCCGCCCTCA ACGAGATCAC CGCGCCCAGC CTCGACGCCG CGGCCTGA
|
Protein sequence | MDDLLREFLT ETGEHLDTVD LELVRFEQDP NNQTILRNIF RLVHTIKGTC GFLGLPRLEA LAHAAETLMG KFRDGMGVTS EAVTLILQTL DRIKVIVAEL ERTAAEPVGA DEDLIGALER MAEGEAAPAP APAPPPPPVP MTTGSLVSQT LERALKADEV SLDDLERAFR DTPGPEASPA PMTTGSLVPQ TLERALKPGE VSLDDLERAF RDTPGPSAAP AKAVSAPKAA PAPAEAPRAD AAPETDGAAI NKVQTIRVNV DTLEHLMTMV SELVLTRNQL LEIARRQEDN SYKVPLQRLS NVTAELQEGV MKTRMQPIGS AWQKLPRVVR DLSSELGKKI DLVMQGAETE LDRQVLEVIK DPLTHMVRNS ADHGIESALE RKAAGKPEKG TIRLNAFHEG GTITIEISDD GKGLDLATIR RKAVERGIAT EAEVERMTDA QVAKFIFHAG FSTAKAVTSV SGRGVGMDVV KTNIELIGGT VDIRTQFGQG TTFTIKIPLT LAIVAALIVS ARDHRFAIPQ VSVLELVRVQ PGSDHQVERI NGSPVLRLRD RLLPIVPIAA MLGLDKDPTT ASSDEGFVVV SQVGRQRFGI LVDGVFHTEE IVVKPMSTKL RHIPLFSGNT ILGDGAVVLI IDPNGVARMV GSGTATGQPA EADAEAEEAA ASADQTVTLL VFKGGGDALK AVPLSLVTRL EEVDGAKVEW VGGRPLIQYR GRLMPLVPVD PDQVLRREGA QALVVFSDGE RSMGLAVDEI VDIVDEVLDV ELTADRSDLI GSAVVRGRAT EIVNIAHYLP LAHDDWARAP RRKEEQAPRR LLLVDDSAFF REMLTPVLKA AGYRVIPAAS AEEALTVLTG ETPIDVVVAD LEMPGRSGFD LIEQMRRTGP RLAEMPVIAL ASSVAPEGIE RARALGIADF VAKFDRSGLV AALNEITAPS LDAAA
|
| |