Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3255 |
Symbol | |
ID | 6130650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3602857 |
End bp | 3606390 |
Gene Length | 3534 bp |
Protein Length | 1177 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641643442 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001770094 |
Protein GI | 170741439 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0680649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.187811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCGG GCTGGGCGGT GGTGCTGACC GCACTCACCT ACATCTGCGC CCTGTTCACG GTGGCGCATT GGGGCGACGT GTCCGGCCGG CGCCTGATGC GCGACGAGCG CGTGCGGCCG ACCATCTACG CCCTGTCGCT CGCGGTCTAC TGCACCTCCT GGACCTTCTT CGGCTCGGTC GGGCTCGCCA GCCATTCGGG CCTCGACTTC CTCACCATCT ATGTCGGCCC GGTGCTGGTG ATCGGCCTCG GGCACCGGCT GGTGGCACGG GTCGTGCGGA TCGCCAAGGC CCAGAACTCG ACCTCCGTCG CCGACTTCAT CGCGGCCCGC TACGGCAAGA GCGAGCGCAT CGCCGCCCTG GTCTGCCTGA TCAGCATCGT CGGGGCGATC CCCTACATCG CCCTGCAGCT GCGCGCCGTC GCGGCCTCGC TGCGGGTCTT CCTCGACGCC ACGGACGGGC GCGGCGGCGG CACGATCGGC CTGATGGGCG ATCTCGGCCT GTTCACCGCC CTGGTCCTCG CCGGCTTCGC GGTCGCCTTC GGGACCCGCC ACGCGGACGC CACCGAGCAC CAGGACGGGC TGACGCTCGC CATCGCGATC GAGTCGCTCG TCAAGCTCCT CGCCTTCCTG GTCGTCGGCG GCTTCGTGGT CGGCTGGGTG CTGCGGAAGG CGCCGGTGGT CACCCCCGGG GCCCTGCTCG GCGGGACGGC CACCCTGGTC GCCGACACGT CCGGCCCCTG GACGCTGCTG GTGCAGGTGC TGCTCTCGTC CTGCGCCGTG CTGCTGCTGC CGCGCCAGTT CCACATGGCC GTGGTGGAGA ACCGGGCGGT GGCGGACGTG ACCCGCGCCG CCTGGGCCTT CCCGCTCTAC CTCGTGCTCA TCAACCTGTT CGTCGTGCCG CTCGCGGTGA TCGGCCTGAT GATGTTCCCG GACGGGAGCG TCATGCGCGA CATGACGGTG CTCGCCCTGC CCCTGGCCGA GCGCGCGGAC GGCATCGCGC TCATCGCCTT CGTGGGCGGG CTCTCGGCCG CGACCGCGAT GGTGATCGTG GAATCGGTCG CGGTCGCGAT CATGATCTCG AACCACCTCG TCATCCCGCT GGTCCTGCGC GGGCGGCCGG GGAGCCAGCG GGCGACCAAT CTCGGCGGCG TCGTGCTGGC GGTGCGCCGG GTCGCCATCG TGGTGGTGAT CCTGGCGGCC TACGCCTATT CGCGGGTGGC CGGCGAGGTG GCGCTCGCCT CGATCGGCCT CCTGTCCTTC GCGGCCGTGG CGCAGATCGG GCCGGCCTTC CTCGGCGGCC TGATCTGGCG GCGCGGCACC GGCCTCGGCG CGGTGGCGGG GCTGACGGCG GGGCTCGCGG TCTGGGCCTA CACGCTGCTG CTGCCGAGCC TGCTCGGCGA ATCCGCCAGC CCCTGGGCGC GCGCCTTCCT GGAGGACGGG CCCTTCGGCA TCGCGGCCCT CAGCCCCACC GCCCTGATGG GGCTCGACGA CCTGCCGCGC CTCGTCCACG GCACGCTCTG GAGCCTCGGC CTCAACGCCC TGGCATATTG GGGCTTCTCG CTGCTGCGGG CGCCGAGCGC GATCGAGCGG CTCCAGGCCG AGGCCTTCGG GCACGAATTC GTGCAGGACG CGCCGCCCCT GCGCCTGTTC CGCGGCACGC TGAGCTTCGG GGAGCTGCGC GCCGCCGTCG CCCGCTTCCT CGGCGAGGAG CGCGCGCAGC GGGCCTTCGA CGCCTACTTC GCCGAGCGCG GCCGGATGAT CCTCCACCCG GACGCGGTGG CGGGGCTCGG CGAATTGCGC CACGCCGAGC ACCTGCTCGC CTCGGCGATC GGCGCCTCCT CGGCCCGGCT CGCGCTCTCG CTGCTGCTCG GGCGGCGCAA CGTCTCGCCG CGGGCGGCCC TGCGCCTCCT CGACGACGCC TCGGCGGCCT TCCAGTACAG CCGCGACTTC CTGCAGCACG GCCTCGACCA CGCGGGCCAG GGCATCACGG TCTTCGACCG CGACATGACC CTGATCGCCT GGAACCGGGC CTTCGCGGAC CTCTACGACC TGCCCAACGA CATCATGCGC ACCGGCATGC CGCTGGAGGA GATCGTCCGC TACAACGCCG CCCGGGGCGC CTACGGCGAC CGCGAGGCGG ACGACCTCGT GCGCGAGCGC ATCGCCGCCT TCCGGCAGGA GACCGGGCCG CAGCGCCTGC GCCTCTCCCC GAGCGGGCGC GTGATCGAGA TCCGGGCCAA CGCCCTGCCG AACGGGGGCG TGGTCGCGAC CTACACGGAC GTCACCGACG CGGTCGCGGC CGAGGAGGCC CGCGAGCGCC TCAACGAGGA GCTGGAGCGC CGGGTGCGGG AGCGGACCGA GGAGCTCACC CGCCTCAACG CCGCGCTGAG CCGCGCCAAG GCCGAGGCCG AGGAGGCCAA CGCCTCGAAG ACGCGCTTCC TCGCCGCCGC GAGCCACGAC ATCCTGCAGC CCCTCAACGC GGCCCGCCTC TACGCGGCGG CCCTGGTCGA GCGCGACCGC GCCGCCGACC CGACGCTCGC CGAGAACGTC GACGCCTCGC TGGACGCGGT CGAGGAGATC CTGACCGCGC TCCTCGACAT CTCCCGCCTC GACACCGGCG CCCTGACGCC GCAGCTCTCG ACCTTCCGGG TCTCCGAGCT GATGCGCCAG ATCCGGCGCG AATTCGAGCC GATGGCGCGC GAGAAGGGCC TGGAGCTGCG GGTGATGCCC TGCGGCCTCG GCGTCCGCTC GGACCGGCCG CTCCTGCGCC GGCTGCTCCA GAACCTCGTC TCCAACGCCA TCAAGTACAC CCAATCGGGC CGGGTCCTGG TCGGTGCGCG CCGCCGCGGC GAGCGCCTCG AACTCATGGT CTGCGACACC GGGCTCGGCA TCCCGGCCTC CAAGCGCAAG GTGGTGTTCC AGGAATTCCA GCGCCTCGAA CAGGGGGCCC GGGTCGCCCG CGGCCTCGGG CTCGGCCTCT CGATCGTCGA GCGCACCGCG CGGCTCCTCG GTCACCCGAT CCGCCTGCGC TCCGAGGTCG GGCGCGGCTC GATCTTCTCG GTCCTGGTGC CGGTCGCCGC CCTGCGGCCG GCCCCGGAGG CCGCCGCGGA GGCGCCCCGC CCGGCGGACG CCGCCCTCTC GGGCCTCTCG GTCCTCGCCA TCGACAACGA GCCGGCGATC GTGGACGGCA TGGCGCGCCT GCTCGCGAGC TGGGGCTGCC GCGTGCGCAC GGCGGGCTCG GTCGGCGAGG CGGTGCGGCG GGTGCTCGCC CCCGCCCCGC CCCCCGACGT GATCGTGGCG GATTACCACC TCGACGAGGG CAACGGGCTC GACCTGATCG CCTCGCTGCG GGCGGCCCTG TCGGCCGACG TGCCGGCGGT GCTGCTCACC GCCGACCGCT CGCCGCCGGT GCGGGAGACG GCCGCGGCGC AGCGCGTCCA CCTGCTCACC AAGCCGCTGA AGCCCGCCGC CCTGCGGGCG CTGCTGACCC AGTGGCAGGC CCGGCGGGCG GCGGCCGAGG AACCCTCCGG CTGA
|
Protein sequence | MIAGWAVVLT ALTYICALFT VAHWGDVSGR RLMRDERVRP TIYALSLAVY CTSWTFFGSV GLASHSGLDF LTIYVGPVLV IGLGHRLVAR VVRIAKAQNS TSVADFIAAR YGKSERIAAL VCLISIVGAI PYIALQLRAV AASLRVFLDA TDGRGGGTIG LMGDLGLFTA LVLAGFAVAF GTRHADATEH QDGLTLAIAI ESLVKLLAFL VVGGFVVGWV LRKAPVVTPG ALLGGTATLV ADTSGPWTLL VQVLLSSCAV LLLPRQFHMA VVENRAVADV TRAAWAFPLY LVLINLFVVP LAVIGLMMFP DGSVMRDMTV LALPLAERAD GIALIAFVGG LSAATAMVIV ESVAVAIMIS NHLVIPLVLR GRPGSQRATN LGGVVLAVRR VAIVVVILAA YAYSRVAGEV ALASIGLLSF AAVAQIGPAF LGGLIWRRGT GLGAVAGLTA GLAVWAYTLL LPSLLGESAS PWARAFLEDG PFGIAALSPT ALMGLDDLPR LVHGTLWSLG LNALAYWGFS LLRAPSAIER LQAEAFGHEF VQDAPPLRLF RGTLSFGELR AAVARFLGEE RAQRAFDAYF AERGRMILHP DAVAGLGELR HAEHLLASAI GASSARLALS LLLGRRNVSP RAALRLLDDA SAAFQYSRDF LQHGLDHAGQ GITVFDRDMT LIAWNRAFAD LYDLPNDIMR TGMPLEEIVR YNAARGAYGD READDLVRER IAAFRQETGP QRLRLSPSGR VIEIRANALP NGGVVATYTD VTDAVAAEEA RERLNEELER RVRERTEELT RLNAALSRAK AEAEEANASK TRFLAAASHD ILQPLNAARL YAAALVERDR AADPTLAENV DASLDAVEEI LTALLDISRL DTGALTPQLS TFRVSELMRQ IRREFEPMAR EKGLELRVMP CGLGVRSDRP LLRRLLQNLV SNAIKYTQSG RVLVGARRRG ERLELMVCDT GLGIPASKRK VVFQEFQRLE QGARVARGLG LGLSIVERTA RLLGHPIRLR SEVGRGSIFS VLVPVAALRP APEAAAEAPR PADAALSGLS VLAIDNEPAI VDGMARLLAS WGCRVRTAGS VGEAVRRVLA PAPPPDVIVA DYHLDEGNGL DLIASLRAAL SADVPAVLLT ADRSPPVRET AAAQRVHLLT KPLKPAALRA LLTQWQARRA AAEEPSG
|
| |