Gene M446_3643 details

Gene Information

Locus tag: M446_3643
Symbol:
ID: 6133365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4061946
End bp: 4063136
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 69%
IMG OID: 641643810
Product: response regulator receiver protein
Protein accession: YP_001770458
Protein GI: 170741803
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
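The length fields above are consistent with the coordinates if one assumes 1-based, inclusive start and end positions and a final stop codon that counts toward the gene length but not the protein length. A minimal Python sketch of that check follows. The function names are hypothetical, and the inclusive-coordinate convention is an assumption rather than something the record states.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length_bp(4061946, 4063136)
print(length, protein_length_aa(length))  # 1191 396
```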


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0163627
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCGA TCGGCAGGGT CGCGATCCAG GGTCTCGGGG CCTGTCTCCT CGCCGCCCTC 
CTCACGGCGC CGGCCCGCGC CGAGCCGGGC GAGGACGGCA AGCTGCGCGT CGGCCTGATG
TTCACCCTGA GCGGCCCCTC GGCGGTGCTC GGCGAGCAGG GGCGCGACGG GTTCCTGCTC
GCGCTCGAGA CCATGGGCCG GAAGCTCGGC GGCCTCGACA CCGAGGTGCT GGTGGTCGAC
GACGAGCTCA AGCCCGACGT CGCCGCCAAC CGGGCGCGGG ACTTCGCGCG GCGCGACCGG
GTCGATTTCG TGGTCGGCCC GACCTTCTCG AACGTGCTGC GGGCGATCGT GCGGCCGGTC
ACCGAATCGG GCGCCTTCCT GATCAGCCCC AATGCCGGCA CCTCGAACTA CGCCGGGTCC
GAGTGCAACC CGAACCTGTT CGTCTCCTCC TACCAGAACG ACCAAGTCCA CGAGGTCCTG
GGCAAGGTCG CGCAGGACAA GGGCTACAAG CGCCTCGTCC TCCTCGCCCC GAACTACCAG
GCCGGCAAGG ATTCGCTGGC CGGCTTCAAG CGCTCCTACA AGGGCGAGGT GGTGAGCGAG
ATGTTCACCC CGCTGGGCCA GCTCGACTTC TCGGGCGAGC TGGCGCAGAT CGCGGCCGCC
AGCCCGGACG CGGTCTTCGC CTTCATGCCG GGCGGCATGG GCGTCAATCT CGTGCGGCAG
TACCGGCAGG CGGGCCTCGC CCAGATCCCG TTCCTCTCCG CCTTCACGGT CGACGAGAGC
ACGCTGCCGG CCCAGAAGGA CGCCGCGGTC GGCTTCTACG GCGGCGCCAA CTGGGCGCCC
GACCTCGACA ACCCGCAATC CAAGGCCTTC GTGGCCGCCT ACGAGAAGGC GTATGGCCGC
GTGCCCGGCA CCTACGCCAT GCAGGCCTAC GACGCCGCCC AGATGATCGA CAGCGCCGTC
AGGGCCGCCA AGGGCAACCT GAAGGACAGG GACGCGCTGC GCGCCGGCCT CAAGGCGGCC
GAGTTCCCGT CGCTGCGCGG CCGATTCCGG ATCGGCAACA ACCACTTCCC AATCCAGGAC
TTCTACCTCG TCCGCGCCGC CAAGCGCCCC GACGGCAAGT ACGAGACCCA AATCGTCGAG
AAGATCTTCT CGGACTACCG CGACGCCTAC GCCGCCGAGT GCAAGATGTG A
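The gene sequence above is printed in blocks of ten bases, so its whitespace has to be removed before any computation. The Python sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The function names are hypothetical, and the record does not state how the database rounds its reported value.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, used only as a short demonstration;
# paste the full sequence here to check it against the GC content field.
first_line = "ATGAAGCCGA TCGGCAGGGT CGCGATCCAG GGTCTCGGGG CCTGTCTCCT CGCCGCCCTC"
seq = clean_sequence(first_line)
print(len(seq), gc_percent(seq))
```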
 
Protein sequence
MKPIGRVAIQ GLGACLLAAL LTAPARAEPG EDGKLRVGLM FTLSGPSAVL GEQGRDGFLL 
ALETMGRKLG GLDTEVLVVD DELKPDVAAN RARDFARRDR VDFVVGPTFS NVLRAIVRPV
TESGAFLISP NAGTSNYAGS ECNPNLFVSS YQNDQVHEVL GKVAQDKGYK RLVLLAPNYQ
AGKDSLAGFK RSYKGEVVSE MFTPLGQLDF SGELAQIAAA SPDAVFAFMP GGMGVNLVRQ
YRQAGLAQIP FLSAFTVDES TLPAQKDAAV GFYGGANWAP DLDNPQSKAF VAAYEKAYGR
VPGTYAMQAY DAAQMIDSAV RAAKGNLKDR DALRAGLKAA EFPSLRGRFR IGNNHFPIQD
FYLVRAAKRP DGKYETQIVE KIFSDYRDAY AAECKM