Gene M446_4638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4638 
Symbol 
ID6129467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5097395 
End bp5098876 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content74% 
IMG OID641644777 
Productmethyl-accepting chemotaxis sensory transducer with Pas/Pac sensor 
Protein accessionYP_001771412 
Protein GI170742757 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein
[COG2202] FOG: PAS/PAC domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.246645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.228311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGT GGTTCTTCGG CCGTGACAAT GCCGCGCTGA TCGCGGCGCT CGACCGCTCG 
CAGGCGGTGA TCGCCTTCGC GCCGGACGGG ACGATCCTCG ACGCCAATGC CCTGTTCCTC
ACGGCGGTCG GCTACGGGCT GGACGAGATC AAGGGCCGCC ACCACCGGCT GTTCGTCGAC
CCGGCCGAGC AGGACGGGGC CGCCTACCGG GACTTCTGGG CGTCGCTCGC CCGCGGCGAG
CCCCATACTG CCGAGGTGCG GCGGCTCGGC AAGGGCGGCC GCGAGATCTG GCTGCAGGCG
ACCTACACGC CCATCCCGGG CCGCGGCGGC CGCCCGACCA AGGTGGTGAA GGTCGCCACC
GACATCACCG AGCGCAAGCG CAGCGACGCG GATTGCCGCG GCAAGATCGA GGCGGTCGAG
CGCGCCTGGG CGATGATCGA GTTCGACCTG TCGGGCCGGA TCCTGACCGC CAACCCGAAT
TTCCTGCGCG CGGTCGGCTA CGGCCTCGAC GAGATCGCCG GGCGCCATCA CGAGATCTTC
GTCGATCCCC AGGAGCGGGC GGCGCCGGCC TACGCGGCGT TCTGGCAGCG GCTCGGCCGC
GGCGAGTTCC ACTCCGGCGA GTTCCGCCGC ATCGGCAAGG GCGGCCGCGA GATCTGGCTG
CAGGCGATCT ACAACCCGGT GCTCGACGCG AGCGGCCGGC CCGTCAAGAT CGTCAAGTAC
GCGACCGACA CCACCGAGGC GGTGCGGCAG CGCCGCGAGC GCGAGCAGGC GCTGGCGGCG
ATCGAGGCGA AGCTGTCGGA GATCGACGAG ACCATGGCGG GGGTGGCGGC CCAGGTCTCT
GGGACCGCGC AGGCGGCCGA CGCGACCGCC GCCACTGTGG GGCAGGCGGC GGCGGGAACG
CGGGCCGTCG CGGCCTCGAT CGAGGGCCTG ACGCGCCACG CCGGCGAGGC GCGCCACGCC
GGCGACGCCG CCGTGCGCCA GACCGAGGAG GCGCGGGGCA TCGTGACCGG CCTGCTTCAG
GCGGCCGACC GGATCGGCGA CGCGGTGAGC GCGATCCGCG CTGTGGCCGA GCAGACGAAC
CTGCTCGCGC TCAACGCCAC GATCGAGGCG GCGCGGGCCG GCGAGGCGGG GCGGGGCTTC
TCGGTGGTCG CCACCGAGGT GAAGGCGCTG GCCGGCCAGT CGTCGCGGGC GGCCGAGGAG
ATCGGCAGCC TGATCGCCGC GGTCCAGACC TCGACCGGCG AGGCCGTGCG GGTGATCGAG
ACCATCAGCC AGGCCATCCA CCAGATGAGC GCGCTCTCCG GGCGCGTGTC GCAGGCGGTG
TCCGAGCAGG CCGACGCCAC GCGGGCGGTC ACGAGCGGGG TGGAGACGGT GGCGGGGAGC
GTGGAGCAGG TGCGCGAGAG CGCCCGGGCC ATCGAGGCCG CGGCGGCCGC GGTGGGCGCC
TCGATGAGCG AGATGGCCGG GTTCGCGCGC CGGATGGTCT GA
 
Protein sequence
MSMWFFGRDN AALIAALDRS QAVIAFAPDG TILDANALFL TAVGYGLDEI KGRHHRLFVD 
PAEQDGAAYR DFWASLARGE PHTAEVRRLG KGGREIWLQA TYTPIPGRGG RPTKVVKVAT
DITERKRSDA DCRGKIEAVE RAWAMIEFDL SGRILTANPN FLRAVGYGLD EIAGRHHEIF
VDPQERAAPA YAAFWQRLGR GEFHSGEFRR IGKGGREIWL QAIYNPVLDA SGRPVKIVKY
ATDTTEAVRQ RREREQALAA IEAKLSEIDE TMAGVAAQVS GTAQAADATA ATVGQAAAGT
RAVAASIEGL TRHAGEARHA GDAAVRQTEE ARGIVTGLLQ AADRIGDAVS AIRAVAEQTN
LLALNATIEA ARAGEAGRGF SVVATEVKAL AGQSSRAAEE IGSLIAAVQT STGEAVRVIE
TISQAIHQMS ALSGRVSQAV SEQADATRAV TSGVETVAGS VEQVRESARA IEAAAAAVGA
SMSEMAGFAR RMV