Gene Mext_4037 details


Gene Information

Locus tag: Mext_4037
Symbol:
ID: 5834365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4492077
End bp: 4493177
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 69%
IMG OID: 641369828
Product: PAS sensor protein
Protein accession: YP_001641478
Protein GI: 163853435
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
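
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4492077
end_bp = 4493177

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1101 0 366
```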


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.221223
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGTT TTCCCAAAGG AGCGAGCCTG CCCTCGGGCG TCACGGTGGA GGTCTTCGCG 
GCAGCCTTCG AGGCCAGCCC GACGCCGATG GTGGTCACCG ATCCGCGGCG GGGCGACAAC
CCGGTCGTCT GGGCCAACGG CGCCTTTCTC GGACTCACGG GCTATGCCCG CGAGGAACTC
TACGGCCAGA ATTGCCGCAT GCTGCAAGGT CCCCTCACCG ATGCGGCGGT GCTTCAGACG
ATGCGGGCGG CGCTCGCCAC AGGCCGGCCG TTCGAGGGCG AGCTGCTCAA TTACCGCAAG
GACGGCACAT CGTTCTGGAA CGGAATGACG ATCAACCCGG TCTGCGACGA GGCGGGCAAG
GTCCTGTTCT TCTTCTCGGC CCAGGCCGAC ATGACCGACA AGCACCGCCT GGAACTGGCG
ATGCGCGACG CCAACGACGC GCTGGAGCGC GAGGTGAGCG AGCGCACCGC CGACCTGCGC
TCGGCCCTGG AACAGAAGAC CGCGCTGCTC CACGAGGTCG ATCATCGGGT CAAGAACAAC
CTCCAGGTCA TCTCCTCGCT GATGCTGCTG AAGGCCCGCC GCACGCCGGA GGGCGATGCC
CGCAACGCGC TCCAGGCCAT GGCCGACCGG ATCGGCGCCC TCTCCACGGC CCACCGGATG
CTGTACTCGG AGGGCGACGT GACCCGCTTC GACTTCCGGG AGTTCACCGC CGACCTGATC
GCCGACCTCG CCGCCGGCCT CGACGGGGAC CGCACCCGGA TCGAGACGGA GATCGAGGCG
CTGGCGCTCT CCGCCGCCAT GGCCGCCCCG CTGGCGCTGC TGATCCACGA ATTGACGACG
AACGCCCTGC ACCACGCCTT CCCGGAGGCG CGCCGCGGCC GGGTCGCGAT TGAGGCACAC
CGTTTCGAGG CGGGGATGCG CCTCGTCATT CAGGACGACG GCATCGGCAT GGCCGCGGTG
CCGTCCAACC CCGCAGGCTT CGGCCGCACC CTGGTCGAGA TGGTGGTGCG CCAGTTGCGC
GGCACCCTCG AATGGTCGGA TGCCGGGCCC GGCACCCGGA TCACGATCAC GATCCCGCTG
GTCGGGACCG ACGCATTGTG A
 
Protein sequence
MTGFPKGASL PSGVTVEVFA AAFEASPTPM VVTDPRRGDN PVVWANGAFL GLTGYAREEL 
YGQNCRMLQG PLTDAAVLQT MRAALATGRP FEGELLNYRK DGTSFWNGMT INPVCDEAGK
VLFFFSAQAD MTDKHRLELA MRDANDALER EVSERTADLR SALEQKTALL HEVDHRVKNN
LQVISSLMLL KARRTPEGDA RNALQAMADR IGALSTAHRM LYSEGDVTRF DFREFTADLI
ADLAAGLDGD RTRIETEIEA LALSAAMAAP LALLIHELTT NALHHAFPEA RRGRVAIEAH
RFEAGMRLVI QDDGIGMAAV PSNPAGFGRT LVEMVVRQLR GTLEWSDAGP GTRITITIPL
VGTDAL
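
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, not in codon meanings). The names `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helpers written for this example and are not part of any database tool.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGACGGGTT TTCCCAAAGG AGCGAGCCTG"
print(translate(first_blocks))  # MTGFPKGASL

# Last line of the gene sequence listed above, ending in the TGA stop codon.
last_blocks = "GTCGGGACCG ACGCATTGTG A"
print(translate(last_blocks))  # VGTDAL
```

Passing the full gene sequence to `translate` in the same way gives a string that can be compared with the protein sequence above, and `gc_fraction` gives a value to compare with the listed GC content.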