Gene Mext_4428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4428 
Symbol 
ID5833462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4923301 
End bp4925382 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content73% 
IMG OID641370221 
ProductPAS sensor protein 
Protein accessionYP_001641867 
Protein GI163853824 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.426756 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCTGA GCATCGCCGC CGCGATGCTG GCGCTAATCC TGGTGTCGCT CCGCCGCCGC 
GAGCCCGTCG TGCCGGAATC CGACATGACG GAGGCGCTCC AGGACCGGCT CTGGCAGATC
GCCGAGAGCG AGGAGCGCTA CCGGGTGCTG GTCGAGGCCA CCACCGAGGC CGTGGTTCAA
CGGGACGGGC AGGGGCGGAT CACCTTCGCC AGCGCGGGGT TTGCCGCGCT GCTCGGCATG
AAGCCGCTGG AGCTGATCGG CTCGACGCTG AGCCCGCAGG TGATCGAGCG CGGCGCGAGC
GAGCAGCGCG CCGACGGCGT GCGGGTGGTC GAGGAGCGTC TGGTGCCGGT GGACGGGCTG
CCGCGCTGGT TCTCCTTCAT CGAGATGCCG GTCTCCGGCA GCGTTGACGG TCCGAACTGG
CTGCGGGCCG GCCAGGACGT CACCGCGCGC GTCGAGGCGG CGCGTGTCCT CGACGAGGCG
AAAAGCCGGG CGGAGGCCGC CAACGTCGCC AAATCCCGCT TCCTCGCCAC CGTCAGCCAC
GAACTGCGCA CGCCGTTGAA CGGCATTCTG GGGATGGCCG ACCTGCTGCT CGACACCCGG
CTCGACCCCG AACAGCGCAC CTATGTCGAG GCGTTCCGCA CCAGCGGCAA GGCACTGCTC
GGCCTCGTGG ACGGCATCCT CGATTTCTCC CGGATCGAGG CCGGCCGCCT CGATCTGGCC
GCCGAACCCT TCGACGTCGC CGCCCTGGTC GAGGGCGTGG TCGAGCTGCT GGCGCCGCGC
GCGCAGGACA AGGGCCTGGA AATCGCCCTC GACATCGCCG ACGACCTTGC CGCCCTGCGC
GTGGGCGATG CCGACCGGGT GCGGCAGATC CTCGTGAATC TGGCCGGCAA CGCCATCAAG
TTCACGCAAG CCGGCGGCGT CGGCGTCAGT CTCGCCCGGT CGGGGGAGGG GCAGGGGGAA
GGGCTCGTCC TCACGGTCGA GGATACCGGG CCGGGCATTC CGGAGGAGCG CATCCCGATC
CTGTTCGAGG AGTTCGAGCA GGGCGACGAC AGCGCGAGCC ACGAGGGCAC CGGCCTCGGT
CTGGCCATCA CCCGCCGCCT CGTGGAGCGC ATGAACGGCA CGATCGAGGC GCGCTCGACG
GTGGGGCGCG GCTCGACCTT CCGGGTCGTG CTGCCGCTGC CGGCGGCGGA GGGTGCGATG
AGCCCCGAGA CACCGAGCCT CGCGCCGCGA AAGGTGCTGA TCGTGGCGGC CTCGCCGTAT
CAGGCGCCGT TCCTGGCCCG GCGGCTCAGC CGCTCCGGGG CGGCGGCGGT GGTCGTGAAC
AGCGCGGAGG CCGCCCTCGA TGCGCTGTCG GGCGTGGCCT TCGATGCGCT CATCGCCGAC
CGTAGCCTCG GCGACGCCGC CGTGCGGCGG CTCGCGGCCG AGGCCGCCCG CTGCGGCGTG
CGCTGCAGCC TGATCCTGCT CTCGCCCTTC GACCGGCGGG AATTCGGAGC CCCGAACGCG
GCGGGCTTCG ACAGCTACCT GATCAAGCCG GTGCGCGCCC GCTCGCTGTT CGACCGCCTG
CTGGAGCCGC GGCCCGCCCC GGCCCGCAGC CCGGCCGACG CGACGGCCAA GACTGGCACG
AGCCAACCCG GCCCGCGCCA GCCCGCTCCG GTCAAGACTA CACCGGGCAA GACTGGATCA
ATCCAGACCG CCGCGCGGGG CCGCAGGGTG CTGCTGGCAG AGGACAACCC GATCAACGCG
CTGCTCGCCA CCAAGGCGCT GGAGCGGCTC GGCGCACAGG TGATCCACGC CCGCGACGGG
CTCGAGGCTC TGGCGGCGGC GGAAGGGCAG GGGCCGTTCG ACCTCGCGCT GATCGACATC
CGCATGCCCG GCCTCGACGG CCTGGAGACC GCCCGCCGCA TCCGCGCCCG CGAGGCCGAG
ACCGGCGCGA GCCCGCTCCA CCTCGTGGCG CTCACCGCCA ATACCGGCCG CGAGGATGTC
GACGCGGCCT CCGCGGCCGG GTTCGACGGC TTCCTGCCCA AGCCGCTGAA CCTGCGCGCC
CTGCCGGCTC TGCTGGACCG CCGCCAAGAG GAGGCAGCTT GA
 
Protein sequence
MGLSIAAAML ALILVSLRRR EPVVPESDMT EALQDRLWQI AESEERYRVL VEATTEAVVQ 
RDGQGRITFA SAGFAALLGM KPLELIGSTL SPQVIERGAS EQRADGVRVV EERLVPVDGL
PRWFSFIEMP VSGSVDGPNW LRAGQDVTAR VEAARVLDEA KSRAEAANVA KSRFLATVSH
ELRTPLNGIL GMADLLLDTR LDPEQRTYVE AFRTSGKALL GLVDGILDFS RIEAGRLDLA
AEPFDVAALV EGVVELLAPR AQDKGLEIAL DIADDLAALR VGDADRVRQI LVNLAGNAIK
FTQAGGVGVS LARSGEGQGE GLVLTVEDTG PGIPEERIPI LFEEFEQGDD SASHEGTGLG
LAITRRLVER MNGTIEARST VGRGSTFRVV LPLPAAEGAM SPETPSLAPR KVLIVAASPY
QAPFLARRLS RSGAAAVVVN SAEAALDALS GVAFDALIAD RSLGDAAVRR LAAEAARCGV
RCSLILLSPF DRREFGAPNA AGFDSYLIKP VRARSLFDRL LEPRPAPARS PADATAKTGT
SQPGPRQPAP VKTTPGKTGS IQTAARGRRV LLAEDNPINA LLATKALERL GAQVIHARDG
LEALAAAEGQ GPFDLALIDI RMPGLDGLET ARRIRAREAE TGASPLHLVA LTANTGREDV
DAASAAGFDG FLPKPLNLRA LPALLDRRQE EAA