Gene Mext_0108 details

Gene Information

Locus tag: Mext_0108
Symbol:
ID: 5833312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 113859
End bp: 115721
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 63%
IMG OID: 641365893
Product: PAS sensor protein
Protein accession: YP_001637607
Protein GI: 163849564
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
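
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(113859, 115721)
print(length)                  # 1863
print(protein_length(length))  # 620
```

Both values match the Gene Length and Protein Length fields listed in the record.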


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAGTC GCAACTCGCT CGAGTGGCCG CCTGGCAGTA GTGAAATGGC GGAGCGCATC 
CGCGCGCATG ACTGGTCAGG ATCGCCGCTC GGATCAGCTG GGCGCTGGTC GCCGCGCCTG
AAACTTGCTG TCGAGATGAT GCTGACCAAC CCTCTTGCCG CCAGCCTCGT ATGCGGGCCA
GAACGTCTCC TGATCTACAA CGATGCAGCT GCCACCCTCA TTGGCGGGCA ACATCCCGCA
GCCCTCGGCC AGCCTCTGCC GCATACCTTC CCGGCTGGCC GGGCGGCGGC CGAGCCGTTC
TTCCAGCGTG CCTTCGCAGG TGAGGTAATA CAGGTCCATG GCCAAGCACT CAGTATGTGC
TGTGAGGGTC AGGCGAACGC AGACGTGTCC GATGCCATCC TCATGCCTGT GCGCGACGAG
GACGGGCAAG TCGCTTACGT CCATATGATC TGCACTGAGA ATGGCGGGCG GGCTCGAGCC
GAAACCATCG CACGTCAGAG TGACGAGCAA CTCGCCGCTA TCTTCGGCAG CGCCGCAGTC
GGGCTGTCCG AGCTTGCGCA GGATGGCCGC TTCCTACGTG CCAACGACGA GCTCTGCCGT
ATCCTGGGGC GCACCCGTGA GGAGGTGCTG CGTCTCACCG TGATGGACGT GACGTATCAT
GACGACGTCC CCCCCAGCCT GATCGCTGTC GCGGAGGCGC TTCGAACAAA CCATCCCGCC
TCGCTGGACA AGCGCTACCT GCGCCCAGAC GGCAGCTTTG TGTGGGCCAA CAGCCGTGTG
CAACCGCTGC ACCATGAGGG CGAGCCGAGT ACGCTATTGG CTGTCACGGC TGACTTGAGC
GAGCGCCGTC ACACTGAGGA GCGCCTGCGC GAGAGCGAGG AGCGGTTCCG GGCGTTAGCG
AACCTTGTGC CGGTCATTCT ATGGCGCTCG GACGCGAGCG GGCTCACCTT CTCAGAGAAC
CAATCCTTCC TCGATTACAC CGGCCAAACG GCAGACGAAG TGCAGGATTT CGGCTGGCTC
GCCCCAATGC ATCCCCATGA CCGTAGGGGC GTACAGGAGG CATTTGTGCA TGGAATCGAC
ACCAAGCAGC CAATCGACGC GCAGTTTCGA CTGCGTAGCC GCGATGGACA ATACCGGTGG
TTCCTCGCAC GGCAGGTGCC AATCTTTGGC GGAGAGGGGC AGGTAACGGA GTGGTTCGGC
GCGGCCATGG ACATCCACGA GCTGCACGAA CTGCAGCAGC GACAAGCCGT CATGGTGGAT
GAACTGCAGC ACCGGACCCG CAACCTGCTC ACTGTGGTGC GCGCCATCGC GCAGCAAACC
ATGGCTCAGA CCGGCCCGAC TGTGCTGTTC CGCGAGCAGT TCAACGACCG CCTCGCAGCG
CTCTCTCGCG TGCAGGGCCT TCTCTCCCGC TCTGATCAGG AGCCAATTAC CATTCGGGCC
CTTGTCGAGA TGGAGCTTGA TGCGCTCGGA GCGACAGACA TGGGCGAGCG CATCGCGGTA
GAGGGGCCTC GGGTGGTGCT GCGAAAGGGC AGTGTGCAGA CGTTGGCTCT GGCTCTGCAC
GAACTTGCCA CCAATGCGCG CAAGTACGGT GCCTTGTCCT GTGAGCAGGC CGAACTCTGG
GTCAGCTGGG ACACCTACAT CGCGGAAGCC GGAGAGCAGC GACTTGCCCT TACATGGCTG
GAGGAGGGAA TCTGCCGTTC GCGACAGAGC AACCCCGTTC GGCGCGGCTA TGGGCGTGAG
CTGATCGAGA AGGCGCTACC CTACACGTTA AAGGCACACA CAGACTATGA ACTTGGCGAG
AATGAGCTGC GTTGCTCCAT TGACCTGCCA CTCACTGAGC TTGCAGGGAC AAGGGCCCAA
TGA
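
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation such as GC content. The following is a minimal sketch, assuming the sequence contains only A, C, G and T; `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGGGCAGTC GCAACTCGCT CGAGTGGCCG CCTGGCAGTA GTGAAATGGC GGAGCGCATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to the full 1863 bp sequence gives a value that can be compared with the GC content field in the record.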
 
Protein sequence
MGSRNSLEWP PGSSEMAERI RAHDWSGSPL GSAGRWSPRL KLAVEMMLTN PLAASLVCGP 
ERLLIYNDAA ATLIGGQHPA ALGQPLPHTF PAGRAAAEPF FQRAFAGEVI QVHGQALSMC
CEGQANADVS DAILMPVRDE DGQVAYVHMI CTENGGRARA ETIARQSDEQ LAAIFGSAAV
GLSELAQDGR FLRANDELCR ILGRTREEVL RLTVMDVTYH DDVPPSLIAV AEALRTNHPA
SLDKRYLRPD GSFVWANSRV QPLHHEGEPS TLLAVTADLS ERRHTEERLR ESEERFRALA
NLVPVILWRS DASGLTFSEN QSFLDYTGQT ADEVQDFGWL APMHPHDRRG VQEAFVHGID
TKQPIDAQFR LRSRDGQYRW FLARQVPIFG GEGQVTEWFG AAMDIHELHE LQQRQAVMVD
ELQHRTRNLL TVVRAIAQQT MAQTGPTVLF REQFNDRLAA LSRVQGLLSR SDQEPITIRA
LVEMELDALG ATDMGERIAV EGPRVVLRKG SVQTLALALH ELATNARKYG ALSCEQAELW
VSWDTYIAEA GEQRLALTWL EEGICRSRQS NPVRRGYGRE LIEKALPYTL KAHTDYELGE
NELRCSIDLP LTELAGTRAQ
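
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the standard amino acid assignments (which table 11 shares with the standard code) and stopping at the first stop codon. It does not handle alternative start codons or ambiguous bases, and the names used are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGGGCAGTC GCAACTCG"))  # MGSRNS
print(translate("AGGGCCCAA TGA"))        # RAQ
```

The two fragments reproduce the first six and last three residues of the protein sequence shown above.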