Gene Mext_3507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3507 
Symbol 
ID5833702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3884461 
End bp3885999 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content69% 
IMG OID641369306 
ProductPAS sensor protein 
Protein accessionYP_001640963 
Protein GI163852920 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTTT CCATGGCCGC GGCTGAGGCG ATTCAGGGGG GCGGCGAAGC GCTGACGGCC 
GAGGACTTCC GGCAGACCCT CCACGAGGTC GGTGTCTGCA TCTGGTCGCT GGACATTTCC
ACCGGCCGTG TCAGCGCTTC GCAGACCTGC GGCTGCCTCT TCGGTATTCC AACCGAACGT
CTGACGAGCT TTGCCGCGAC CCAGGATCTG GTCCACCCGG ACGACCGCCA AGCTCGCGCT
CACGCCATCG AGAGCGTGCT GCGGGACGGC GGCAGTTACG AGATCGAATA CCGTGTCGTG
CTGCCGAATG GGCGGGGCGG CTGGCTGCGC TCGCGGGGGC AGGTGCATCT CGACGCCGAA
GGCCGGCCCC ACCGGCACCG CGGGGTCGTC TTCAGCATCG AAGAGCAGAA GCAGGTGGAG
GCGGAGCTGC GCGCCCGCGA GGCTCATCTC CGCTCGATCC TCGACACGAT GCCGGAGGCG
ATGGTGGTCA TCGACGAGGC AGGGCTGATC CACTCGTTCA ACCCGGCGGC CGAACGCCTC
TTCGGCTACG CGGCCGGCGA GGCGATCGGG CAGGACGTCC GCATCCTGAT GCCGGAGGCG
ATGCAGGATG GACATGCCGC CGACCTCGAG CGCTACCGGC AGACGCGCCA GCGCCACATC
ATCGGCACCA CGCGGTGCGT GACGGGCCGA CGGCATGACG GCTCGACCTT TCCGATGGAG
CTGGCCATCG GCGAGATGCA TTCGGGCGAG CGGACCTTCT TCACCGGCTT CATCAACGAC
CTCAGCGCGC AGCGGCGCAC CGAGGCGCGG CTTCAGGAAC TCCAGTCCGA GCTGGGCCAT
GTTTCCCGCT TGAGCGCCAT GGGCGAGATG GCGACGACGC TCGCCCACGA GCTGAATCAG
CCGCTCGGCG CCATCACCAA CTACACCAAC GGCTGCCGCC GCCTCCTCGC CCATCCCGAC
CCCGAGACCA TCGCCCGGGC ACAGGAGGTT CTCGACAAGG CGGCCGAGCA GGCGCTGCGG
GCCCGGCAGA TCATCGCCCG CCTGCGGGAG TTCGTCGCCC GTGGCGAGAC GGAGAAACGG
GTCGAGCCGG TCGCGACGAT GATCGAGGAG GCCGGCGCCC TGACCCTGGC GGCGGCCGGC
GAGCAGGGCA TCACGGCCCA CGTCGTGCCG GATCCGCGGG TCGGATCGGT CTTGGTCGAC
CGGGTTCAGG TGCAGCAGGT TCTGGTCAAC CTGATGCGCA ATGCCTGCGA GGCGATGCAG
CGCAGCAGCC GGCGCGAGCT GACCGTCGCG ACGCGGCGGG TTTCGCCGGA TCTGGCCGAG
GTCGCGGTGT CGGATACCGG CCCCGGTATC GCCGAGGAGG TGGCCGACCG GCTGTTCCAG
CCCTTCGTCA CCACCAAGGA TGCCGGGATG GGCGTCGGCC TTTCGATTTC CCGCACCATC
ATCGAGGCGC ATGGCGGCCG CCTCTGGGTC GAGCCCAACG CCGCGGGCGG GGCGACCTTC
CGGCTGACCC TGCCGACCGC ACCCGAGAGA GGTCGGTAA
 
Protein sequence
MTLSMAAAEA IQGGGEALTA EDFRQTLHEV GVCIWSLDIS TGRVSASQTC GCLFGIPTER 
LTSFAATQDL VHPDDRQARA HAIESVLRDG GSYEIEYRVV LPNGRGGWLR SRGQVHLDAE
GRPHRHRGVV FSIEEQKQVE AELRAREAHL RSILDTMPEA MVVIDEAGLI HSFNPAAERL
FGYAAGEAIG QDVRILMPEA MQDGHAADLE RYRQTRQRHI IGTTRCVTGR RHDGSTFPME
LAIGEMHSGE RTFFTGFIND LSAQRRTEAR LQELQSELGH VSRLSAMGEM ATTLAHELNQ
PLGAITNYTN GCRRLLAHPD PETIARAQEV LDKAAEQALR ARQIIARLRE FVARGETEKR
VEPVATMIEE AGALTLAAAG EQGITAHVVP DPRVGSVLVD RVQVQQVLVN LMRNACEAMQ
RSSRRELTVA TRRVSPDLAE VAVSDTGPGI AEEVADRLFQ PFVTTKDAGM GVGLSISRTI
IEAHGGRLWV EPNAAGGATF RLTLPTAPER GR