Gene M446_3321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3321 
Symbol 
ID6134282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3689073 
End bp3691994 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content68% 
IMG OID641643506 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_001770158 
Protein GI170741503 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTCG TGCCCCTCCG TCCTCGTTCT GCCTATCCCC TGCGCGGTGC GGTCGGCATC 
ACCTTGCTCA CTGCCTTGCT CCTGGTGACC CTCGGCGCGT GGCAGATCAT GCGCTCACGC
GACGCTGCTC TCGCGGACGC GCGCAAGGAC GCCCGCAATC TCAGCCGTTC ACTGGCCCAG
CACGCCGAGC GCACCATCGA GGCCGTGGAC CTCATCCTCT CGGGCACGAC CGAGTGGGTC
GAGGGCAACC CTGATCGAGC AACCCTGGGG AGTTTTCTCG TCCGCCGCAC CGGGACCATT
GCGCAGGTGC GCAATATCGC CATCACCGAT GCGAGCGGCT CGTGGATTGC CGACTCCCTC
TCGCCGCATC CCCCCCTCAG CAGCGCCGAT CGGCCTTACT TCGCTTGGCT CCGAGATCAC
CCGGAAGATC GGGCCATGTT CATCGGTGCG CTGATCCGCA GCCATTACGA TCAGAAGCCG
ATCCTCCCTC TCGCGCGGCG GCTGAACAAT TTCGATGGCT CGTTCGCCGG CGTCGTTGTG
GCCGAACTGG AGCCGGACTA CTTCGCACAG TTCTACGCGA CCTTCGGCCT AGGCGAGCGA
GGGACGATCG GGTTGTGGAG CTCGAACGGC AGGCTCCTCT CGCGAGACCC AGCGGCACCG
ACCGAGAGCC AGGACAAGGA CTTCTCGTCC ACGCCTCTCT TCCACAACCT GAGACGCATC
TCGACCGGCG TCTACGACGC GCCACCGGCA CCCATCGATG GCGTCAAGCG CATCGTCGCC
TACGAGCGGC TGGTGAAGTA CCCCGCCTTG GTGGTCACGG CCGGGATCGC GACCGATGAC
GCCCTCGCGG CTTGGCGGCG GGAGGCGCTC GTCCAGGCCA GCGCGATCAG CACGGCGGTG
CTCGCTCTCG TCGGGCTCGG TGTCGGCCTC GGGCGGCGCG ACCGGCGCGT GCGGGCCGCC
GAAGCCGAGG CGCGCGCGAG CGCGGACCTG CTCGCTGTGA CGCTCGAGAA CATGGACCAA
GGTCTGATGA TGATCGATGC CGACGATCGG GTCCAGGTCT GCAACCGGCG CGCCCTGGAA
CTCCTCGATC TACCCCCTGC CTTCATGGCG CGAAAGCCAA CTTTCACGGC GGTGCGCAAT
CACCAGCTCG CGCACAACGA GTTCGCCCAT TCCGACGAGG CCTTCCGCAC CTGGGTGGCC
ACGGCGGGAC TCGAGCCCAG GGAGCACACC TACGAGCGCG AGCGTCCGAA CGGCACCGTT
CTGGAGATCC GCACCGTGCC CCTGGCGGGC GGTGGTGCGG TGCGAACCTA CACCGACATC
ACCGCCCGGC ATGGCGCGGA GCGGGCCCGG CGTGAGAGCG AGGCCCGCTA CAAGGTTCTC
GCCGACAACG TCTCCGACCT GATCGTCCTC GGCCACGCCG ACGGGCGCCG CTCCTACATC
TCGCCCTCGG TGCACGCCAT GCTCGGCTAC GCGGTCGAGG AGGCGCACCG GATCAGGGTG
CGCGACTGCC TTCACCCGGA TGATCTGAAC CGGGTGTCGG CGGCTGCAGC GAGCCTCTCC
CGGGAGAGGC CGACCGCCTC GGTCGTGTTC CGCCTCAAGC ACAAGGCGGG GCACTACGTC
TGGGTCGAGG CCGCGTACAA GCGGATCGAG GACGCCGACG AGGTCACCAT CGTCACCGCC
ATCCGGGACG TCACCCAGCG GGTCCGTCAG GAACGCCATC TGGAGCGGGC CAGGGTCGAG
GCGGAGGCTG GCGCCCGGGC CAAGGCCGAG TTCCTGGCCA ACATGAGCCA CGAGCTCAGG
ACGCCGCTCA CCGGCATGCT CGGCGTGCAT GATCTCTTGG CCGGCGACGC CTCGCTGTCG
GACGCCCAGC GGCACCTGGT CGGGCTGGCG CAGGAGGCCG GGCGCTCGCT CCTGGCCATC
GTCAACGACA TCCTGGACTT CTCCAAGATC GAAGCCGGCC AGATGGCGAT CGAGAGCGTG
CCTTTCTCTC TGCGGGCGCT CGTCGAGAGC TGTCGGGCCC TGGTCGCGGA GAGCGCCAAG
GACAAGCCGC TCCGGCTGGT CACGGACATC GAGGGTGACG CGCCTGACCG GTTCGTCGGC
GATCCGACCC GCTTGCGCCA GATCCTGCTC AACCTCGCCA GCAACGCCGT GAAGTTCACG
CCGAGAGGTG AGATCACGAT GCGGGTTGGC TTCGCGGCGA GCACCGGGCG AGTGCGGGTC
GCGGTCACGG ACACTGGCAT CGGCATCCCG GCCGACAAGC TGCCCCTGCT GTTCGAGCGG
TTCAGTCAAG CCGACGCTTC GACCACCCGT CGCTACGGCG GCACGGGCCT GGGTCTGGTG
ATCTGCAAGC GTCTCGTGGA ACTCATGGGG GGCACGATCG GCGTGGAGAG CGTTCCTGGG
CGAGGCTCGA CCTTCTGGTT CGAACTGCCG CTGCCCTTGG CCGAGTCGGA TCGACAGGGT
CAGGCCGCGC CGCAAGTCCT CGCGGGTCCT GTTGCCCGCG GCCACCGGAT CCTGGTTGCC
GAGGACAACG AGATAAACCA GGAGGTGATC CGCGCCGTGC TGAGCCGCCG GGGCTACGAG
GTCGTGCTTG TTGAGGACGG GGCTCAAGCC GTGGAGGCGA TCAAGACAGG ACCGGCCTTC
GACATCGTGC TCATGGACGT GCAGATGCCG GTGCTCGACG GGCTTGGTGC AACGGCTGCG
GTGCGGGCGT GGGAGCGTGC GCAGGGTCGG CCTCCGATGC CGATCGTGGC GCTCACGGCG
AATGCCATGA ACGAGGATGC AGAGCGGTGC CGGGCCGGTG GCATGGACGC CCACGTCGGC
AAGCCGATCA AGTGGACGGA ACTGTTCGAT GTCATTGAGC AGCTGTGTTC CGCGCCCAAT
CGAGACACGC CACCTGCACC TCACTTGCAA ATCGGAATGT AA
 
Protein sequence
MQFVPLRPRS AYPLRGAVGI TLLTALLLVT LGAWQIMRSR DAALADARKD ARNLSRSLAQ 
HAERTIEAVD LILSGTTEWV EGNPDRATLG SFLVRRTGTI AQVRNIAITD ASGSWIADSL
SPHPPLSSAD RPYFAWLRDH PEDRAMFIGA LIRSHYDQKP ILPLARRLNN FDGSFAGVVV
AELEPDYFAQ FYATFGLGER GTIGLWSSNG RLLSRDPAAP TESQDKDFSS TPLFHNLRRI
STGVYDAPPA PIDGVKRIVA YERLVKYPAL VVTAGIATDD ALAAWRREAL VQASAISTAV
LALVGLGVGL GRRDRRVRAA EAEARASADL LAVTLENMDQ GLMMIDADDR VQVCNRRALE
LLDLPPAFMA RKPTFTAVRN HQLAHNEFAH SDEAFRTWVA TAGLEPREHT YERERPNGTV
LEIRTVPLAG GGAVRTYTDI TARHGAERAR RESEARYKVL ADNVSDLIVL GHADGRRSYI
SPSVHAMLGY AVEEAHRIRV RDCLHPDDLN RVSAAAASLS RERPTASVVF RLKHKAGHYV
WVEAAYKRIE DADEVTIVTA IRDVTQRVRQ ERHLERARVE AEAGARAKAE FLANMSHELR
TPLTGMLGVH DLLAGDASLS DAQRHLVGLA QEAGRSLLAI VNDILDFSKI EAGQMAIESV
PFSLRALVES CRALVAESAK DKPLRLVTDI EGDAPDRFVG DPTRLRQILL NLASNAVKFT
PRGEITMRVG FAASTGRVRV AVTDTGIGIP ADKLPLLFER FSQADASTTR RYGGTGLGLV
ICKRLVELMG GTIGVESVPG RGSTFWFELP LPLAESDRQG QAAPQVLAGP VARGHRILVA
EDNEINQEVI RAVLSRRGYE VVLVEDGAQA VEAIKTGPAF DIVLMDVQMP VLDGLGATAA
VRAWERAQGR PPMPIVALTA NAMNEDAERC RAGGMDAHVG KPIKWTELFD VIEQLCSAPN
RDTPPAPHLQ IGM