Gene M446_2138 details


Gene Information

Locus tag: M446_2138
Symbol:
ID: 6130857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2386532
End bp: 2387692
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 76%
IMG OID: 641642366
Product: putative PAS/PAC sensor protein
Protein accession: YP_001769034
Protein GI: 170740379
COG category:
COG ID:
TIGRFAM ID: [TIGR00229] PAS domain S-box
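
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2386532, 2387692)
print(length, protein_length_aa(length))  # prints "1161 386"
```

Both results match the Gene Length and Protein Length fields.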


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0223192
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.334318
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCAGG ATGCGATAGG ATCCGACGCC ATCCGGGCCG CGCTCGCCCT CCTGCTGCAG 
CAGCCTCAGC TGCGCAAGTC GCCCCAGCTC TCGACCTTCC TGTCCTACGT GGTGGGCGAG
AGCCTCGCGG GACGCGGCAG CCTGCTGAAA TCCTACACCA TCGCCACGGA CGCCCTCGGC
CGGCCGGCCA ATTTCGATCC GGCGACCGAC GCCATCGTGC GGGTGGAGGC GCGGCGGCTG
CGCCAGGTGC TGCAGCAGAT CTACGAGGAT CCCGCCTGCC CCCTCAGCGT GCGGATCGAG
CTGCCGCTCG GGCGCTACGA GCCGACCTTC ACGCGGATCA CGCCGGCGAC CTCCCGCAAT
CCCGTCCCCG ACCCCGAGGC GAGCCTGCGC GAGAGCGAGC AGCGCTACCG CGCCCTCGTG
GAGGCGAGCG CCGCCATCGA GTGGCGGGCG AGCCCCGACG GACGCTTCAT CCGCAGCTTC
GGCTGGACCG CGCGGACCGG CGAGCCCGAG GACCGGCTGC GCGACGAGGG CTGGCTCGAC
GCGCTCCACC CCGAAGACCG GGGCCGGGCC ACCGAGGCCT GGGCGCAGGC CCGGCGCACC
GGCGAGCCCC TCGAGATCGC CTACCGGGTC CGGCACCGGG GCGGGCATTA CCGCTGGATG
CTGGCGCGCG GCATCCCGAT CGAGAATCTC GACGGCAGCA TCCGCGAATG GGTGGGCACG
CTGTCGGACA TCCACGAGCA GGAGACGGCC GAGGAGGCGC AGCGCGCCCG CAGCGAGGGC
CTGCGGCTCG CCCTCACGGC CGCCGGCCTC GCCGCCTGGG AGCTCGACCC CGAGACCCGG
TCGGTGTGCT GGTCGCAGCC GCCGCCGGAC CGGATCGAAC CGCCCGGCGA GGCCGCGCCC
CGGGGCGGGC CGGCCGAGGA GGAGCCGCTC GACGCGTGGG TGGCGCGGCT CGACCCCGCG
GACGGGCCGC GCCTGGTCGC GGCCCTGGAA CGCGCCCTGC GGGGCGGGGG CGACGTCGAT
CTCGTCTACC GCAGCCGCGC GCCGGCCGAG CGGCCCCGCC GCCTCGCCTG CCGCGGCGGC
CTCGTGCGCA ACGCCCGCGG CGAGGCGCGC CTCGCCGGCG TCGTGGCGGA TGTCACCGGG
CGCGCGTCGC CGCTGCCTTG A
 
Protein sequence
MDQDAIGSDA IRAALALLLQ QPQLRKSPQL STFLSYVVGE SLAGRGSLLK SYTIATDALG 
RPANFDPATD AIVRVEARRL RQVLQQIYED PACPLSVRIE LPLGRYEPTF TRITPATSRN
PVPDPEASLR ESEQRYRALV EASAAIEWRA SPDGRFIRSF GWTARTGEPE DRLRDEGWLD
ALHPEDRGRA TEAWAQARRT GEPLEIAYRV RHRGGHYRWM LARGIPIENL DGSIREWVGT
LSDIHEQETA EEAQRARSEG LRLALTAAGL AAWELDPETR SVCWSQPPPD RIEPPGEAAP
RGGPAEEEPL DAWVARLDPA DGPRLVAALE RALRGGGDVD LVYRSRAPAE RPRRLACRGG
LVRNARGEAR LAGVVADVTG RASPLP
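
The sequences above are printed in blocks of 10 characters, 60 per line, so they need the whitespace removed before any analysis. A minimal sketch of that cleanup, plus a GC-fraction helper, is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs on the first line of the gene sequence only, so its GC value is not expected to equal the whole-gene figure reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used as a small example.
first_line = "ATGGACCAGG ATGCGATAGG ATCCGACGCC ATCCGGGCCG CGCTCGCCCT CCTGCTGCAG"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full gene sequence would allow a comparison against the GC content field.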