Gene M446_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0039 
Symbol 
ID6135424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp42036 
End bp43145 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content76% 
IMG OID641640382 
ProductLuxR family transcriptional regulator 
Protein accessionYP_001767061 
Protein GI170738406 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0051974 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGATG TTCGCGACGA TCGTCTGGCT GACGCGATCG ATCTGTTCTA CGAGGCGGCG 
CTGAACCCGG AGCGCTGGCC GGCGGTGCTG GAGGCCTACG GGCGCGCCGT CGGCGCGGAC
GGCGCCGTGA TGCTGCCCGG GCCGGCGGCG CCGCTCGCGG CCTCGGTGTC GGAGGACATG
GCCGAGGTAG TGGAGGCGGG GGTCCGGGAC GGCTGGCTCG CCGAGAACCC GCGGATCGCG
CGCGGCATCC AGGCCCTGAA GGATCCGCGC GGCGTGATCA CCGAGTCCCA GATCTTCACG
CCGCGGGAAC TCGACCACAT CCCGTTCAAC GCGGATTACG TGGGACGCCA CGGCTACCGC
TGGTTCGCGG GCCTCTACAT GGTGGCGGAG GGGGAGCGCA GCGTCATCCT GTCGGCGGAG
CGCCGCCGCG AGCGCGAGAT GTTCTCGCAC CGGGAGATCG CGCAGATCCG GCGGGCGGTG
CCGCACCTGC AGCGGGCCGG GCAGATCGCC CTGCGGATCG CCGAGGCGCG GGCCGGCGCC
GCGCTCGACG CCTTCGAGAC CCTCCGCTGC GGCGGCCTGC TGCTCGACGC GAGCGGCACG
GTGCTGCGGA TGAACGCCCA CGCCGAGCGG CAGCTCGGCC GCGGCATCGC CGTCGTGAAG
GGGGCGCTGC TCGCCCAGGA TCGGGCGGCG AACGCCGCCC TCGGCCGCCT GATCGCGAGC
GCGATCCGCG CCGGCCGGCC GCACGAGGGG GCGGCCGAGG GGCCGGTGGC GGTGCCGCGG
CCCGAGGGCC CGCCCCTGAT CCTGCACGCG GCCCCGCTCG CCGGGGCGGC GCAGGACCTG
TTCCAGCGCG CCCGCGCGGT GATCCTGGTG GTCGATTCCG CGGCGGGCGG GCGCCCGGGC
GAGGCCCTGC TGCGGCAGGC CTTCGGGCTC ACGGCCGCGG AGGCGCGCCT CGCCCGCGAC
CTCGCCGAGG GGGGCGAACT CGCGGCGGTC GCGGCGGCGC ACGGCATCAC GATCGCCACC
GCGCGCTCGC AGCTCAAGGC CGTCTTCGCC AAGACACGCA CCCACCGCCA GCCGGAACTC
GTCGCGCTGC TGGGGCGGAT GCGGCTGTGA
 
Protein sequence
MTDVRDDRLA DAIDLFYEAA LNPERWPAVL EAYGRAVGAD GAVMLPGPAA PLAASVSEDM 
AEVVEAGVRD GWLAENPRIA RGIQALKDPR GVITESQIFT PRELDHIPFN ADYVGRHGYR
WFAGLYMVAE GERSVILSAE RRREREMFSH REIAQIRRAV PHLQRAGQIA LRIAEARAGA
ALDAFETLRC GGLLLDASGT VLRMNAHAER QLGRGIAVVK GALLAQDRAA NAALGRLIAS
AIRAGRPHEG AAEGPVAVPR PEGPPLILHA APLAGAAQDL FQRARAVILV VDSAAGGRPG
EALLRQAFGL TAAEARLARD LAEGGELAAV AAAHGITIAT ARSQLKAVFA KTRTHRQPEL
VALLGRMRL