Gene M446_1578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1578 
Symbol 
ID6134661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1759629 
End bp1760759 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content72% 
IMG OID641641844 
ProductAraC family transcriptional regulator 
Protein accessionYP_001768513 
Protein GI170739858 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.321807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCAG CCATCAGTTG GAGTTGCGAG CGATCGCGGC CGGAGGCGAG GCGTGCCGAG 
ACCAGCATCC CTCCGCGCCC GGCCGCCCCT GCCGCCGTTC ACGGCAGCGC CGTCGAGGCG
ATCTGCGAGG TTCTGGTGGC GCTCGGGATC GAGCCGGCTC CGCTGCTGGT GCAGAGCGGC
ATCGGCCCGC GATGCCTGGA GGGAGCCGGG GCGCTCTCGT TCGAGAGTCT CGGACGCTTG
ATGGCGCTCG CCGCCCGCCG GAGCGCCTGT CCGCATTTCG GCCTCCTGGT CGGCCAGCGC
ACCACGCTGG CCTCGCTCGG GCTGCTCGGC GTGCTGATGA GGAACTCGGA GACGGTCGGT
GACGCCCTGC GGGTGCTGGA GACGCATCAC GGTCTTCTCA ACCGAGGGGC CGTGATCCGC
GTCGCGGTGA ACGGCCCGCT CGCCATCGCC AGCTACTCGC CCTACCGGCC CGAGGCCGAG
GGGATCGCGC TCCATTGCGA GCGGGCGCTC ACGGCGATGA CGAACGTGAT CAGATCCCTC
TGCGGGGGCG ATTGGGCGCC CGAGGAGGTG CTGCTGCCGC GCCTGGGGCC GGACGATGCG
ACGCCCTACG CGAATGTCTT TCGCGCTCCC GTCCGCTTCG GACAGGAGAT CGCGGCGCTG
ACCTTCCCGG CGCGCCTCCT CGGGCGGCCG ATCGGGGACG CGAGCCCGAT CGTGCGCAAG
CTTGCCGAGC AGCGCATCCG CCAGTTCGCG GCCAGCATGC CCGCGGACCT GACGGACGAG
CTGCGCCGGC ACCTGCGTGC CACCTTGACG CAAGGAGAGC TGAGCGCGCG CCAGGCCGCG
GAGGCGCTGG CGGTTCACCG GCGGACGCTG AGCCGGCGTC TGAGGGCCGA GGGAACGAGC
TTCCGATCGG TCGCGAACGA GACGCGCCTC TCCGTCGCCA AGCAGCTGCT GGCCGACACC
AACCTGAGCT TGGCGGAGAT CTCCGTCGCC CTGGAATTCT CGGAGCCCGC CGCCTTCACC
CATGCCTTCC GGCGCTGGAC CGGGACGACG CCGAGCGCGT GGCGCAAGCA GCGGCGAGAT
CCGAGCGGCG GCGAGATGAG CGACGGCCGT CGCGCTGCCG CGGGCGCGTG A
 
Protein sequence
MMSAISWSCE RSRPEARRAE TSIPPRPAAP AAVHGSAVEA ICEVLVALGI EPAPLLVQSG 
IGPRCLEGAG ALSFESLGRL MALAARRSAC PHFGLLVGQR TTLASLGLLG VLMRNSETVG
DALRVLETHH GLLNRGAVIR VAVNGPLAIA SYSPYRPEAE GIALHCERAL TAMTNVIRSL
CGGDWAPEEV LLPRLGPDDA TPYANVFRAP VRFGQEIAAL TFPARLLGRP IGDASPIVRK
LAEQRIRQFA ASMPADLTDE LRRHLRATLT QGELSARQAA EALAVHRRTL SRRLRAEGTS
FRSVANETRL SVAKQLLADT NLSLAEISVA LEFSEPAAFT HAFRRWTGTT PSAWRKQRRD
PSGGEMSDGR RAAAGA