Gene M446_1582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1582 
Symbol 
ID6134665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1765353 
End bp1766456 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content66% 
IMG OID641641848 
ProductAraC family transcriptional regulator 
Protein accessionYP_001768517 
Protein GI170739862 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.116449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCCG ATGTGCCGTC GAGTGGCAAT CAAGGCCGCG ACCCAGAGTT TCAGAAGGGT 
CTTATATCAA AAGATTCAAA ATGTTTCAGC GCACCGTCTA TCCCATTCAA TTTCACCCAC
GTCTTCGCAG TCAAGGCCGT CTACGATATA CTGTTAGAGC TTGAGCTAGA TCCTGAAACT
GTCCTTGAAG AAGCCGGGAT CGATGCCCAG ATCTTCGAGA CGGTGGAAAC GGTCTCTTTC
GCGGCGTTGG GCCGCCTGAC GGCACTGGCC GCCGACCAGG CGCGGTGCGC CCATTTCGGC
CTCCTCGTTG GTCAGCGCAC CACCCTCGGC TCGCTCGGCC TGCTCGGGAC GCTGATGCGC
CACTCCGAGA CGATCGGCGA CGCCCTGCAG GCCTTGCAGA CGCACCACGA TCTCCTGAAC
CGCGGCGCCA TGATCGAGCT GGCGGTCGAT GGTGCCGTCG CGACCGTGAG CTACGCCCCC
TACGCGTCCG ACGTCGACGG CGTCGCGCTC CACTGCGAGA GGGCCGTCGC GGCCCTGACC
AACGTCCTGC GCGCGCTGTG CGGCGCGAAA TGGAGCCCGG ACGAGGTGCT GCTGCCGCGG
CTCGAGCCGC CGGACACGAC ACCCTACACG GGCTTCTTCC GGGCCCCGGT CCGGTTCGCG
CAGGAGATCG CCGCGCTGGT GTTCCCCGCA CGGCTCCTGA AGCGGCCGGT GGAGGACGCC
AACCCCGTCA TTCGCGCGAG CGTGGAGCGG CGCATCCAGC AATGCGAGGC CATCCTCCCG
TCCGACGTCA CCGACGAGGT CCGGCGGCGC GTGCGCTCCA GGATCATGCA GAAGCGAATC
GAGAAGGATC AGGTCGCCCA GACGTTGGCG ATCCACCAGC GCACGCTGAC CCGCCGCCTG
AAGGCCGAGG GAACGACGTT CCGGTCCATC GCGAACCAGA CGCGGCTCGG CATCGCCAAG
CAGCTGCTGG CCGACACCAC CATGAGCTTG GCGCAGATCT CATCCGTGCT CGAATTCTCG
GAGCCCGCCG CCTTCACGCA CGCCTTTCGG CGCTGGACAG GCATGACGCC GAGCGCGTGG
CGCAACGAAC GATGGGCCAA GTAG
 
Protein sequence
MPSDVPSSGN QGRDPEFQKG LISKDSKCFS APSIPFNFTH VFAVKAVYDI LLELELDPET 
VLEEAGIDAQ IFETVETVSF AALGRLTALA ADQARCAHFG LLVGQRTTLG SLGLLGTLMR
HSETIGDALQ ALQTHHDLLN RGAMIELAVD GAVATVSYAP YASDVDGVAL HCERAVAALT
NVLRALCGAK WSPDEVLLPR LEPPDTTPYT GFFRAPVRFA QEIAALVFPA RLLKRPVEDA
NPVIRASVER RIQQCEAILP SDVTDEVRRR VRSRIMQKRI EKDQVAQTLA IHQRTLTRRL
KAEGTTFRSI ANQTRLGIAK QLLADTTMSL AQISSVLEFS EPAAFTHAFR RWTGMTPSAW
RNERWAK