Gene M446_0512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0512 
Symbol 
ID6129211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp610156 
End bp611187 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content72% 
IMG OID641640834 
ProductAraC family transcriptional regulator 
Protein accessionYP_001767509 
Protein GI170738854 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.594629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00245431 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGATC GGACCATCCA GAGCGGCGGG AGCGACGGCG CGCGGGTGCT CAGCCACAAG 
TTCGAGCCGC CCGCCGATCC GCGCGACCTG GCGCGCGCGT GGTGCGAGCA CATCGCACCC
GCCTTCGAGG TCGGCCTGCG GCCCGAGGCC GACCTGTCGG CGCCGATCGC GATGCAGACC
TACCACCTCG GGGACCTGAT CGTCGGGGAC GTGATCGCGC CCGCCCACGT CCTGGAGCGC
GGATCGCGGA TGATCGAGCG GCAGGGGATC GACCACATCC TGATCCAATT CTACCGGCGC
GGGCAGAGCA CCGTCGAGCG GCGCGACGGG AGCGAGCGGG TCACGGAGGG GCAATGCGTG
GTCTTCGATC TCGCCCAGCC CGTCCGCATC GTCGCCGAGC CGGTCGATGC GACGAACCTC
GTCGTGCCCC GCGCCCGCCT GGAGGACCAG GGATGCCAAG TGGGCGGCCT CCACGGCCGC
GCCTTCGACT ACGACGGCGA CCCGTTCGGA CGGCTGTTCC ACGAGTTCCT CGCCAACCTC
GTCGCCTGCG GCGACCTGCT CCATCCGCGC GAGGCCGCCG CCGGCGCGCG CGCCCTGGTG
CAGCTCTGCG ACACCTTCCT GCGCGGGCGC GCGGGGAACG GCCCCCCGCA GAACCTCGAC
GCGCGCATCC GGGTCCGGCG CTTCATCGAG CGTCAGCTTC ACGATTTCGA CCTGGGCCCG
GCCATGATCG CGGCGCAGCT GGGCCTGTCG CGCTCCACCC TGTACCGCCT CTTCGGTGAG
ACGGGCGGCG TGCTGGCCTA TATCCGGGAC CGCCGCCTGA TGCGCGCGAT GCGCCTCCTG
GTCCGGTCCG ACGCGGCGCA GCCGATGCGG ATCTCGCAAC TCGCCTACGC GGTCGGCTTC
GCCGACGAGA AGACGTTCCG GCGCGCCTTC CGGCGCCGGT TCGGGTTCCT GCCGAGCGAG
GCGATGGCCT ACCAGCTCGG CCCCGACGAT GCCGGGATGC CGGTCCTGCG CCGCTGGTTC
GACAACCTGT AG
 
Protein sequence
MADRTIQSGG SDGARVLSHK FEPPADPRDL ARAWCEHIAP AFEVGLRPEA DLSAPIAMQT 
YHLGDLIVGD VIAPAHVLER GSRMIERQGI DHILIQFYRR GQSTVERRDG SERVTEGQCV
VFDLAQPVRI VAEPVDATNL VVPRARLEDQ GCQVGGLHGR AFDYDGDPFG RLFHEFLANL
VACGDLLHPR EAAAGARALV QLCDTFLRGR AGNGPPQNLD ARIRVRRFIE RQLHDFDLGP
AMIAAQLGLS RSTLYRLFGE TGGVLAYIRD RRLMRAMRLL VRSDAAQPMR ISQLAYAVGF
ADEKTFRRAF RRRFGFLPSE AMAYQLGPDD AGMPVLRRWF DNL