Gene M446_5539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5539 
Symbol 
ID6130122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6076192 
End bp6077286 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content56% 
IMG OID641645671 
ProductAraC family transcriptional regulator 
Protein accessionYP_001772286 
Protein GI170743631 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGAGG GGACGGTGTC TAAGAGGCTG CCGCTGCCAA GCGAGAGGAT CTACGCTCCG 
TACAAAATAG CCGCGCTGGT TGAAATCTTG GGCGAGCAGG GCATCCCGCC CGAGGAGGCT
CTGCGCGATA CGGGGGTTGA GGCAAGCAAA ATCTATGATG CCTCGGCTCT TACGTCGGTG
CGGCAGTACG TGGCGGTTTG CAGAAATGCA CTCGCGCTAT CGTGCGAACC AAGAACACCT
TTTCAAGTCG GCGCGCGCCT GCATCTTTCT GCATACGGTA TGTATGGGTA CGCTTTGATG
TCGTGTCTTT CACTCCGGGA CTACTTTAGA CTCGGAGTAA AATACCATTT ATTGGCTACC
CCGACCCTTG CAATAGAGTG GAGAGAGTAC CCGAATGTGG CGGTGTGGAC GTTTCCTGAC
GAATTTACAT TTGCTCCATC CAAGGAACTT CGGCAGTTCC TGATAGAACA GCAGTTTACA
CAACAGGTCA CTCACCTGCA GGACGTTGCC GGCCGGAGTT GCCCACCGGC CAAGGCGCAT
TTTTCGTACT CGGCGCCGGA ACATGCCGCT ATTTATGCCG ATTATCTTGG GTGTCCGTGT
TTTTTTGAGC AAGAACATTG CGAGTTGCAC TACGATAGCG CCATTCTCGA ACAAAAGCCC
CAGCTTGCGC ATCGGCTGAC GTCCGCTCTG CTTCAGGACG CGTGCGATAC TCTGATCGGA
AAGGCTAATG CGTCGGCCGG TGTCGCCGGT GAGGTCTACC AGATCTTGAT GAGATCGCCC
GGCGTGTTCC CTGATATGGA AGATGTAGCA CAGACCCTGC GTATGACATC TCGGACACTA
CGGCGCCGCC TCGACGCCGA ACAGGTATCA TTTTCAGCAA TTATCGATGA CGTCCATCGT
TCGCTGGCAA CGGAATATCT GCGAATGACA AGTATGAGCC TTGAGGACAT CGCGCTGCTT
GTCGGTTTCA GCGATGCCGC GAACTTCCGG CGAGCCTTCA AACGGTGGAC CGGGAAAAAT
CCAGGGGAGT TCCGTGGCGA GATGCCGCTA AGGGCGACGC ACCGGCATCA TGTCCCCCGT
CACTCCGGTT CTTAA
 
Protein sequence
MLEGTVSKRL PLPSERIYAP YKIAALVEIL GEQGIPPEEA LRDTGVEASK IYDASALTSV 
RQYVAVCRNA LALSCEPRTP FQVGARLHLS AYGMYGYALM SCLSLRDYFR LGVKYHLLAT
PTLAIEWREY PNVAVWTFPD EFTFAPSKEL RQFLIEQQFT QQVTHLQDVA GRSCPPAKAH
FSYSAPEHAA IYADYLGCPC FFEQEHCELH YDSAILEQKP QLAHRLTSAL LQDACDTLIG
KANASAGVAG EVYQILMRSP GVFPDMEDVA QTLRMTSRTL RRRLDAEQVS FSAIIDDVHR
SLATEYLRMT SMSLEDIALL VGFSDAANFR RAFKRWTGKN PGEFRGEMPL RATHRHHVPR
HSGS