Gene M446_1365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1365 
Symbol 
ID6131008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1503343 
End bp1504404 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content64% 
IMG OID641641644 
ProductAraC family transcriptional regulator 
Protein accessionYP_001768315 
Protein GI170739660 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.122979 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.036458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGCGTGT GCATCGAGGC GGCTCCCTGC GCCCCCGCGC ACTCCGTTGC CGGGATAGGT 
GATTCGGGTT ACGTTCAGGT AACCTGGGCG TTCTGGGCAG TGCTTGCGAT GGAGCTTCAG
GGACACACCG CGTTCAAGTA CGTGACGGGC AGACGGTTGG GGACGTCGTC CGGTCGCGGC
TGGACGAGCG TCCTGGCCGA GCGCTGGGAT CATGAGGCTG GTGCCCTGCC CTCGCTGCTT
CCCCGGGAAA CCGAGGTTGC GGTTCTGCTC AGCGGCCGTT CCCTCGTGTA CCGCGAGGGG
GCTGGGTTGC GGCAGAGGAC TCCCGGTCAT TCCGGGACCG TCTGGCTCTG CCCGGCCGGC
ATCCGGGAAG AACGCATCGA CTTCGAGCAG CCGCTCCACG ATTGTCTGCA CATCTTCCTG
CCGCCCGATC CGTTCGCGGA GTGCGTGCTG CAGGATCTCA ATATCGATCC TGCTCGTGCG
GGGCTTCGCT ATGAAGCAAT CGCCTACGAT CCGTTCATCG AGCAGATTGC ATTCGCGATC
AACCGCGAGC TGCAGGCAGA AACCTCCGCC GGACGCCTGC TGGTCGAGTC GCTCGCCCGG
TCACTTTCGG CATATCTCGT TAACCGCTAT TCGGAACTTT CGACGCGGGC GATAGGATTT
TCGTCCGAGG CTAAGCCGAT CGACAGCCGG CGAATGTCGC GCGTTTTAGA GTTCATCGGA
GCCCGCCTTG ATCAGAACTT TACCGTAGCG GAACTGTCAT CAGTCGCCTG CATGAGTCAG
GCCCATTTCG CGCGCGCATT CAAGGCAACG ACCGGGCACG CGCCGCACGC GTTCGTAAGC
CGGATGCGTC TCGAATCAGC GAAGCGGATG CTGGCCGATG GCCTCCGGCC GATCGGCGAG
ATCGCCCTGG CTGCCGGCTT CTCTTCGCAA TCCAACTTCT CGAGGGCGTT CCGCAGCGCC
GTGGGCCTGC CGCCCGGTGA CTATCGCCGC AGCCAAGCCC GATCGCGAGC CGCATCGGGA
GATCTCACGC AGGCAGGGCA GCGATCGTCA GTCGCGGAAT AG
 
Protein sequence
MCVCIEAAPC APAHSVAGIG DSGYVQVTWA FWAVLAMELQ GHTAFKYVTG RRLGTSSGRG 
WTSVLAERWD HEAGALPSLL PRETEVAVLL SGRSLVYREG AGLRQRTPGH SGTVWLCPAG
IREERIDFEQ PLHDCLHIFL PPDPFAECVL QDLNIDPARA GLRYEAIAYD PFIEQIAFAI
NRELQAETSA GRLLVESLAR SLSAYLVNRY SELSTRAIGF SSEAKPIDSR RMSRVLEFIG
ARLDQNFTVA ELSSVACMSQ AHFARAFKAT TGHAPHAFVS RMRLESAKRM LADGLRPIGE
IALAAGFSSQ SNFSRAFRSA VGLPPGDYRR SQARSRAASG DLTQAGQRSS VAE