Gene M446_5067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5067 
Symbol 
ID6135292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5554152 
End bp5555156 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content79% 
IMG OID641645202 
ProductAraC family transcriptional regulator 
Protein accessionYP_001771827 
Protein GI170743172 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0666766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGACG GACTCACGCG AAACCCGGAT CCGGGGGCGC CGCCGGGCGG CCTCGCGCCG 
GTGTGGATCC TCGCCGAGGG CGCGCCGCCG GGCGGCGGCT TCGCGCCGCC CCGCGCCGCG
GGCCGGACCG CCCCCGTCCG CCGCGCGCCG CCCCCGGAGC TCCCGTTCGG GGCCGCGGCT
TTGCTGCGCC TGGAGCCGGA TATCGGGCTG ATGTCCTGCT TCCGCCCGAC GCCCGAGACC
GACCTCCCGG TGGCGAGCCC CGTCCTGCCG GGCGGCGCCG TGCTGCTGCG CCCGCACGGC
GGCGCCGTCC GGGCGCGGGT GGGGGGGACC CACGTCCTCG TCGAGGACGG CGAGGCGATC
CTGCTTGCGG GGCCGGCGAG CCTGCGCGTC GCGGATGCGG GGCGCCTCGA CGCGCTCGCG
CTGCCCGCGC GCGCCGTCAC GCCCGCCATG GCGGAGGTGG CCGCCTCCCT CCGGGTCTTC
CCTCGGGACA GCGCGGCCTT GGCCCTGCTG CACCATTACG GCGCGGCCCT GATGCGGGGG
CTGCTGCCGG TGGCGACGGG CGCGCTGCGC GAGCACGCCC TCGGGCACAT GGCGGGCCTC
GTCGTGATCC TGTGCGCCGA CCCGGCGCCG GGCCCCGTCC CCGCGCCCCT CGACCGCGCG
GCGGCCCGGA TCGGGGCGAT CAAGGCCGAG ATCGAGCTGC GCCTCGACGA CCGCACGATC
ACGGCGCGGC GCGTCGCGCA GCAGCACGGG ATCAGCCTGC GCTCGCTCCA GAAGCTGTTC
GAGGCGGAGG GCCGGACCTT CTCGGACTTC GTGCTGGAGC GCCGGCTCGA CCGGGCGTTG
CGCCTCCTGC GCTCGCCCGC GCGGCGGCGC CAGCCGATCA GCGCGATCGC CTTCGAGGTC
GGGTTCGGCG ACCTCTCCTA CTTCAACCGC ACCTTCCGGC GGCGCTACGG GATCGCGCCG
CGCCGGGCCC GCGCCGCGCC GGGCGATCCG CCCGAGGGCC ACTGA
 
Protein sequence
MEDGLTRNPD PGAPPGGLAP VWILAEGAPP GGGFAPPRAA GRTAPVRRAP PPELPFGAAA 
LLRLEPDIGL MSCFRPTPET DLPVASPVLP GGAVLLRPHG GAVRARVGGT HVLVEDGEAI
LLAGPASLRV ADAGRLDALA LPARAVTPAM AEVAASLRVF PRDSAALALL HHYGAALMRG
LLPVATGALR EHALGHMAGL VVILCADPAP GPVPAPLDRA AARIGAIKAE IELRLDDRTI
TARRVAQQHG ISLRSLQKLF EAEGRTFSDF VLERRLDRAL RLLRSPARRR QPISAIAFEV
GFGDLSYFNR TFRRRYGIAP RRARAAPGDP PEGH