Gene Mext_0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0045 
Symbol 
ID5835745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp52134 
End bp53255 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content63% 
IMG OID641365829 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_001637544 
Protein GI163849501 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGCCAAC GCCCGCCCGC GGTCGTTGGA GACATCGGAT CGCACGAGAG GCCATCTCTC 
ATTGGCTTGA CTCGCCCCGG AGTCGCGAAG GCGGTCTACC TCACTCTCGT CGAACTGGGC
GCCAATTTGG ACGAACTGCT CGCCGAGGCG GGGCTTGATC CTCGGACCTT CGACGGCGGC
AGGACCCCTG TCCCATACGC CTCGCTCGGT CGCCTGATCG CTCTGGGAGC CGAGAGGACG
GGTTGTCACC ATCTCGGGCT CCTTGTTGGA CAGCGCGCGA CATTGGCCTC GCTCGGGCTG
CTTGGCCTGC TCATGCGCCA CTCGGACACC ATCGGCGGCG CTTTGCGGGC TCTTGAAGCG
CATGCCGGTG TGCGGAACTG GGGCGCAGTC GTCGGACTCG ATATCGACAG TGAGGTGGCC
GTTCTCAGCT ACTGCCCTTA TGGCTCGGAA GCCGAGAGCA CGGCCCTCCA ATCAGAGAGG
GCACTCGCCA CAATTACAAA CGTCATTCGG GCGTTGGGTG GCTCTGATGC GGCTCTATTA
GAAGTGCTGT TGCCGCGCTC CGCGCCACGC GACACAGCGC CCTACATCAG CTTCTTTCGG
GCGCCCGTGC GGTATGACCA AGAAACGGCC GCGTTGGTGT TTCCAACTCT ACTCCTTGAA
CGGCGCATCA AGGGGGCGGA CCCGGCAGCC CGCGGGAGAG TTGAGGATCG CATCCGCAAG
CTTGAGGCCG AACAGCCTTC CACGCTGAAG GACAAGCTTC GCGAGTACCT CCAAGCCCAG
GTGATGCGGC AGCGCTGTAA GGCCGCGCAT GTGGCGCGAC TGCGACTGGT CCCCCCCCGT
ACCCTGCGTC GTCGGCTGAA AGCCGAGGGC ACGACGTTCA AGCAAATCGC TAACGAAGCG
CAGTTCTCAG TCGCCAAGCA GCTCCTAGCC AATACCAGAA TGAGCATGGC GCAGATCTCG
GCGGCCTTGG ATTTCTCCGA GCCCGCTGCC TTTAGCCATG CGTTCCGACG CTGGTCAGGC
TTCGCGCCCA GTACATGGCG GCGGGAGCAT CAGTCGAAGT GCCTTGGTCG AGAGCAGGAC
GAAAATTCCT ACTCCGCACA GACACAGCAG CCGGTCCGAT AG
 
Protein sequence
MCQRPPAVVG DIGSHERPSL IGLTRPGVAK AVYLTLVELG ANLDELLAEA GLDPRTFDGG 
RTPVPYASLG RLIALGAERT GCHHLGLLVG QRATLASLGL LGLLMRHSDT IGGALRALEA
HAGVRNWGAV VGLDIDSEVA VLSYCPYGSE AESTALQSER ALATITNVIR ALGGSDAALL
EVLLPRSAPR DTAPYISFFR APVRYDQETA ALVFPTLLLE RRIKGADPAA RGRVEDRIRK
LEAEQPSTLK DKLREYLQAQ VMRQRCKAAH VARLRLVPPR TLRRRLKAEG TTFKQIANEA
QFSVAKQLLA NTRMSMAQIS AALDFSEPAA FSHAFRRWSG FAPSTWRREH QSKCLGREQD
ENSYSAQTQQ PVR