Gene Mext_3081 details

Gene Information

Locus tag: Mext_3081
Symbol:
ID: 5835478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3427981
End bp: 3429519
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 72%
IMG OID: 641368882
Product: CHAD domain-containing protein
Protein accession: YP_001640541
Protein GI: 163852498
COG category: [S] Function unknown
COG ID: [COG3025] Uncharacterized conserved protein
TIGRFAM ID:
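
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the record does not state either convention explicitly); the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3427981
end_bp = 3429519
gene_length_bp = 1539
protein_length_aa = 512

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the number of amino acids is the codon count minus one.
codons = gene_length_bp // 3
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.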


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.261828
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACC CGCGCGAGAT CGAGCTGAAG CTGGAATGCG AGCCCTCCGA CCTCGCCGTC 
CTGCAGGATC ATCCGCTCCT GCGGGAGGCG GCGGTCCAGG GGGAAGCCGA GCTCGCCTCC
GTCTATTTCG ACACGCCGGA TCGGCAGCTG CACGCGGCCG GGCTCGGGCT GCGAGTGCGC
GAGAGCGAGG GGCGCTTCGT CCAGACCCTC AAGGCCGAAG GCGATGGCCT GTTCGACCGC
CCGGAATGGG AGCAGCCGGT CGAGGGCGCC GAGCCGGACC GGGCGGCGCT CGCCGACACG
CCCTTCGCGC GCCGCGTCGC GGACGATGCC GCGCTGGAGC CGCTCTTCAC CAGCCGCGTC
ACCCGGCGGA CCTACCTCGT CGAGCAGGGT GAGTCCCGCA TCGAGGTCGC CCTCGATCTC
GGCCGGATCG AGTCGCCGGC CGCCGGCGAC GACATCCTGT CGATCTGCGA GATCGAGCTT
GAACTGAAGG AGGGGACCGC GAGCGACGTG TTCGCGCTCG CCTACGCCAT CGCCGCCCTC
GTGCCGGTGC GGCTCGGCGT GCGCAGCAAG GCCGAGCGCG GCTACGCCCT GGCCGCCGGC
AAGATCGACC GGGTGCGCAA GTCGGAGCCG GTGCCGCTGC ATGACGACAT GAGCGCGGCG
GAGGCGTTCC GCGCCGTCGC CCATGCCTGC CTGCGCCACA TGCGGATCAA CGAGGACATC
CTGCTCAAGA GCCGCGACGC CGATGCGCTG CACCAGATGC GCGTCGCGAT CCGGCGCCTG
CGCTCGGCCT TCTCGCTGTT CGGCGACCTC GTGGACGATC CGCTCGGCGT TCGCATCCGC
GCGGAGCTGA AGGCGGCGAC CGAGCCGCTG GGCCGGGCGC GAAATCTCGA TGTCTTCCTC
GCCACCATCC TGCCGGCCGA GCGCGAGCGC CATCCCGACG AGGTCGGCCT GCTCGGCCTC
GAGCGGCAGC TCGAAGACGA GCGCGCGAAG GCCTATCGCG ACCTCGCCGC GCTGCTGCGC
TCCGATGCGT GGCGGATGCT GCTGCTCGAC CTGATCGGCT GGATCAATGC CGGCCCCTGG
CTGCGGGACG ACAGCCCCGG CCGCGTCTCC CTGCGCGAGG AGCCGGCCCG CGTCTTCGCC
GCCCGCGAAC TCGACCGGCG GCGGCGGCAG GTGAAGCGGC GCGGGCGCCA TCTCGACGAC
CTCGAGCCCG AGGAGCGCCA CCGGGTGCGC ATCGCCGCCA AGAAGCTGCG TTACGGCGCG
GAATTCTTCG CGCCCCTGTT CCCCGGCAAG AAGGCGGGCA AGCGCCACGG CGCCTTCGGC
AAGGCCCTCT CGGATCTGCA GGACCATCTC GGCGCGCTCA ACGACATCGC CACCGGCCAC
GAATTGATGC GGGACCTGAG GGTCGAGCCG GCCGGCGCCA CGACCCTGTT CGCCGCCGGG
ATGACGGCGG CCGATATCGA GGCGCGCAGC CGCAAGCTCT TGGAGGCGGC GGCCGAGGCG
CACGAGGATC TCGTCGACAC CAAGCCGTTC TGGCGTTGA
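
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as the GC content reported in the gene information. A minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full sequence length; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Only the first line of the gene sequence is used here for brevity;
# paste the full listing to evaluate the whole gene.
first_line = "ATGAGCGACC CGCGCGAGAT CGAGCTGAAG CTGGAATGCG AGCCCTCCGA CCTCGCCGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```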
 
Protein sequence
MSDPREIELK LECEPSDLAV LQDHPLLREA AVQGEAELAS VYFDTPDRQL HAAGLGLRVR 
ESEGRFVQTL KAEGDGLFDR PEWEQPVEGA EPDRAALADT PFARRVADDA ALEPLFTSRV
TRRTYLVEQG ESRIEVALDL GRIESPAAGD DILSICEIEL ELKEGTASDV FALAYAIAAL
VPVRLGVRSK AERGYALAAG KIDRVRKSEP VPLHDDMSAA EAFRAVAHAC LRHMRINEDI
LLKSRDADAL HQMRVAIRRL RSAFSLFGDL VDDPLGVRIR AELKAATEPL GRARNLDVFL
ATILPAERER HPDEVGLLGL ERQLEDERAK AYRDLAALLR SDAWRMLLLD LIGWINAGPW
LRDDSPGRVS LREEPARVFA ARELDRRRRQ VKRRGRHLDD LEPEERHRVR IAAKKLRYGA
EFFAPLFPGK KAGKRHGAFG KALSDLQDHL GALNDIATGH ELMRDLRVEP AGATTLFAAG
MTAADIEARS RKLLEAAAEA HEDLVDTKPF WR
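
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, using the NCBI amino acid ordering for codons enumerated over T, C, A, G. The names `CODON_TABLE` and `translate` are illustrative. Table 11 also permits alternative start codons that are read as methionine; that case is not handled here, which is harmless for this gene because it starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in NCBI order (third base varies fastest); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGAGCGACC CGCGCGAGAT CGAGCTGAAG CTGGAATGCG AGCCCTCCGA CCTCGCCGTC"
last_line = "CACGAGGATC TCGTCGACAC CAAGCCGTTC TGGCGTTGA"
print(translate(first_line))  # MSDPREIELKLECEPSDLAV
print(translate(last_line))   # HEDLVDTKPFWR
```

Both outputs match the corresponding start and end of the protein sequence listing. The last line can be translated on its own only because the preceding 1500 bases are a whole number of codons, so it begins in frame.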