Gene Mext_3839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3839 
Symbol 
ID5833469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4265817 
End bp4267520 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content70% 
IMG OID641369629 
Producthypothetical protein 
Protein accessionYP_001641282 
Protein GI163853239 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases
[COG3453] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01244] conserved hypothetical protein TIGR01244 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.359472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTTT ATATAGCGAC GGAGCCGTCG GTGGACGTCC ATCACATCAC GCGCGACTTG 
GCCGTCGCGC CACAGATCCG GCCTGACGAC ATCCCGGCGG TCGCGTCCGC CGGGTTCCGG
TCGATCCTCT GCAACCGCCC CGACGGCGAG GCCCCCAACC AGCCGAATTT TCGTGAGATC
GAGCGGCGGG CCGGGGAGGG CGGCCTCGTC GTCCGCTACC TGCCGGTCAC GTCGAGCCGT
ATCACCGACG CGGATGTCGC AGCCTTCGAG GCGGCGGCGG ACGCCCTACC GAAGCCGATC
CTTGCCTATT GCCGCACCGG CACGCGCTCG GCGACGCTGT GGTCGCTCGC CCAGGCACGG
CGCGGCCGCG CCGTGGCGGA GATCCTGGCT GCGACGAAGG CCGCAGGCTA CGACCTGAAA
GGCGCCGCGC CCCGGATGGC GGCGCAGGCC GGCGCGGCGA AAGAGAGAAC CGAGCAACGG
TTCGCGATTG TCATCGTCGG CGGCGGCTCG GCCGGCCTCG CGGCGGCCTC AAGCCTGAAG
GCGCGCAAGC CCGACCTGGA GGTCGCCGTG ATCGATCCGG CCGACATCCA CTACTACCAG
CCCGGCTGGA CACTGGTGGG CGCCGGCGTG TTCGACCCGG CGGTGACCGC CCGGACCATG
GCGTCCCTGA TCCCGGACGG CGTGACGTGG ATCAAGGCCG GCGTTGTCGC CTTCGAGCCG
CAGAGGAAGG CCGTGATGCT GGAGGACGGC CGGACCATCG GCTACGACCG CCTCGTCGTC
GCCCCCGGCC TCAAGCTCGA CTGGGACGGC ATCGAGGGGC TGGTCGAAAC GCTCGGCCGG
AACGGGGTCA CCTCGAACTA CCGCTTCGAC CTCGCGCCCT ATACCTGGGA GCTGGTCCGG
AACCTCGGCG GAGGACGGGC CGTGTTCACC CAGCCCCCCA TGCCGATCAA GTGCGCGGGC
GCCCCGCAGA AGGCGATGTA TCTCTCCGCC GACCATTGGC GGCGCGCGGG CCGCCTGAAG
CAGATCGGGA TCGACCTCTT CACGGCGGCC CCGAGCCTGT TCGGCGTGAA GGAATACGTG
CCGCCCCTGA TGGAGTACGT CCGGCGCTAC GACGCGAAGC TGCACTTCCG TCACGACCTC
ACGCGCATCG ACGGCTCGGC CAAGCGCGCG TGGTTCACCC GCACGGCCGA GGACGGAACC
CAATCGACGG TCGAGACCGG GTTCGACATG ATCCATGTCG TTCCGCCCCA GCAGGCCCCC
GATTTCATCA GGGAATCCCC CCTGGCGGAT CCGAGCGGCT GGGTCGAGGT GGACCCGGCG
AGTCTGCGCC ACAAGCGCTT TACCGACGTG TACGGGCTGG GCGACGCTTG CAGCGCGCCG
AACGCCAAGA CCGCCGCCGC GGCGCGCAAG CAGGCGCCGG TGGTGGCGCA CAACCTGCTG
CGCGACATGG GCTTCATCGA GGGGCCGGAT GCCATTTACG ATGGCTACGG CTCGTGCCCG
CTCACCGTCG AGCGCGGCAA GATCCTGCTT GCCGAGTTCG GCTATGGCGG CAAGCTTCTT
CCCAGCTTCC CGTCCTGGCT GCTCGACGGC ACGAAGCCGA GCCGGGCCGC GTGGCTGCTC
AAGGAGCGCC TGCTCCCGCC CCTCTACTGG CACGGCATGC TCAAGGGGCG CGAGTGGATG
GCCAAGCCCA GGCGGGCGGT TTGA
 
Protein sequence
MQFYIATEPS VDVHHITRDL AVAPQIRPDD IPAVASAGFR SILCNRPDGE APNQPNFREI 
ERRAGEGGLV VRYLPVTSSR ITDADVAAFE AAADALPKPI LAYCRTGTRS ATLWSLAQAR
RGRAVAEILA ATKAAGYDLK GAAPRMAAQA GAAKERTEQR FAIVIVGGGS AGLAAASSLK
ARKPDLEVAV IDPADIHYYQ PGWTLVGAGV FDPAVTARTM ASLIPDGVTW IKAGVVAFEP
QRKAVMLEDG RTIGYDRLVV APGLKLDWDG IEGLVETLGR NGVTSNYRFD LAPYTWELVR
NLGGGRAVFT QPPMPIKCAG APQKAMYLSA DHWRRAGRLK QIGIDLFTAA PSLFGVKEYV
PPLMEYVRRY DAKLHFRHDL TRIDGSAKRA WFTRTAEDGT QSTVETGFDM IHVVPPQQAP
DFIRESPLAD PSGWVEVDPA SLRHKRFTDV YGLGDACSAP NAKTAAAARK QAPVVAHNLL
RDMGFIEGPD AIYDGYGSCP LTVERGKILL AEFGYGGKLL PSFPSWLLDG TKPSRAAWLL
KERLLPPLYW HGMLKGREWM AKPRRAV