Gene Mext_3861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3861 
Symbol 
ID5832771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4288332 
End bp4290341 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content77% 
IMG OID641369651 
ProductRNA-binding S4 domain-containing protein 
Protein accessionYP_001641304 
Protein GI163853261 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.227341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA TTCCAGACCC CTCCCCGGCC GTGCCCGCGG ACCAACCCGC CGCGCCCGAG 
GGCGAGCGCA TCGCCAAGGC CATCGCCCGC GCCGGCATCG CTTCCCGCCG CGACGCGGAG
GCCCTGATCG AGGCCGGCCG CGTCACCCTC AACGGCAAGG TCCTGACCTC GCCGGCGATC
AACGTCACCG ATGCCGATCG CATCACGGTG GACGACGAGC CCCTGCCCGC CCGCGAGCGG
ACCCGGCTGT GGCTGCTGCA CAAGCCGCGC GGCGTCGTCA CCACTGCCCG CGACCCGGAG
GGCCGCCCGA CGGTGTTCGA CGGCCTGCCC GAAGAGCTGC CGCGAGTCGT CGCCATCGGC
CGGCTCGACA TCAACACCGA AGGGCTGCTG CTCCTCACCA ATGACGGGGG GCTGGCCAAG
GTGATCGCCC ATCCCGATAC CGGCTGGCTG CGCCGCTACC GGGTGCGCGC CTTCGGCGAC
GTGACCCAGC CCGATCTCGA CAAGCTGAAG AAGGGCCTGA CCGTCGATGG CATGGAGTAC
GGCCCGATCG AGGCGACCCT CGACCGGGCG CAGGGCGACA ACGTCTGGCT GACGCTCGGC
CTGCGCGAGG GCAAGAACCG CGAGGTCAAG CGCATCCTCG AGCATCTCGG CCTGTCGGTG
AACCGGCTGA TCCGGCTCTC GTTCGGGCCG TTCCAGCTCG GCGACCTCGA AGTCGGCCTC
GTCGAGGAGA TCCGCACCAA GGTTCTGAAG GACCAGCTCG GGAAGAACCT CGCCGAGCAG
GCCGGCGTCG ATTTCGAGAG CCCGGTGCGC GAGACGATCG CCCCCTTCGG CAGCCCGAAG
AAGGCCACCC GCGCCACGCA GGCGGCGCCC GCAGGACGGG GCCGTCAAGG CGCGCGCGAC
GAGGGCGCCT CGGAGGCCCG CTCGGGGTCT CGCGGCGGCT CGGCCGGCCG TCCGCCGCGC
GATCCCGCCC GCCCTGCCGC CCCCCGCGCT GGTGCGCGCC CGGCCCCGCG TCCCCCTTCC
CCGACCGTGT GGCGCGCCGA CGAGGAGACG CGGCCTCGCG CCTCGAAGGT GCCGCGCCGG
GGCATGGACC CCAAGACCGC CCGCGCGGCG GCGGCCGAGC GCGGCCGCGA GCGGGTCGGC
GCGATCCAGG CGGCCGGCGA GCGCCGGGTG CTGGTGGAGC GGCTTCAGCC CTCGCCCGAG
AAGCCCGCAC CCGAGCCGCA TCGCCGGGTC CGCTTCCGCA ACGAGGAGGA GCGGTCCGCG
CCCCGCGACA GGGCGCGCGA AGACAGGCCG CGCCAGGATC GGCCAGGTCA AGATCGGCCG
CGTGAAGACC GGCCGCGCAG CGAGGGACCG CGCGAGAACC GCGCCCGCGA CGAGCGGCCG
GGCCGGGACG AGTTCCGACG CGGCGCACCC GCGGGCGATG CCCGTCCCCG CCGTCGTGAG
GCCGGGGAGG CCGCACCGGC CGAGCGGCGC GGGCCGCCCC GCGAGCGTCC GTCCGATTTC
GAGGGGCGTC CGCCGCGTGA GCGGAGCGAG GGCCGGCCTC AGCGTGACTT CCATGGTGAC
TCCCGTGAGC GCGGCGAGGG CGCACCCCGC CGCGAGCGGC CGAGCGAAGG GCGCCCGCCC
GAGCGTCGCG GCCCCCCGCG CGACCGGCCC CGTGCCGCCG AAGGCGAGCG GCCGGCCCGC
CCCCCGCGCG GCGCCGAAGC GTCCGATCGT CCCGCCTTCC GCAAGGGCCC CGGCAAGGGG
CCCGGTGGTG GTGCAGGCAA GAGCTTCGGC GGCAAGCCCG GCTTCAAGGG CGGTGACCGC
GACGGCGCCA GGGGCGATTT CAAGGGTGGC GCGAAAGGCG GCCCGCGGGG CGACTTCAAG
CCCGGTGGCG GCGGCCGGCC CGGTGGGCGC CCGGGCGGAC GTCCCGGCGG CAAGCCCGGC
GGTGCCCCCG GTGGACGCCC CGGTGGTGGC GGCAAGCCGG CACCGCGCGG GGGCGGCCGT
CCGAGCCGTC CGCCGCGTGG CGAGGGCTAA
 
Protein sequence
MTDIPDPSPA VPADQPAAPE GERIAKAIAR AGIASRRDAE ALIEAGRVTL NGKVLTSPAI 
NVTDADRITV DDEPLPARER TRLWLLHKPR GVVTTARDPE GRPTVFDGLP EELPRVVAIG
RLDINTEGLL LLTNDGGLAK VIAHPDTGWL RRYRVRAFGD VTQPDLDKLK KGLTVDGMEY
GPIEATLDRA QGDNVWLTLG LREGKNREVK RILEHLGLSV NRLIRLSFGP FQLGDLEVGL
VEEIRTKVLK DQLGKNLAEQ AGVDFESPVR ETIAPFGSPK KATRATQAAP AGRGRQGARD
EGASEARSGS RGGSAGRPPR DPARPAAPRA GARPAPRPPS PTVWRADEET RPRASKVPRR
GMDPKTARAA AAERGRERVG AIQAAGERRV LVERLQPSPE KPAPEPHRRV RFRNEEERSA
PRDRAREDRP RQDRPGQDRP REDRPRSEGP RENRARDERP GRDEFRRGAP AGDARPRRRE
AGEAAPAERR GPPRERPSDF EGRPPRERSE GRPQRDFHGD SRERGEGAPR RERPSEGRPP
ERRGPPRDRP RAAEGERPAR PPRGAEASDR PAFRKGPGKG PGGGAGKSFG GKPGFKGGDR
DGARGDFKGG AKGGPRGDFK PGGGGRPGGR PGGRPGGKPG GAPGGRPGGG GKPAPRGGGR
PSRPPRGEG