Gene M446_6055 details

Gene Information

Locus tag: M446_6055
Symbol:
ID: 6129725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6640694
End bp: 6642523
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 79%
IMG OID: 641646154
Product: RNA-binding S4 domain-containing protein
Protein accession: YP_001772766
Protein GI: 170744111
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
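
The start, end, gene length and protein length values in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(6640694, 6642523)
print(length)                     # 1830
print(protein_length_aa(length))  # 609
```

Both results agree with the Gene Length and Protein Length fields listed above.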


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.228541
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.392617
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACA GCAACGACGA CGAGCCCCGC GGGGAGAGGC GCCGCCGCGG GACGGCCGAG 
GCGTCCCCCG GCGCGGCCTC CCCCGCCCGC CCCGCGCCCT CCGAGACGGC CGAGCCCGAG
CGCATCGCCA AGGTCATGGC CCGGGCGGGC GTCGCCTCGC GGCGGGACGC GGAGGCGATG
ATCCTGGAGG GGCGCGTCAG CCTCAACGGC GAGACCCTGA CCACGCCCGC CGTGACGGTC
GGGCCCGGCG ACCGCATCGT CGTCGACGGC GAGCCCCTGC CCGTCCGCGA GCGCACCCGG
CTGTGGATCT TCCACAAGCC CCGCGGCGTG GTGACCACCG CCCGCGACCC CGAGGGGCGC
CAGACCGTGT TCGACATCCT GCCCGAGGAC CTGCCGCGGG TGGTGGCGAT CGGCCGGCTC
GACATCAATA CCGAGGGGCT GCTGCTCCTC ACCAATGATG GCGGTCTCGC CAAGGTGATC
GCCCATCCGG AGACCGGCTG GCTGCGCCGC TACCGGGTGC GCGCCTACGG CGACGTCGAC
CAGGCGGCCC TCGACCGCCT GCGCGGCGGC GTCACCATCG ACGGGATGGA ATACGGCCCC
GTCGAGGCCA GCATCGACCG GCAGCAGGGC GACAACGTCT GGCTGACGCT CGGCCTGCGG
GAGGGCAAGA ACCGCGAGGT GAAGCGGATT CTGGAGCATC TCGGCCTCTC GGTGAACCGG
CTGATCCGGC TCTCCTTCGG CCCGTTCCAG CTCGGCGACC TGGAGGTCGG GCTCGTCGAG
GAGGTGCGCA CCAAGGTGCT CAAGGAGCAG CTCGGCCGCT CCCTCGCCGA GCAGGCCGGC
GTCGACTTCA CCAGCCCCGT GCGCGAGCCG ATCGCCCCGT TCGGGTCGCC CAAACCCCCG
GCCCCGGCGG CCGGCAGGAC GGGCGCCCCG GGTCGCGACC GGCCCCGGGG CGAGCGCCCG
GACCGCAGCG CGCGCCCGGA TCGCGGGGAG CGCCTCGACC GCGGCGAGCG TCCGGCCCGG
CCGGGCCGCG ACCCGGCCCG GCCGCAATTC GCGCGCCCGC CCGCCCCCGG GGCGCGCCCG
GAGCGGGACC GCAAACCCGC GACCGGGCCG GCCCTGCGCC GGGCGGTCTG GCACGACCCG
GAGATCGAGG CCGCGGTCGA GGCGCGCCCG CGGCTGCGGC GGCGGACGAA CGATCCCAAG
GAGGCCCGGG CCGCGGCGGC CGAGCGCCCG CGCGAGCGGG TCGGCGCGAT CGCGACGGGG
GAGCGCCGCG TCGTGGTGGA GCGCCTCAAG GCGGAGCCCG CCCCCGAGCC GCGCCGCCCG
CCCGAGGGCC GGCCCCGGCC GGAGGGCCGG CAGCGGCCGG AGGGCCGGCA GCGGCCGGAG
GGCCGGCAGC GGCCGGAGGG CCGCCTAGAT CGCGAGCGGC CCGCCCGCGG CGACCGCCCG
CCCCGCGCGG GCGAGGAGCG CCCCGCGCGG CGCCCGCGCC CGGAGGGGTC AGGGCGGCCC
GAGGGCGGCG CGCGTCCGCG CCCCGAGGGC GGCGGCCGGC CCGGCTTCGG CAAGGGCGGA
TTTGACAAGG GCGGCTTTGA CAAGGGCGGC TTCGGCAAGG GCGACTTCGG CAAGGGCGGC
CCCGGCGCGG GCCGCGGCGG CCCCGGCTTC GGCAAGGGCG GTCCCGGAAA GGGCGGCTCC
GGGTTTGCAA AGGGCGGCCC CGGCTTTGCA AAAGGCGGCT CGAAGGGCGG CCGGCCGGGT
CGCGGCGGCG CGGCCCCGCG ACCCGGCGGA CCGGGCGGAC GACCGGGCGG CGGGCGGCCC
GGCGAAGGGC GGCCGCCCCG CGGGCGCTGA
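
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and how a GC fraction could be computed; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty one)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied as displayed (blocks of 10).
first_row = "ATGAGCGACA GCAACGACGA CGAGCCCCGC GGGGAGAGGC GCCGCCGCGG GACGGCCGAG"
print(len(clean_sequence(first_row)))  # 60
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1830 bp sequence to `gc_fraction` in the same way gives a value that can be compared against the GC content field of the record.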
 
Protein sequence
MSDSNDDEPR GERRRRGTAE ASPGAASPAR PAPSETAEPE RIAKVMARAG VASRRDAEAM 
ILEGRVSLNG ETLTTPAVTV GPGDRIVVDG EPLPVRERTR LWIFHKPRGV VTTARDPEGR
QTVFDILPED LPRVVAIGRL DINTEGLLLL TNDGGLAKVI AHPETGWLRR YRVRAYGDVD
QAALDRLRGG VTIDGMEYGP VEASIDRQQG DNVWLTLGLR EGKNREVKRI LEHLGLSVNR
LIRLSFGPFQ LGDLEVGLVE EVRTKVLKEQ LGRSLAEQAG VDFTSPVREP IAPFGSPKPP
APAAGRTGAP GRDRPRGERP DRSARPDRGE RLDRGERPAR PGRDPARPQF ARPPAPGARP
ERDRKPATGP ALRRAVWHDP EIEAAVEARP RLRRRTNDPK EARAAAAERP RERVGAIATG
ERRVVVERLK AEPAPEPRRP PEGRPRPEGR QRPEGRQRPE GRQRPEGRLD RERPARGDRP
PRAGEERPAR RPRPEGSGRP EGGARPRPEG GGRPGFGKGG FDKGGFDKGG FGKGDFGKGG
PGAGRGGPGF GKGGPGKGGS GFAKGGPGFA KGGSKGGRPG RGGAAPRPGG PGGRPGGGRP
GEGRPPRGR
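
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on an external library; it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs only in which start codons it permits). The `translate` helper is hypothetical and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1 up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 30 bases of the gene sequence shown above.
print(translate("ATGAGCGACA GCAACGACGA CGAGCCCCGC"))  # MSDSNDDEPR
print(translate("GGCGAAGGGC GGCCGCCCCG CGGGCGCTGA"))  # GEGRPPRGR
```

The two outputs match the first ten and last nine residues of the protein sequence, with the terminal TGA read as the stop codon.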