Gene M446_6621 details

Gene Information

Locus tag: M446_6621
Symbol:
ID: 6130884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 7287543
End bp: 7289024
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 75%
IMG OID: 641646710
Product: uracil-DNA glycosylase superfamily protein
Protein accession: YP_001773309
Protein GI: 170744654
COG category: [L] Replication, recombination and repair
COG ID: [COG1573] Uracil-DNA glycosylase
TIGRFAM ID: [TIGR00758] uracil-DNA glycosylase, family 4
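
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based, inclusive coordinates (an assumption, though it matches the reported gene length) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 7287543
end_bp = 7289024
gene_length_bp = 1482
protein_length_aa = 493


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1482
print(expected_protein_length(gene_length_bp))  # 493
```

Both results agree with the Gene Length and Protein Length fields.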


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.647751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.384505
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCAGCA TCCCGCTGAG GCCCGGCGCC GACCTCGAAG GGTTCCGGGC CGCGGTGCGC 
CGCCTCGCGG CGGAGGAGTG CCCTCCCGAG GCGGTGACCT TCACGCAGGG CGGGGCCCCC
GGCCTGTTCG GGCCCGAACC CGGCCTGTTC GGGGCCGAGC CCGGCCTGTT CGGGGCCGAG
GCGGCGCGCG CGTCCGAGGA GGCCGGGCCG GCGCCCCCGC TCGCCCTGCC GCGGGCCGTG
GCGGCGCTGG TGCCCGCGAT CGTGCCCCAC CGGGACGAGG CGCGCTACGC GCTGCTCTAC
CAGCTGATCT GGCGGGTGCG GCGGGGCGAG CGCCACCTCC CCGAGGTGGC GAGCGATCCC
CTGGTCCACC GGCTGACGCG CATGCGCCGA GCGGTCGCGC GCGACCTGCA CAAGATGCAC
GCCTTCCTGC GCTTCCGGCG CGTGCCCGGC GAGGGAGAGC GGGAGCGCTT CGCCGCGTGG
TTCGAGCCCG ACCACCACAT CCTGGAGGCG GCCGCGCCGT TCTTCGTGGC GCGTTTCCCG
GGCTTCGACT GGTTGATCCT GACGCCCGAG GGCTCGGCCC ACTGGGACGG CGCGCGCCTG
CGCTTCGGCC CACCCGAGCG GCGCGAGGCG CTGCCCGCCG GCGACGCCTT CGAGGCCGGC
TGGAGCGCCT ATTACGCCAG CACCTTCAAC CCGGCCCGGA CCAACCTCGC GGCGATGCGG
GCGGAGATGC CCAAGAAGTA CTGGCGCAAC CTGCCCGAGG CGGCGGCGAT TCCCGACCTC
GTGCGCAACG CGCGCCGCCG CGTCGCCGCG ATGATCGAGA GGGAGCCCGC CATGCCGCGG
AAACGCACGC CCACCCGCGC CCTCGACGCC ATGGCGGCGC AGGGACCGGA GGATCTCGAG
GCCCTGAACG CCCTGATCCG GCGCTCCGAA CCGCTGGTGC CGGGGGCGAC GCAGGCGGTG
CTGGGCGAGG GACCGGTCGG CGCCGCGATC GCCTTCGTGG GCGAGCAGCC CGGGGACCAG
GAGGACCGGC TGGGGCGGCC CTTCGTCGGC CCGGCGGGGC AGCTCCTCAC CCGGGCGATG
GAGGAGGCCG GCCTCGCGCG TGGTTCCTGC TATCTCACGA ATGCGGTCAA GCACTTCAAG
TTCGAGGAGC GCGGCAAGCG GCGCATCCAC CAGAAGCCGA CCGCCGGGGA GGTCGCGCAT
GGCCGCTGGT GGCTCGACCG GGAGCTCGGC TTCGTGCATC CGCGCCTCGT CGTCGCGCTC
GGGGCGACGG CGGTGCTGGC GCTGACCGGC AAGGCGATCC CGATCACCCG GGCCCGCGGC
CCGGCCCGGT TCGACGGCAA ACCCTATGCG GGCTTCGTCA CCGTGCACCC CTCCTACCTG
CTGCGCCTGC CCGAGGAGGC GAAGGCCGAG GCCTATGCGG GTTTCGTGGA CGATCTGCGC
CGGGTGCGAA TGCTGGCGCA GGAACTCGCC GGGGCGGCGT AG
 
Protein sequence
MRSIPLRPGA DLEGFRAAVR RLAAEECPPE AVTFTQGGAP GLFGPEPGLF GAEPGLFGAE 
AARASEEAGP APPLALPRAV AALVPAIVPH RDEARYALLY QLIWRVRRGE RHLPEVASDP
LVHRLTRMRR AVARDLHKMH AFLRFRRVPG EGERERFAAW FEPDHHILEA AAPFFVARFP
GFDWLILTPE GSAHWDGARL RFGPPERREA LPAGDAFEAG WSAYYASTFN PARTNLAAMR
AEMPKKYWRN LPEAAAIPDL VRNARRRVAA MIEREPAMPR KRTPTRALDA MAAQGPEDLE
ALNALIRRSE PLVPGATQAV LGEGPVGAAI AFVGEQPGDQ EDRLGRPFVG PAGQLLTRAM
EEAGLARGSC YLTNAVKHFK FEERGKRRIH QKPTAGEVAH GRWWLDRELG FVHPRLVVAL
GATAVLALTG KAIPITRARG PARFDGKPYA GFVTVHPSYL LRLPEEAKAE AYAGFVDDLR
RVRMLAQELA GAA
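
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that rule, not the pipeline that produced this record. The name `translate_cds` is hypothetical. The codon-to-amino-acid assignments are those of the standard genetic code, which table 11 shares.

```python
# Standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS; a valid start codon in first position is read as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCGCAGCA TCCCGCTGAG GCCCGGCGCC"))  # MRSIPLRPGA
```

The output matches the first ten residues of the protein sequence listed above.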