Gene Mpop_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpop_1091 
Symbol 
ID6313843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium populi BJ001 
KingdomBacteria 
Replicon accessionNC_010725 
Strand
Start bp1163806 
End bp1165176 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content74% 
IMG OID642649812 
Productcytosine deaminase-like protein 
Protein accessionYP_001923801 
Protein GI188580356 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.528531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0465874 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAC CCTCGTTCCT GCCCGAGGCC CGCGCCTACC GCCTGCGCAA CGCGCGCGTG 
CCCGGAGCCT TCCTGACCGG CGGCGTGCCC GCCGGCGCGA TCCTCGACGG GGATGCTTGC
GCGCTCATCG ACATCGTCGT CGCGGACGGC GTCATCGCCG GCCTTCTGCC GGCCGGAAGT
TCGCCGGACA CCTTGCCGGC TGCCGATCTC GCCGGGCGGC AGGCCTGGCC GCGCCTCGTC
GATGCGCACA CCCATCTCGA CAAGGGCCAT ACGGTCGCCC GCACGCCCAA TCCGGACGGC
GACTTCCCCG GCGCCCGCGA CGCCACCACG GCGGATCGCA CCCGCCACTG GGACGCGGAG
GATCTGCGCC GCCGCATGGC CTACGGTCTC GCCTGCGCCT TCGCCCACGG CACCGGCGCG
ATCCGCACCC ATCTCGACAG CCAGGACAGC GCAGGGGAGG GGGCGCTGCG CGACCAGGCC
GCGACGACCT GGGCGGTGTT CCGGGAAATG CGGGCGGCCT GGGCCGGGCG GATCGCGCTC
CAGGGCGTCG GCCTGACCCC GATCGATGCC TACGCCACCG CGTACGGGCG CCGTCTCGCC
GACCTGATCG CGGAGTCGGA CGGGCTGATC GGCGGCGTGA CCCGGCCGAC CGGCGGTCTG
CACGGCGGGG CGCTGGCCGA GATCGACGCC CTGCTCGACC GCCTGTTCGG CCTCGCCCGT
GAGCGCAACC TCGATGTGGA CCTCCATGTC GACGAGACCG GCGATCCCGC CGCGGCGTCC
CTCGACGCGG TCGCGCGGGC GACCCTGCGC CACGGCTACG AGGGCCGCGT CACCTGCGGG
CATTGCTGCA GTCTCGCGCT CCAGCCCGAC GCGCAGGCGT CAGGCACGAT CGCGCGGGTG
GCGGAGGCGG GGATCCGCAT CGTGACGCTG CCGACGGTCA ACATGTACCT CCAGGATCGG
CAGCGGGGGC GCACCCCGCG CTGGCGCGGC GTCGCCCCGG TCCAGGAACT GATGGCGGCC
GGCGTGCCCG TGATGGTCGC GGGCGACAAT TGCCGCGATG CGTTCTACGC CTACGGCGAC
CACGACATGC TCGACACGTT CCGGGCGTCG GTGCGCATCC TCCATCTCGA TCACCCGCTG
GCCGGGGCGC CCGCGCTCGC CGGGCCTGTG CCGGGGGCGA TGATGGGGCT TCCCCATGCC
GGCACGATCC GCGAGGGCGC CCCCGCCGAC CTGATCCTCC TGCCGGCGCG CAGCCTCAAC
GAGGTCGTCG CGCGGCCGCA TGCGGATCGA ATCGTCGTGG TCGCGGGCCG GGCGATCGCG
GCGCGTCGGC CAGCCTATGA AGCCCTGAGC GGCGAGGCCG CCCCCTGGTA G
 
Protein sequence
MSEPSFLPEA RAYRLRNARV PGAFLTGGVP AGAILDGDAC ALIDIVVADG VIAGLLPAGS 
SPDTLPAADL AGRQAWPRLV DAHTHLDKGH TVARTPNPDG DFPGARDATT ADRTRHWDAE
DLRRRMAYGL ACAFAHGTGA IRTHLDSQDS AGEGALRDQA ATTWAVFREM RAAWAGRIAL
QGVGLTPIDA YATAYGRRLA DLIAESDGLI GGVTRPTGGL HGGALAEIDA LLDRLFGLAR
ERNLDVDLHV DETGDPAAAS LDAVARATLR HGYEGRVTCG HCCSLALQPD AQASGTIARV
AEAGIRIVTL PTVNMYLQDR QRGRTPRWRG VAPVQELMAA GVPVMVAGDN CRDAFYAYGD
HDMLDTFRAS VRILHLDHPL AGAPALAGPV PGAMMGLPHA GTIREGAPAD LILLPARSLN
EVVARPHADR IVVVAGRAIA ARRPAYEALS GEAAPW