Gene Gdia_0979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0979 
Symbol 
ID6974376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1101991 
End bp1103445 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content69% 
IMG OID643390502 
ProductC-5 cytosine-specific DNA methylase 
Protein accessionYP_002275378 
Protein GI209543149 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.441427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCATG GTCATTCCCC CATGGGTGAC ATCTCACCCC GCACACCCTT CCGCGTGCTG 
GACCTGTTCG CGGGTGCGGC TGGCGGCTGG ACGCTGGGCC TGCACCGTGC GGGTTTCGTC
ACCGTCGCCG CCTGCGAGAT CGTTCCATGG CGGCGCGTGC TCTACGCGGA GAACAATCCC
CATGTCCGCC TCTACGACGA TGTCCGGACC CTCACGGCAG GGCGACTTGT TTCCGACCTC
GGTTTCCTGC CCGACCTCAT CGCGGGCAGC CCGCCATGCC AGGACATCAG CAGCGCCAAC
ACCAGGGGCA AAGGGATCGA CGGCGCGCGG TCGGGCCTCT ACCGCGAAGC CGTCCGCCTG
GTCGGAGAAT GCCGCCCTCG CTGGTTCGCT TTTGAGAACA GCGCTAACCT CCGAACTCGC
GGTGCGGACC GGCTGCTCGA TGCGCTGGAG GCGCTCGGCT ACGCCTGCGA ACCGTGCGTG
GTGGGTGCTG GAGACGTCGG CGCCTGCCAT GTCCGCAAGC GGTCCTGGCT CATCGGTTTC
GACCCCCGGC AGCTTGCCGA CACCGGTCTC GCAGTCGCAA CAGGGCGGGA TGCGGATGGA
AGGGGGAGCG GGCGCGCGGG CGGCGATGAA GGCGTCGGGT CTTTACGATG TGTTCCGCAC
GGATCGTATG GCGACGCCAC GGGCGTCGGA TGCCGCGAAG GGTGGTCGGG GCGACGTGCT
GGCGCAGATG ACGGGGCAGG AGAACCGTCA TGCGGGGATG CTGGGAACCC CGACTGCCAA
CCCTGCGCCA CGGGGCAGCA TGCTGCGTCG GTGCAAGGGG CGGATGACAT CGGAGCAGGA
CGATTTCAAC CGTACGCCGA CGATCGGGGA AGCGCTGTAT TTCGAGGAGC ACCAGCCTCC
CTGCGGGATG CTGCCCACAC CGCTGGCGCC GAACGGGGGC CGCCGGCTCA CCGCGCGGGA
GATCGGGACG GGCCGTCGGG CGAACGGATC CAAGGCCCAG ATCGACACGC CGAACCTGCT
GCGGCACGCA GCACTGGAAA CGCTACCGAC ACCCACGAAG CGGGACAGCC GGATGGACGG
CTGGAGCGAT GCCTACGACC GGAGGAAGTC GCCCACGATG GACGCGGTGA TGGATGGAGC
GATGACGGGC AGGGCACCCC CGGACCGCTG GGCGGGTGCA CGGGCGCTGG CGGTCCTGCT
GCGGAGCCAT GGGCTGACTG GAACGGCGGC CCTGCCGCTC ACCTACGGCT GGATGATGGG
CTTTCCGCCT GGCTGGCTCG CACGCGCGTT GCGCTCGGCG ATGGACGCCG GGCGTCTGCC
GCCAGCCTCG TCGTCGAGGC GTTCGGCGAC GCGGTCGTCC CGGACATCCC CGAAGCCATA
GGCAACGCCA TCCTGCGGGT CGAACTGGCG CTCGACATGG TCCTGGCGCG TGCGGCGGCC
GGAGATGTGT CATGA
 
Protein sequence
MTHGHSPMGD ISPRTPFRVL DLFAGAAGGW TLGLHRAGFV TVAACEIVPW RRVLYAENNP 
HVRLYDDVRT LTAGRLVSDL GFLPDLIAGS PPCQDISSAN TRGKGIDGAR SGLYREAVRL
VGECRPRWFA FENSANLRTR GADRLLDALE ALGYACEPCV VGAGDVGACH VRKRSWLIGF
DPRQLADTGL AVATGRDADG RGSGRAGGDE GVGSLRCVPH GSYGDATGVG CREGWSGRRA
GADDGAGEPS CGDAGNPDCQ PCATGQHAAS VQGADDIGAG RFQPYADDRG SAVFRGAPAS
LRDAAHTAGA ERGPPAHRAG DRDGPSGERI QGPDRHAEPA AARSTGNATD THEAGQPDGR
LERCLRPEEV AHDGRGDGWS DDGQGTPGPL GGCTGAGGPA AEPWADWNGG PAAHLRLDDG
LSAWLARTRV ALGDGRRASA ASLVVEAFGD AVVPDIPEAI GNAILRVELA LDMVLARAAA
GDVS