Gene Acid345_4264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4264 
Symbol 
ID4073191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5066200 
End bp5067726 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content54% 
IMG OID637986296 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_593338 
Protein GI94971290 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCCAG TTATAGACCT TTTCGCCGGG CCGGGCGGTC TGGGCGAGGG CTTTTCCGCC 
CTCCGAGATG ATGCAGGGCG CCGTGTCTTC AGGATCGGCC TTTCCATCGA GAAAGATCCT
GCTGCGCACA AAACCCTGTT GCTCCGAAGT TTTTTTCGGC AGTTCGAGAT TGCACCGAAC
CAGTACTACA GCTACGTTCG TGGGGAGTTG GCGATTAGCC AGTTGATCGA GGCCTTCCCA
ACCCAATACG GCTCGGCCCA GGAAGAGGCA CTCTGCGTTG AATTAGGGAA GCACGACTGG
ACCGACATTG ACAGTCGTAT CCGGAGAGCG ATCGGCGACT TCAAGAATTG GGTCCTGATC
GGAGGACCTC CGTGCCAGGC GTATTCACTT GTGGGCCGAT CGCGAATGCG CAATAAGAAC
CCGCAGAAAT TCGGCAGGGA CAAGCGTCAT TTGCTGTATC GCGAGTATCT TCGAATTCTC
GCGGTGCACC GGCCACCCGT CTTCGTGATG GAAAACGTGA AAGGCATTTT GTCCTCTCAG
CACAAGAAAG AACCGATCAT CAATAAGATC ATTGCTGACC TGAAACACCC ACTTGACTCT
CTTCCCGAGC TGAAGACCTC TGAATGGGAC TCGGTCACCT ACGACATTCA GTCACTCGTC
GATCGCGTTG ACTCAAGGCA GAGCAGCCTT TTCGGTAAGT CCCCGCGGGA TTTCGTGATT
CGATTTGAAC AGCATGGGAT ACCCCAAGCT CGGCACCGCG TTCTTCTGGT TGGAATACGT
TCGGACATCT CTCGCGATCT CTCGACGTTA ACGCCACAAA AGCAGATCGC AATGTGGAAT
GCAGTTCAGG ACTTGCCTGA AATGAGGAGC CGATTGTCCG GGGAGCCCGA TTCGCCCGAG
GCGTGGCTTC GCGCCATTGG AATGATTCGT GATTCGCTGG GGAAAACCTC TTGCGGCAGT
CAGTTGCGAG ATTTCATCCA CAAGAGTTTG GGACAATTGA CGGCTCGCCA AAGCTGCGGA
GGTGAATTCA TGGAATGGCG ACGGAGGTCG GAGTGGGCTT CAGATTGGTT TCACGATGCG
CGACTCGGCG GGGTTTTGAA TCACACCTCT CGGGGCCATA TCGCATCCGA TTTGCAGAGG
TACTTCTTCG CGGCCTGTTT TGCCAAGGTA AGGAATCGCT CCCCGAAAAT TGAGGACTTC
CCGATTGCTT TATTGCCCAA GCATAAAAAC ATCAAGAAGA AGAACACCAA AGCGACGATT
TTTGCAGACA GATTTCGGGT TCAGATTGCT AGCCGCCCCG CGACAACTAT CACATCACAC
ATCAGCAAAG ACGGCCATTA CTTCATCCAC CCTGACCCGT TACAAGCGCG AAGCTTGACG
GTGCGAGAGG CGGCACGCCT TCAAACGTTC CCAGATAACT ACTATTTTGA AGGCCCGCGC
ACGTCCCAGT ACCACCAAGT TGGAAACGCT GTACCGCCCT TGATAGCTCG CGAGGTTGCA
AAGATGGTCG CCGACCTTTT GGGGTGA
 
Protein sequence
MIPVIDLFAG PGGLGEGFSA LRDDAGRRVF RIGLSIEKDP AAHKTLLLRS FFRQFEIAPN 
QYYSYVRGEL AISQLIEAFP TQYGSAQEEA LCVELGKHDW TDIDSRIRRA IGDFKNWVLI
GGPPCQAYSL VGRSRMRNKN PQKFGRDKRH LLYREYLRIL AVHRPPVFVM ENVKGILSSQ
HKKEPIINKI IADLKHPLDS LPELKTSEWD SVTYDIQSLV DRVDSRQSSL FGKSPRDFVI
RFEQHGIPQA RHRVLLVGIR SDISRDLSTL TPQKQIAMWN AVQDLPEMRS RLSGEPDSPE
AWLRAIGMIR DSLGKTSCGS QLRDFIHKSL GQLTARQSCG GEFMEWRRRS EWASDWFHDA
RLGGVLNHTS RGHIASDLQR YFFAACFAKV RNRSPKIEDF PIALLPKHKN IKKKNTKATI
FADRFRVQIA SRPATTITSH ISKDGHYFIH PDPLQARSLT VREAARLQTF PDNYYFEGPR
TSQYHQVGNA VPPLIAREVA KMVADLLG