Gene Acid345_1127 details

Gene Information

Locus tag: Acid345_1127
Symbol:
ID: 4069897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1404454
End bp: 1405533
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 56%
IMG OID: 637983136
Product: DNA-cytosine methyltransferase
Protein accession: YP_590204
Protein GI: 94968156
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
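
The coordinates and lengths listed above are mutually consistent, which a short check can illustrate. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(1404454, 1405533)
print(length)                           # 1080
print(expected_protein_length(length))  # 359
```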


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000503374
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000088158
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCTAAAAT CCCGATTTAT GAAGCGCCGA GATAGCGATT CAGCTCCGTT TTTGTTGGAA 
GTTGGATACG AGGCAGAGAA TCCCTGGCGA ATTGCTGAAA GCCAGCCGCC GAAAGGCAAA
AGCGGTTTGG TCACGCTCGA ACTCTGCGCC GGTGCTGGGG GTCAAGCTCT CGGGTTAGAA
CAGGCCGGTA TCAATCATGT CGCGCTCGTA GAGATAAATA AACACGCATG CGAGACACTT
CGACTTAATC GTCCCAATTG GAAAGTGGTT GAGGGCGATC TGCAAACGTT CGATCCTTCC
CCGTACAAGG GCGCTGACAT TGTCTCGGCA GGATTGCCAT GCCCCCCCTT TTCTGTTGCC
GGAAAGCAGT TGGGAAAGTT GGACGAGAGA AATCTCTTTC CGGCGATGGT GAATGTCGTC
GACGCGGTAA GACCGCGAGC CGTTATGGTG GAGAATGTTC GTGGCATTCT TGATGCGGTA
TTCATTGACT ATCGCGAGCA CGTGAGCAAG CAGCTGCGAA AACTTGGATA TACCCCGGGT
TGGCATTTGA TGAACGCCTG CGAATTCGGA GTTCCTCAAC TTCGGCCGCG GGTTGTATTC
GTAGCGATGC GAAAGGAGTA TTCCGAGCAC TTCGCTTGGC CGCGCGCGAC TAACGAGCCT
CAGACGGTCG GAGATGTATT ATTCGACTTG ATGAGTGCGC GCGGCTGGAA AGGTGTGAAA
GCTTGGCGCG CGAAAGCAAA CGAGATTGCG CCAACGATTG TAGGGGGATC CCTGAAGCAC
GGCGGCCCAG ATCTTGGTCC CACGAGAGCA CGCCGCGCCT GGGAGGCACT CGGAGTGGAC
GGGAAGGGGA TCGCGGACGA TGTACCGGAG CGTGAGTTCG TAGGTATGCC CCGTCTCACT
GTTCGTATGG TCGCGCGCAT TCAGGGTTTT CCCGATGAAT GGCAGTTCGC GGGCAGGAAA
ACGCAAGCGT ACCGCCAGGT TGGAAATGCT TTTCCGCCGC CTTTCGCTCG TGCAGTTGCG
GAAAGCGTGA GTGCTTGCTT GTCGTCCGCA CGAAGGACAG TCCGAGTCAC CAGTGCTTAA
 
Protein sequence
MLKSRFMKRR DSDSAPFLLE VGYEAENPWR IAESQPPKGK SGLVTLELCA GAGGQALGLE 
QAGINHVALV EINKHACETL RLNRPNWKVV EGDLQTFDPS PYKGADIVSA GLPCPPFSVA
GKQLGKLDER NLFPAMVNVV DAVRPRAVMV ENVRGILDAV FIDYREHVSK QLRKLGYTPG
WHLMNACEFG VPQLRPRVVF VAMRKEYSEH FAWPRATNEP QTVGDVLFDL MSARGWKGVK
AWRAKANEIA PTIVGGSLKH GGPDLGPTRA RRAWEALGVD GKGIADDVPE REFVGMPRLT
VRMVARIQGF PDEWQFAGRK TQAYRQVGNA FPPPFARAVA ESVSACLSSA RRTVRVTSA
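
The protein sequence can be derived from the gene sequence with translation table 11, and the GC content can be recomputed in the same way. The sketch below assumes the sequence is pasted as plain text with the space-separated blocks of ten shown above, and that the CDS starts with ATG (alternative start codons, which table 11 also allows, are not handled). The helper names are hypothetical. The amino acid string follows the NCBI codon ordering with bases taken in the order T, C, A, G.

```python
from itertools import product

# Codon table for NCBI translation table 11; codons are enumerated in
# TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
fragment = "ATGCTAAAAT CCCGATTTAT GAAGCGCCGA"
print(translate(fragment))  # MLKSRFMKRR
print(gc_percent(fragment))
```

Passing the full gene sequence to translate and gc_percent allows the listed protein sequence and GC content to be checked against the record.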