Gene Acid345_3753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3753 
Symbol 
ID4069328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4431985 
End bp4433040 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content48% 
IMG OID637985775 
Product5-methylcytosine-specific restriction enzyme subunit McrC 
Protein accessionYP_592827 
Protein GI94970779 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.537146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATCC CTGTCGCAAA CGTCTACTAC CTCCTTTGTT ATGCTTGGGA CAAACTGGAA 
GAGAGAGATC TTGTCGATAT TCATCCGACC GAAGAAACGG ATTTGGTGAA CCTGTTCGCC
CGCGTTCTGA CCAACGGTAT CGACCATTTG CTGAAGAAAG GCATCGATCG TGGGTACCTA
CTTCATAGCG AAGAATCCTG CGTGTTGCGT GGGAGGATCG ATTTTCCACA ATCGATAAAA
CACATGCTCT TTCAGCGAGC GCAGGCGCAT TGCGAGTTTG ATGAATTGAG TTTTGATGTG
CTGCATAACC GTATTCTGAA ATCGACGATT ATGCGCTTAA TTAGGACTCG TGATCTAGAT
TCAGGAATTC GAGATCGTTT GCTTTTCCAA TATCGCTACT TTGCGGAGGT CGGGGACCTC
GATTTATCGG TTCAGATATT TGGCAAAGTA CAGCTTTACC GTAACAACCA CTTTTATGAT
TTCCTTTTAA GGGTCTGCGC GCTGCTTTTT GAGAATCTGC TTCCGACCCA AGAGCCTGGA
AATTGGCGGT TTAGGTCGTT CTTGCAGAAT CGGGAACAGA TGGCGTATGT CTTTGAGCGT
TTCGTACGCA ACTTTTACAA GAGGGAACTA CCAAGCGTGA GAGTTGACGG GCGATGCAAA
GTCAAGCGGG AGGACATAAA TTGGGGCATG ACACCTTCAG ACGACCTCAG CTCAGCTCTG
CTTCCCAAAA TGCAAACCGA TGTGTGTATC ACCACCGAGG CAAAAAGGAT CTTGGTTGAA
TGCAAATACG TCGATGATCC TCTTGAGCAG CGGGAGGAGA TGGCCCCGAA GCTGATTACT
ACTCATCTTT ACCAAGTGAA TGCTTACCTG GACAACTGGC CCGATTTGCC ACTCTACCGC
TCTTCCCGCG CCATATTGTT ATATCCACTC GCTACACGTC CGATCGCTGT CGAGTTCACT
CGTGCCGACG GGCAGCTCCT CAGCGTACGC ACCCTCAATT TGGCCCAGCA ATGGTCTGCT
ATTCACCAAG ACCTTCTTAG ATTGGTGGAC AATTGA
 
Protein sequence
MEIPVANVYY LLCYAWDKLE ERDLVDIHPT EETDLVNLFA RVLTNGIDHL LKKGIDRGYL 
LHSEESCVLR GRIDFPQSIK HMLFQRAQAH CEFDELSFDV LHNRILKSTI MRLIRTRDLD
SGIRDRLLFQ YRYFAEVGDL DLSVQIFGKV QLYRNNHFYD FLLRVCALLF ENLLPTQEPG
NWRFRSFLQN REQMAYVFER FVRNFYKREL PSVRVDGRCK VKREDINWGM TPSDDLSSAL
LPKMQTDVCI TTEAKRILVE CKYVDDPLEQ REEMAPKLIT THLYQVNAYL DNWPDLPLYR
SSRAILLYPL ATRPIAVEFT RADGQLLSVR TLNLAQQWSA IHQDLLRLVD N