Gene Arth_0863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0863 
Symbol 
ID4446631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp936316 
End bp937572 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content68% 
IMG OID639688670 
ProductMcrBC 5-methylcytosine restriction system component-like protein 
Protein accessionYP_830361 
Protein GI116669428 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCCCC GGCCGGCGCG CCAGCCCTCC AGTGCCGGCA AGGCCGTCCG CCACATCGTG 
CTGGATGAGC TGTCCGGCGG CGTGGTGGAC AAGCTGGACC CCGCCAGCGC CGCGTTCGTG
AACGCCAGCG GCCTGGCCAA GGCGTCTCCC ATGGGGATGG GCCTTTTCAG GATCGAGCCG
GTGGGCATGG TCGGGTCGGT GCGGACGCCG ACCGTGCAGC TGGAAGTGCG GCCCAAGGAC
CGCCTGGGCC TCAGCCGGCT GCTGTTCCTC CTCAGCTATG CGGGAAACCA GGGCTTCCGT
GAAGACCCGG TGGCCGCCGT CGAGGACCCG GATCTGTGGA GCGCGCTCGC CGTGTCGCTG
GTGCAGCTGG CAGACCGTGC CCTGAGCCGC GGAGTCCTCC AGGGCTACCT CACCGTGGAC
GAATCACTGC GGACGGTGAA GGGCCGGATC CGGATCTCGG ACCAGATTTC GCGCCGGCCC
GGAATGCTTG TGCCGCTGGA AGTTTCCTAT GACGAATTCA CTGAAGACAT TGCCGAGAAC
CGCATCCTGC GGGCCGCGCT GGAGCGCATG GCCCGGGTCC CGCGCGTTCG GCCGGACGTC
CAGAGCCGCC TGCGGCTGCT CTTGGGAAAG CTCGACGCCG TCACGCGCCT TCGGCCCGGA
GCGCCACTTC CGCCGTGGCA GGCCACCAGG ATGAACACCC GGTACCACGC GGTGCTCCGC
CTGTCCGAAG TGATCCTGCG CAACGCCTCA GCCGAGGCGG GAGACGGCAA GCAGCAGACG
GCGTCGTTCG TGGTGGACAT GGGGCAGGTC TTCGAGGACT TTGTGGGCAC GGCCCTCCGC
GAAGCCATGA CGGCCTATCC CGGCGAGATG CGGCTGCAGT ACAACGCCCT GCTGAACGAG
GCCGTGCGTG ATTCCGACCG GCTCACGGTG AATCCGGACG CGGTGCACCT GCTTGGCGGC
CGTCCCGTGG TGGTCTACGA CACGAAGTAC CGGGCCGCGA CTGACCAGGG CGCATCCCTG
TCGGCGGACC ATTTCCAGAT GCTGGCCTAC TGCACGGCCC TACGCGTACC GACCGCCTGG
CTGGTGTACG CAGGCGCGGG GGAAATGAAG CTCCGCCGCA TCCTTAATAC GGATATCGAC
GTGGTGGAGT ACCCGCTTGA TCTTTCCCTG CCGCCGTCGG ACATCCTGGC AGCGGTTGCC
GACCTGGCAC AGCAGTCCTG GGGCGAAGTG GTGCGGCAGG CAGGACTTAA TCAATGA
 
Protein sequence
MPPRPARQPS SAGKAVRHIV LDELSGGVVD KLDPASAAFV NASGLAKASP MGMGLFRIEP 
VGMVGSVRTP TVQLEVRPKD RLGLSRLLFL LSYAGNQGFR EDPVAAVEDP DLWSALAVSL
VQLADRALSR GVLQGYLTVD ESLRTVKGRI RISDQISRRP GMLVPLEVSY DEFTEDIAEN
RILRAALERM ARVPRVRPDV QSRLRLLLGK LDAVTRLRPG APLPPWQATR MNTRYHAVLR
LSEVILRNAS AEAGDGKQQT ASFVVDMGQV FEDFVGTALR EAMTAYPGEM RLQYNALLNE
AVRDSDRLTV NPDAVHLLGG RPVVVYDTKY RAATDQGASL SADHFQMLAY CTALRVPTAW
LVYAGAGEMK LRRILNTDID VVEYPLDLSL PPSDILAAVA DLAQQSWGEV VRQAGLNQ