Gene B21_04175 details


Gene Information

Locus tag: B21_04175
Symbol: mcrC
ID: 8114995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4481311
End bp: 4482357
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 38%
IMG OID: 644850317
Product: hypothetical protein
Protein accession: YP_003001890
Protein GI: 251787586
COG category: [V] Defense mechanisms
COG ID: [COG4268] McrBC 5-methylcytosine restriction system component
TIGRFAM ID:
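
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(4481311, 4482357)
print(length, protein_length_aa(length))
```

With the Start bp and End bp listed in this record, the computed values match the stated Gene Length (1047 bp) and Protein Length (348 aa).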


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAACAGC CCGTGATACC TGTCCGTAAT ATCTATTACA TGCTTACCTA TGCATGGGGT 
TATTTACAGG AAATTAAGCA GGCCGATCTT GAAGCCATTC CCGGTAACAA TCTTCTTGAT
ATCCTGGGGT ATGTATTAAA TAAAGGGGTT TTACAGCTTT CACGCCGAGG GCTTGAGCTT
GATTACAATC CTAATACCGA GATCATTCCT GGCATCAAAG GGCGAATAGA GTTTGCTAAA
ACAATACGCG GCTTCCATCT TAATCATGGG AAAACCGTCA GTACTTTTGA TCTGCTTAAT
GAAGATACGC TGGCTAACCG AATTATAAAA AGCACATTAG CCATGTTAAT TAAGCATGAA
AAGTTAAACT CAACCATCAG AGATGAAGCT CGTTCACTTT ATAGAAAATT ACCGGGCATT
AGCACTCTTC ATTTAACTCC GCAGCATTTC AGCTATCTGA ATGGCGGAAA GAACACGCGT
TATTATAAAT TTGTTATCAG CGTCTGTAAG TTCATCGTCA ATAATTCTAT CCCAGGTCAA
AACAAAGGAC ACTACCGTTT CTATGATTTT GAAAGAAACG AAAAAGAGAT GTCATTACTT
TATCAAAAGT TTCTTTTTGA ATTTTGCCGC CGTGAATTAA CGTCTGCAAA TACAACCCGC
TCTTATTTAA AATGGGATGC ATCGAGCATA TCGGATCAGT CACTTAATTT GTTACCTCGA
ATGGAAACTG ACATCACCAT TCGCTCATCA GAAAAAATAC TTATCGTTGA CGCCAAATAC
TATAAGAGCA TTTTTTCACG ACGAATGGGC TCAGAAAAAT TTCACTCTCA AAATCTTTAT
CAACTGATGA ATTACTTATG GTCGTTAAAA CCTGAAAATG GCGAAAACAT AGGGGGTTTA
TTAATATACC CCCACGTAGA CACCGCAGTG AAACATCGTT ATAAAATTAA TGGCTTCGAT
ATTGGCCTGT GTACCGTCAA TTTAGGTCAG GAATGGCCGT GTATACATCA AGAATTACTC
GCCATTTTCG ATGAATATCT CAAATAA
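
A GC content figure like the one reported for this gene can be recomputed from a sequence in this grouped layout. The following is a minimal sketch, assuming the sequence is pasted as text with spaces and line breaks between 10-base groups. `gc_percent` is a hypothetical helper name, and the short input is only the first group of the gene sequence, not the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base group of the gene sequence above, used only as a small demo.
print(gc_percent("GTGGAACAGC"))
```

Passing the complete gene sequence gives the gene-wide value to compare against the GC content field.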
 
Protein sequence
MEQPVIPVRN IYYMLTYAWG YLQEIKQADL EAIPGNNLLD ILGYVLNKGV LQLSRRGLEL 
DYNPNTEIIP GIKGRIEFAK TIRGFHLNHG KTVSTFDLLN EDTLANRIIK STLAMLIKHE
KLNSTIRDEA RSLYRKLPGI STLHLTPQHF SYLNGGKNTR YYKFVISVCK FIVNNSIPGQ
NKGHYRFYDF ERNEKEMSLL YQKFLFEFCR RELTSANTTR SYLKWDASSI SDQSLNLLPR
METDITIRSS EKILIVDAKY YKSIFSRRMG SEKFHSQNLY QLMNYLWSLK PENGENIGGL
LIYPHVDTAV KHRYKINGFD IGLCTVNLGQ EWPCIHQELL AIFDEYLK
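
The gene sequence begins with GTG, yet the protein sequence begins with M. This follows from translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this in plain Python with no external libraries. It assumes the sense-codon assignments of the standard genetic code (which table 11 shares) and the initiation codons listed for NCBI table 11. `translate_cds` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons as listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS; an alternative start codon in first position is read as M."""
    seq = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate_cds("GTGGAACAGC CCGTGATACC TGTCCGTAAT"))  # MEQPVIPVRN
```

The printed residues match the first ten residues of the protein sequence above. Translating the full gene sequence the same way allows a comparison against the entire protein record.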