Gene B21_04176 details

Gene Information

Locus tag: B21_04176
Symbol: mcrB
ID: 8114164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4482357
End bp: 4483736
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 40%
IMG OID: 644850318
Product: hypothetical protein
Protein accession: YP_003001891
Protein GI: 251787587
COG category: [V] Defense mechanisms
COG ID: [COG1401] GTPase subunit of restriction endonuclease
TIGRFAM ID:
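
The start, end, gene length, and protein length fields in this record are consistent with one another, which can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a final stop codon that is not counted in the protein length; both are assumptions (they match the reported values), and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4482357, 4483736)
print(length)                  # 1380
print(protein_length(length))  # 459
```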


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATCAA TTCAAGCCTG GATTGAAAAA TTTATTGAGC AAGCACAGCA AAAAAGTTCA 
CAATCCACCA AAGATTATCC AACGTCTTAC CGTAACCTGC GAGTAAAAGT GAGTTTCGGT
TATGGCAATT TTACATCTAT TCCCTGGTTT GCATTTCTGG GAGAAGGTCA GGAAGTTTCT
AACGGTATAT ATCCCGTTAT TCTCTATTAT AAAGATTTTG ATGAGTTGGT TTTGGCTTAT
GGTATAAGCG ACACGAATAA ACCACATGCC CAATGGCAGT TCTCTTCAGA CATACCTAAA
ACAATCGCAG AGTATTTCCA GACAACTTCA GGTGTTTATC CTAAAAAATA CGGACAGTCC
TATTACGCCT GTTCCCAAAA AGTCTCACAG GGTCTTGATT ATACCCGGTT TGCCTCCATG
CTGGACAACA TAATCAACGA CTATAAATTA ATATTTAATT CTGGCAAGAG TGTTATTCCA
CCTATGTCAA AAACTGAATC ATACTGTCTG GAAGATGCGT TAAATGATTT GTTTATCCCT
GAAACCACGA TAGAGACGAT ACTCAAACGA TTAACCATCA AAAAAAATAT TATCCTCCAG
GGGCCGCCCG GCGTTGGAAA AACCTTTGTT GCACGCCGTC TGGCTTACCT GCTGACAGGA
GAAAAGGCTC CGCAACGCGT CAATATGGTT CAGTTCCATC AATCTTATAG CTATGAGGAT
TTTATACAGG GCTATCGTCC GAATGGCGTC GGCTTCCGAC GTAAAGACGG CATATTTTAC
AATTTTTGTC AGCAAGCTAA AGAGCAGCCA GAGAAAAAGT ATGTTTTTAT TATAGATGAA
ATCAATCGTG CCAATCTCAG TAAAGTATTT GGCGAAGTGA TGATGTTAAT GGAACATGAT
AAACGAGGTG AAAACTGGTC TGTTCCCTTA ACCTATTCCG AAAACGATGA AGAACGATTC
TATGTCCCGG AGAATGTTTA TATTATCGGT TTAATGAATA CTGCCGATCG CTCTCTGGCC
GTTGTTGATT ATGCCCTGCG CAGACGATTT TCTTTCATAG ATATTGAGCC TGGTTTTGAT
ACACCACAGT TCCGTAATTT TTTACTGAAT AAAAAAGCAG AACCTTCATT TGTTGAGTCT
TTATGCCAAA AAATGAATGA GTTAAACCAG GAAATCAGCA AAGAGGCCAC TATCCTTGGG
AAAGGATTCC GCATTGGGCA TAGTTACTTC TGCTCCGGGT TGGAAGATGG CACCTCTCCT
GATACGCAAT GGCTTAAGGA AATTGTGATG ACGGATATCG CCCCTTTACT CGAAGAATAT
TTCTTTGATG ACCCCTATAA ACAACAGATA TGGGCCGACA AATTATTAGG TGACTCATAG
 
Protein sequence
MESIQAWIEK FIEQAQQKSS QSTKDYPTSY RNLRVKVSFG YGNFTSIPWF AFLGEGQEVS 
NGIYPVILYY KDFDELVLAY GISDTNKPHA QWQFSSDIPK TIAEYFQTTS GVYPKKYGQS
YYACSQKVSQ GLDYTRFASM LDNIINDYKL IFNSGKSVIP PMSKTESYCL EDALNDLFIP
ETTIETILKR LTIKKNIILQ GPPGVGKTFV ARRLAYLLTG EKAPQRVNMV QFHQSYSYED
FIQGYRPNGV GFRRKDGIFY NFCQQAKEQP EKKYVFIIDE INRANLSKVF GEVMMLMEHD
KRGENWSVPL TYSENDEERF YVPENVYIIG LMNTADRSLA VVDYALRRRF SFIDIEPGFD
TPQFRNFLLN KKAEPSFVES LCQKMNELNQ EISKEATILG KGFRIGHSYF CSGLEDGTSP
DTQWLKEIVM TDIAPLLEEY FFDDPYKQQI WADKLLGDS
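
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming NCBI translation table 11 as listed under Gene Information, whose codon-to-amino-acid assignments are the same as the standard code. The helper name `translate` is hypothetical. The whitespace that separates the 10-base blocks is stripped before translating, and the input is assumed to start in frame.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence above.
print(translate("ATGGAATCAA TTCAAGCCTG GATTGAAAAA"))  # MESIQAWIEK
print(translate("TGGGCCGACA AATTATTAGG TGACTCATAG"))  # WADKLLGDS
```

Table 11 also permits alternative start codons, which are read as methionine when they initiate translation. This gene starts with ATG, so the sketch does not need to handle that case.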