Gene ECD_00971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00971 
SymbolyccW 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1033868 
End bp1034971 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content52% 
IMG OID 
Productpredicted methyltransferase 
Protein accessionACT42866 
Protein GI253977196 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.406744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGGTA AAGCCAGCCT CGGTGAAACC ATCGATATTG TTGATCATCA GGGAAAATGG 
TTAGCACGCG GCGCTTATTC GCCAGCTTCG CAAATCCGGG CGCGCGTCTG GACGTTTGAC
CCGTCTGAGT CTATCGACAT TGCTTTTTTT TCCCGCCGTT TGCAACAAGC ACAAAAATGG
CGTGACTGGC TGGCGCAAAA AGATGGCCTC GACAGCTATC GTTTAATCGC CGGAGAATCT
GATGGCCTGC CGGGTATTAC TATCGATCGT TTCGGTAATT TTCTGGTGCT GCAACTGCTG
AGTGCTGGGG CAGAATATCA GCGCGCGGCA TTAATTAGTG CCCTGCAAAC GCTGTACCCG
GAATGTTCGA TTTACGATCG CAGCGACGTC GCGGTACGTA AAAAAGAAGG AATGGAGCTG
ACCCAGGGCC CCGTCACCGG CGAGTTGCCA CCTGCCCTGC TGCCGATTGA AGAACACGGA
ATGAAACTGC TGGTGGATAT TCAGCACGGA CACAAAACGG GCTACTACCT GGACCAGCGT
GATAGCCGCC TGGCTACCCG CCGCTACGTT GAAAATAAAC GTGTGCTGAA CTGTTTCTCC
TATACCGGTG GTTTCGCCGT ATCGGCACTG ATGGGCGGTT GCAGCCAGGT TGTCAGCGTT
GATACCTCCC AGGAAGCGCT GGATATTGCA CGGCAGAACG TTGAGCTGAA CAAACTGGAT
CTGAGCAAGG CTGAGTTTGT CCGTGATGAT GTCTTTAAAT TGCTGCGTAC TTATCGCGAT
CGCGGTGAAA AATTTGACGT TATCGTGATG GACCCGCCGA AGTTTGTTGA GAATAAAAGC
CAGTTGATGG GCGCGTGTCG TGGTTATAAA GACATCAACA TGCTGGCGAT TCAGCTGCTG
AATGAAGGCG GTATTCTCCT GACTTTCTCC TGTTCCGGTC TGATGACCAG CGATTTATTT
CAGAAAATCA TCGCGGATGC CGCAATTGAT GCCGGTCGTG ATGTACAATT TATAGAGCAG
TTCCGTCAGG CAGCCGATCA TCCGGTGATC GCTACCTATC CGGAAGGGCT ATATCTGAAA
GGGTTTGCCT GTCGCGTCAT GTAA
 
Protein sequence
MEGKASLGET IDIVDHQGKW LARGAYSPAS QIRARVWTFD PSESIDIAFF SRRLQQAQKW 
RDWLAQKDGL DSYRLIAGES DGLPGITIDR FGNFLVLQLL SAGAEYQRAA LISALQTLYP
ECSIYDRSDV AVRKKEGMEL TQGPVTGELP PALLPIEEHG MKLLVDIQHG HKTGYYLDQR
DSRLATRRYV ENKRVLNCFS YTGGFAVSAL MGGCSQVVSV DTSQEALDIA RQNVELNKLD
LSKAEFVRDD VFKLLRTYRD RGEKFDVIVM DPPKFVENKS QLMGACRGYK DINMLAIQLL
NEGGILLTFS CSGLMTSDLF QKIIADAAID AGRDVQFIEQ FRQAADHPVI ATYPEGLYLK
GFACRVM