Gene Dshi_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3643 
Symbol 
ID5714173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009955 
Strand
Start bp42877 
End bp44031 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content64% 
IMG OID641276561 
Productlytic transglycosylase catalytic 
Protein accessionYP_001541857 
Protein GI159046185 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0752757 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGTGTA GGCTGCGTCA TATCCTGGCC CTTTGCCTGC TGCCTGGCGC CGCCTTTTCC 
CAAGGTGTGC CGACCAATGA CAGCGGGTTG ACCGCGCGCG ATATCGTCGA GACCGGCGAT
CGAGAGGCCG ACCTCGCCAT TCAAGCGGAC AAGCTCTCGG TGCGAGAACT CATCGCCGAA
ATCGAACGGG AGCAATTGGC CACCCTGCAG CGCATCCTCG ATGCCCAGAG CAGCTTCGGC
GGTCAGGGCC TGCCGGCCAT GGTCTCCGGG CTGGAAAGTG GCAGCGGCGA TCCCGACCGC
GCCGTGGAAG CCGTCTATGG CACGGGCGAG ATCGATCCCA ATCGCGGCGG TGCGCAGATG
TTCGGGGATG CGTCTGAGAA CATCGAGCAA CTCATCATCC GCGTCGCCCA GGAAACCAGC
GGCTTTGCCG GCGTTGGCCG CGCGGGCCTC TCCCCGGTTC AATGGCGCGC ACTCCTACAG
GCGCTCATCT GGCAGGAAAG CCGGTTTACC ATCGGCGCGC GCTCGCCGGT CGGCGCCTTT
GGCCTCACCC AGATCATGCC TGGGACGGCC AGCGATCTCG GCATCAACCC GGATTACTAT
GACAGCCCCT ACCTGCAGGT GCATGGCGGC GCGCGCTATC TCGCCACCCA GCTCAACACC
TTCGATGGCA ACATCATCAA CGCCCTCGCG GCCTATAATG CCGGCCCCGG CCGGGTCTTT
GAATATGGCG GGGTCCCACC TTTCCGCGAA ACCCAGCACT ACGTCCAGGT CATCCCCGAG
CGCTACAATC TCTATCTGAG CCGCATCGGC GGGATCGAAG CGCTCGGAAC GATCGATCCG
GCGCTTCTGG CCAATGCCAA CCTCTCGATC ACCGGGCATG GCGCGGCCTT CTATGGCAGC
AACTCCCCCG CTGCGATCAG GCAGGCCGCC CTGCGCATTG CGGATATCGT CGAGCGGATT
TCTGAGACGG AAGACATGCA GGAAAGCGTC GCGCTCAACA CCTATGCCCG CGCCGAACTC
GTGCGTCTCG TCGCCGCGCG TATTCGTCTT CAAGCGGCAC GGACCCGCGT GCTTTCGGCC
GAGGAATTGG CACAGGCCAG CGCCCGCATG GCCGAGGGCG CATTCATGGA ATTCACGATC
AGGGAGATAG ACTGA
 
Protein sequence
MKCRLRHILA LCLLPGAAFS QGVPTNDSGL TARDIVETGD READLAIQAD KLSVRELIAE 
IEREQLATLQ RILDAQSSFG GQGLPAMVSG LESGSGDPDR AVEAVYGTGE IDPNRGGAQM
FGDASENIEQ LIIRVAQETS GFAGVGRAGL SPVQWRALLQ ALIWQESRFT IGARSPVGAF
GLTQIMPGTA SDLGINPDYY DSPYLQVHGG ARYLATQLNT FDGNIINALA AYNAGPGRVF
EYGGVPPFRE TQHYVQVIPE RYNLYLSRIG GIEALGTIDP ALLANANLSI TGHGAAFYGS
NSPAAIRQAA LRIADIVERI SETEDMQESV ALNTYARAEL VRLVAARIRL QAARTRVLSA
EELAQASARM AEGAFMEFTI REID